Summarizing egocentric videos using deep features and optimal clustering

被引:11
|
作者
Sahu, Abhimanyu [1 ]
Chowdhury, Ananda S. [1 ]
机构
[1] Jadavpur Univ, Dept Elect & Telecommun Engn, Kolkata 700032, India
关键词
Egocentric video summarization; Deep features; Center-surround model; Integer Knapsack; FRAMEWORK;
D O I
10.1016/j.neucom.2020.02.099
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we address the problem of summarizing egocentric videos using deep features and an optimal clustering approach. Based on an augmented pre-trained convolutional neural network (CNN), each frame in an egocentric video is represented by deep features. An optimal clustering algorithm, based on a center-surround model (CSM) and an Integer Knapsack type formulation (IK) for K-means, termed as CSMIK K-means, is applied next to obtain the summary. In the center surround model, we compute difference in entropy and the optical flow values between the central region and that of the surrounding region of each frame. In the integer knapsack formulation, each cluster is treated as an item whose cost is assigned from the center surround model. A potential set of clusters in CSMIK K-means is obtained from the chi-square distance between color histograms of successive frames. CSMIK K-Means evaluates different cluster formations and simultaneously determines the optimal number of clusters and the corresponding summary. Experimental evaluation on four well-known benchmark datasets clearly indicate the superiority of the proposed method over several state-of-the-art approaches. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:209 / 221
页数:13
相关论文
共 50 条
  • [31] Kinship Verification from Videos using Spatio-Temporal Texture Features and Deep Learning
    Boutellaa, Elhocine
    Lopez, Miguel Bordallo
    Ait-Aoudia, Samy
    Feng, Xiaoyi
    Hadid, Abdenour
    2016 INTERNATIONAL CONFERENCE ON BIOMETRICS (ICB), 2016,
  • [32] An automated approach to retrieve lecture videos using context based semantic features and deep learning
    POORNIMA, N.
    SALEENA, B.
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2020, 45 (01):
  • [33] Self-supervised deep subspace clustering network for faces in videos
    Qiu, Yunhao
    Hao, Pengyi
    VISUAL COMPUTER, 2021, 37 (08): : 2253 - 2261
  • [34] Self-supervised deep subspace clustering network for faces in videos
    Yunhao Qiu
    Pengyi Hao
    The Visual Computer, 2021, 37 : 2253 - 2261
  • [35] Cut-in Prediction in Egocentric Videos using Extended Environment Perception with Status Descriptors
    Bian, Jiang
    Li, Bin
    Chen, Guang
    Qu, Sanqing
    Li, Zhijun
    Zou, Tianpei
    Knoll, Alois
    2022 INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2022), 2022, : 611 - 616
  • [36] Pedestrian search in surveillance videos by learning discriminative deep features
    Zhang, Shizhou
    Cheng, De
    Gong, Yihong
    Shi, Dahu
    Qiu, Xi
    Xia, Yong
    Zhang, Yanning
    NEUROCOMPUTING, 2018, 283 : 120 - 128
  • [37] Multiple deep features learning for object retrieval in surveillance videos
    Guo, Haiyun
    Wang, Jinqiao
    Lu, Hanqing
    IET COMPUTER VISION, 2016, 10 (04) : 268 - 272
  • [38] Deep Clustering for Unsupervised Learning of Visual Features
    Caron, Mathilde
    Bojanowski, Piotr
    Joulin, Armand
    Douze, Matthijs
    COMPUTER VISION - ECCV 2018, PT XIV, 2018, 11218 : 139 - 156
  • [39] Detecting text in videos using fuzzy clustering ensembles
    Gavata, Julinda
    Qeli, Ermir
    Freisleben, Bernd
    ISM 2006: EIGHTH IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA, PROCEEDINGS, 2006, : 283 - +
  • [40] Egocentric Vision for Human Activity Recognition Using Deep Learning
    Douache, Malika
    Benmoussat, Badra Nawal
    JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2023, 19 (06): : 730 - 744