Summarizing egocentric videos using deep features and optimal clustering

被引:11
|
作者
Sahu, Abhimanyu [1 ]
Chowdhury, Ananda S. [1 ]
机构
[1] Jadavpur Univ, Dept Elect & Telecommun Engn, Kolkata 700032, India
关键词
Egocentric video summarization; Deep features; Center-surround model; Integer Knapsack; FRAMEWORK;
D O I
10.1016/j.neucom.2020.02.099
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we address the problem of summarizing egocentric videos using deep features and an optimal clustering approach. Based on an augmented pre-trained convolutional neural network (CNN), each frame in an egocentric video is represented by deep features. An optimal clustering algorithm, based on a center-surround model (CSM) and an Integer Knapsack type formulation (IK) for K-means, termed as CSMIK K-means, is applied next to obtain the summary. In the center surround model, we compute difference in entropy and the optical flow values between the central region and that of the surrounding region of each frame. In the integer knapsack formulation, each cluster is treated as an item whose cost is assigned from the center surround model. A potential set of clusters in CSMIK K-means is obtained from the chi-square distance between color histograms of successive frames. CSMIK K-Means evaluates different cluster formations and simultaneously determines the optimal number of clusters and the corresponding summary. Experimental evaluation on four well-known benchmark datasets clearly indicate the superiority of the proposed method over several state-of-the-art approaches. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:209 / 221
页数:13
相关论文
共 50 条
  • [21] Enhanced interpretation of novel datasets by summarizing clustering results using deep-learning based linguistic models
    Natarajan, K.
    Verma, Srikar
    Kumar, Dheeraj
    APPLIED INTELLIGENCE, 2025, 55 (04)
  • [22] Summarizing Software API Usage Examples Using Clustering Techniques
    Katirtzis, Nikolaos
    Diamantopoulos, Themistoklis
    Sutton, Charles
    FUNDAMENTAL APPROACHES TO SOFTWARE ENGINEERING (FASE 2018), 2018, 10802 : 189 - 206
  • [23] Summarizing Videos using Concentrated Attention and Considering the Uniqueness and Diversity of the Video Frames
    Apostolidis, Evlampios
    Balaouras, Georgios
    Mezaris, Vasileios
    Patras, Ioannis
    PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2022, 2022, : 407 - 415
  • [24] An Epilepsy Detection Method Using Multiview Clustering Algorithm and Deep Features
    Zhan, Qianyi
    Hu, Wei
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2020, 2020
  • [25] Saliency Driven Object recognition in egocentric videos with deep CNN: toward application in assistance to Neuroprostheses
    Pérez de San Roman P.
    Benois-Pineau J.
    Domenger J.-P.
    Paclet F.
    Cataert D.
    de Rugy A.
    Benois-Pineau, Jenny (benois-p@labri.fr), 1600, Academic Press Inc. (164): : 82 - 91
  • [26] Saliency Driven Object recognition in egocentric videos with deep CNN: toward application in assistance to Neuroprostheses
    de San Roman, Philippe Perez
    Benois-Pineau, Jenny
    Domenger, Jean-Philippe
    Paclet, Florent
    Cataert, Daniel
    de Rugy, Aymar
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2017, 164 : 82 - 91
  • [27] Scene-driven Retrieval in Edited Videos using Aesthetic and Semantic Deep Features
    Baraldi, Lorenzo
    Grana, Costantino
    Cucchiara, Rita
    ICMR'16: PROCEEDINGS OF THE 2016 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2016, : 23 - 29
  • [28] Face clustering in videos using constraint propagation
    Tao, Ji
    Tan, Yap-Peng
    PROCEEDINGS OF 2008 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-10, 2008, : 3246 - 3249
  • [29] An automated approach to retrieve lecture videos using context based semantic features and deep learning
    N POORNIMA
    B SALEENA
    Sādhanā, 2020, 45
  • [30] Egocentric Hand Gesture Recognition on Untrimmed Videos Using State Activation Gate LSTMs
    Chalasani, Tejo
    Smolic, Aljosa
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2023, 13643 LNCS : 359 - 372