Video Summarization Based on Multimodal Features

被引:0
|
作者
Zhang, Yu [1 ]
Liu, Ju [2 ]
Liu, Xiaoxi [1 ]
Gao, Xuesong [3 ]
机构
[1] Shandong Univ, Informat & Commun Engn, Qingdao, Peoples R China
[2] Shandong Univ, Dept Elect Engn, Qingdao, Peoples R China
[3] Hisense Grp, Qingdao, Peoples R China
关键词
Feature Fusion; Information Science; LSTM; Multimedia Processing; Multimodal Features; Video Summarization;
D O I
10.4018/IJMDEM.2020100104
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In this manuscript, the authors present a keyshots-based supervised video summarization method, where feature fusion and LSTM networks are used for summarization. The framework can be divided into three folds: 1) The authors formulate video summarization as a sequence to sequence problem, which should predict the importance score of video content based on video feature sequence. 2) By simultaneously considering visual features and textual features, the authors present the deep fusion multimodal features and summarize videos based on recurrent encoder-decoder architecture with bi-directional LSTM. 3) Most importantly, in order to train the supervised video summarization framework, the authors adopt the number of users who decided to select current video clip in their final video summary as the importance scores and ground truth. Comparisons are performed with the state-of-the-art methods and different variants of FLSum and T-FLSum. The results of F-score and rank correlation coefficients on TVSum and SumMe shows the outstanding performance of the method proposed in this manuscript.
引用
收藏
页码:60 / 76
页数:17
相关论文
共 50 条
  • [41] Smart Surveillance Based on Video Summarization
    Thomas, Sinnu Susan
    Gupta, Sumana
    Subramanian, Venkatesh K.
    2017 IEEE REGION 10 INTERNATIONAL SYMPOSIUM ON TECHNOLOGIES FOR SMART CITIES (IEEE TENSYMP 2017), 2017,
  • [42] An enhanced video summarization system using audio features for a personal video recorder
    Otsuka, I
    Radhakrishnan, R
    Siracusa, M
    Divakaran, A
    Mishima, H
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2006, 52 (01) : 168 - 172
  • [43] Gesture-based video summarization
    Kosmopoulos, D
    Doulamis, A
    Doulamis, N
    2005 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), VOLS 1-5, 2005, : 3213 - 3216
  • [44] Human Based Surveillance Video Summarization
    Aydemir, M. Said
    Karsligil, M. Elif
    2013 21ST SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2013,
  • [45] Video summarization based on semantic representation
    Carlos, RP
    Uehara, K
    ADVANCED MULTIMEDIA CONTENT PROCESSING, 1999, 1554 : 1 - 16
  • [46] Video Summarization Based on Optical Flow
    Jadhav, Dipti
    Bhosle, Udhav
    ADVANCED COMPUTING AND INTELLIGENT ENGINEERING, 2020, 1082 : 333 - 342
  • [47] GVSUM: generic video summarization using deep visual features
    Madhushree Basavarajaiah
    Priyanka Sharma
    Multimedia Tools and Applications, 2021, 80 : 14459 - 14476
  • [48] Unsupervised Video Summarization via Multi-source Features
    Kanafani, Hussain
    Ghauri, Junaid Ahmed
    Hakimov, Sherzod
    Ewerth, Ralph
    PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR '21), 2021, : 466 - 470
  • [49] News Video Summarization Combining SURF and Color Histogram Features
    Liang, Buyun
    Li, Na
    He, Zheng
    Wang, Zhongyuan
    Fu, Youming
    Lu, Tao
    ENTROPY, 2021, 23 (08)
  • [50] Adaptive Features Extraction for Capsule Endoscopy (CE) Video Summarization
    Emam, Ahmed Z.
    Ali, Yasser A.
    Ben Ismail, Mohamed M.
    INTERNATIONAL CONFERENCE ON COMPUTER VISION AND IMAGE ANALYSIS APPLICATIONS, 2015,