Video Summarization Based on Multimodal Features

Cited by: 0

Authors:

Zhang, Yu [1]

Liu, Ju [2]

Liu, Xiaoxi [1]

Gao, Xuesong [3]

Affiliations:

[1] Shandong Univ, Informat & Commun Engn, Qingdao, Peoples R China

[2] Shandong Univ, Dept Elect Engn, Qingdao, Peoples R China

[3] Hisense Grp, Qingdao, Peoples R China

Source:

INTERNATIONAL JOURNAL OF MULTIMEDIA DATA ENGINEERING & MANAGEMENT | 2020 / Vol. 11 / No. 4

Keywords:

Feature Fusion; Information Science; LSTM; Multimedia Processing; Multimodal Features; Video Summarization;

DOI:

10.4018/IJMDEM.2020100104

Chinese Library Classification (CLC) number:

TP31 [Computer Software];

Discipline classification code:

081202 ; 0835 ;

Abstract:

In this manuscript, the authors present a keyshot-based supervised video summarization method that uses feature fusion and LSTM networks. The framework has three parts: 1) The authors formulate video summarization as a sequence-to-sequence problem, in which the importance score of video content is predicted from the video feature sequence. 2) By considering visual and textual features together, the authors present deep-fusion multimodal features and summarize videos with a recurrent encoder-decoder architecture built on a bi-directional LSTM. 3) Most importantly, to train the supervised video summarization framework, the authors adopt the number of users who selected the current video clip for their final video summary as the importance score and ground truth. Comparisons are performed with state-of-the-art methods and with different variants of FLSum and T-FLSum. The F-score and rank correlation coefficient results on TVSum and SumMe show the outstanding performance of the proposed method.
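The ground-truth construction in point 3 and the F-score evaluation can be illustrated with a small sketch. It is not the authors' implementation: the function names, the binary per-clip selection format, and the toy data are assumptions made for illustration, and the F-score here is a simplified clip-level version, not necessarily the protocol the paper uses on TVSum and SumMe.

```python
from typing import List, Sequence


def importance_from_user_summaries(
    user_summaries: Sequence[Sequence[int]],
) -> List[int]:
    """Count, per clip, how many users kept that clip in their summary.

    user_summaries[u][t] is 1 if user u selected clip t, else 0
    (an assumed input format, not one specified by the abstract).
    """
    if not user_summaries:
        return []
    n_clips = len(user_summaries[0])
    if any(len(s) != n_clips for s in user_summaries):
        raise ValueError("all user summaries must cover the same clips")
    return [sum(s[t] for s in user_summaries) for t in range(n_clips)]


def f_score(predicted: Sequence[int], reference: Sequence[int]) -> float:
    """Harmonic mean of precision and recall over selected clips."""
    overlap = sum(1 for p, r in zip(predicted, reference) if p and r)
    n_pred = sum(1 for p in predicted if p)
    n_ref = sum(1 for r in reference if r)
    if overlap == 0 or n_pred == 0 or n_ref == 0:
        return 0.0
    precision = overlap / n_pred
    recall = overlap / n_ref
    return 2 * precision * recall / (precision + recall)


# Hypothetical toy data: 3 users, 5 clips.
users = [
    [1, 0, 1, 0, 0],
    [1, 1, 0, 0, 0],
    [1, 0, 1, 0, 1],
]
scores = importance_from_user_summaries(users)  # [3, 1, 2, 0, 1]
```

In a supervised setup of the kind the abstract describes, per-clip counts like `scores` would serve as regression targets for the sequence model, and a predicted summary would be compared against each user's selection with `f_score`.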


Pages: 60-76

Page count: 17
