Modality correlation-based video summarization

被引:7
|
作者
Wang, Xingrun [1 ]
Nie, Xiushan [2 ]
Liu, Xingbo [1 ]
Wang, Binze [3 ]
Yin, Yilong [4 ]
机构
[1] Shandong Univ, Sch Comp Sci & Technol, Jinan 250101, Shandong, Peoples R China
[2] Shandong Jianzhu Univ, Sch Comp Sci & Technol, Jinan 250101, Shandong, Peoples R China
[3] Changan Univ, Coll Geol Engn & Geomat, Xian 710054, Peoples R China
[4] Shandong Univ, Sch Software Engn, Jinan 250101, Shandong, Peoples R China
基金
中国国家自然科学基金;
关键词
Video summarization; Modality correlation; Modality-specific information; Attention mechanism;
D O I
10.1007/s11042-020-08690-3
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Video summarization is an important technique to help us browse, store, and retrieve a rapidly increasing amount of video data, which extracts frames or shots from the original video. Text information covers important content of a video, and thus a summarization can be generated by exploring the correlation between the frame and text. In this study, we propose a video summarization method based on the modality correlation. With this method, we first learn the correlation between the text and frame in the respective space, and then fuse two correlations to obtain the importance score of each shot. Finally, video shots that have a high importance score are chosen as the video summarization. Compared to previous methods that seldom apply text to generate the video summarization, or only use the latent common information between text and frame, the proposed method fully utilizes not only the latent common but also modality-specific information for a video summarization. Experiments were conducted on the TVSum50 dataset, and the results verify the effectiveness of our proposed approach.
引用
收藏
页码:33875 / 33890
页数:16
相关论文
共 50 条
  • [1] Modality correlation-based video summarization
    Xingrun Wang
    Xiushan Nie
    Xingbo Liu
    Binze Wang
    Yilong Yin
    Multimedia Tools and Applications, 2020, 79 : 33875 - 33890
  • [2] Feature Maps Correlation-based Video Quality Assessment
    Bakhtiari, Amir Hossein
    Mansouri, Azadeh
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (23) : 63309 - 63328
  • [3] Multi-modality Based Affective Video Summarization for Game Players
    Farooq, Sehar Shahzad
    Aziz, Abdullah
    Mukhtar, Hammad
    Fiaz, Mustansar
    Baek, Ki Yeol
    Choi, Naram
    Yun, Sang Bin
    Kim, Kyung Joong
    Jung, Soon Ki
    FRONTIERS OF COMPUTER VISION, IW-FCV 2021, 2021, 1405 : 59 - 69
  • [4] Correlation-based Interestingness Measure for Video Semantic Concept Detection
    Lin, Lin
    Shyu, Mei-Ling
    Chen, Shu-Ching
    PROCEEDINGS OF THE 2009 IEEE INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION, 2008, : 120 - +
  • [5] Adaptive sizing of tracking window for correlation-based video tracking
    Son, JG
    Lim, CW
    Choi, I
    Kim, NC
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2002, E85D (06) : 1015 - 1021
  • [6] Correlation-based and content-enhanced network for video style transfer
    Lin, Honglin
    Wang, Mengmeng
    Liu, Yong
    Kou, Jiaxin
    PATTERN ANALYSIS AND APPLICATIONS, 2023, 26 (01) : 343 - 355
  • [7] Spatial correlation-based side information refinement for distributed video coding
    Mohamed Haj Taieb
    Jean-Yves Chouinard
    Demin Wang
    EURASIP Journal on Advances in Signal Processing, 2013
  • [8] Correlation-based and content-enhanced network for video style transfer
    Honglin Lin
    Mengmeng Wang
    Yong Liu
    Jiaxin Kou
    Pattern Analysis and Applications, 2023, 26 : 343 - 355
  • [9] Spatial correlation-based side information refinement for distributed video coding
    Taieb, Mohamed Haj
    Chouinard, Jean-Yves
    Wang, Demin
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2013,
  • [10] Longitudinal Multi-modality Correlation-based PET Enhancement for Reducing Radiation Accumulation
    Qiao, X.
    Wang, H.
    Ding, W.
    Chen, G.
    Li, B.
    Huang, Q.
    EUROPEAN JOURNAL OF NUCLEAR MEDICINE AND MOLECULAR IMAGING, 2024, 51 : S758 - S759