ROLE OF AUDIO IN VIDEO SUMMARIZATION

被引:0
|
作者
Shoer, Ibrahim [1 ]
Kopru, Berkay [1 ]
Erzin, Engin [1 ]
机构
[1] Koc Univ, Coll Engn, Multimedia Vis & Graph Grp, KUIS AI Lab, Istanbul, Turkiye
关键词
Audio-visual video summarization; canonical correlation analysis;
D O I
10.1109/ICASSPW59220.2023.10192578
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Video summarization attracts attention for efficient video representation, retrieval, and browsing to ease volume and traffic surge problems. Although video summarization mostly uses the visual channel for compaction, the benefits of audio-visual modeling appeared in recent literature. The information coming from the audio channel can be a result of audio-visual correlation in the video content. In this study, we propose a new audio-visual video summarization framework integrating four ways of audio-visual information fusion with GRU-based and attention-based networks. Furthermore, we investigate a new explainability methodology using audio-visual canonical correlation analysis (CCA) to better understand and explain the role of audio in the video summarization task. Experimental evaluations on the TVSum dataset attain F1 score and Kendall-tau score improvements for the audio-visual video summarization. Furthermore, splitting video content on TVSum and COGNIMUSE datasets based on audio-visual CCA as positively and negatively correlated videos yields a strong performance improvement over the positively correlated videos for audio-only and audio-visual video summarization.
引用
收藏
页数:5
相关论文
共 50 条
  • [31] Video summarization for large sports video archives
    Takahashi, Y
    Nitta, N
    Babaguchi, N
    2005 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), VOLS 1 AND 2, 2005, : 1171 - 1174
  • [32] AUDIO SALIENT EVENT DETECTION AND SUMMARIZATION USING AUDIO AND TEXT MODALITIES
    Zlatintsi, Athanasia
    Iosif, Elias
    Maragos, Petros
    Potamianos, Alexandros
    2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 2311 - 2315
  • [33] AUDIO-VIDEO CORRESPONDENCE AND ITS ROLE IN ATTENTION AND MEMORY
    GRIMES, T
    ETR&D-EDUCATIONAL TECHNOLOGY RESEARCH AND DEVELOPMENT, 1990, 38 (03): : 15 - 25
  • [34] Action based Video Summarization
    Raksha, H.
    Namitha, G.
    Sejal, N.
    PROCEEDINGS OF THE 2019 IEEE REGION 10 CONFERENCE (TENCON 2019): TECHNOLOGY, KNOWLEDGE, AND SOCIETY, 2019, : 457 - 462
  • [35] A Framework for Scalable Summarization of Video
    Herranz, Luis
    Martinez, Jose M.
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2010, 20 (09) : 1265 - 1270
  • [36] Video Summarization of Surveillance Cameras
    Lai, Po Kong
    Decombas, Marc
    Moutet, Kelvin
    Laganiere, Robert
    2016 13TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS), 2016, : 286 - 294
  • [37] Rushes video summarization and evaluation
    Dumont, Emilie
    Merialdo, Bernard
    MULTIMEDIA TOOLS AND APPLICATIONS, 2010, 48 (01) : 51 - 68
  • [38] Camera Network Video Summarization
    Panda, Rameswar
    Roy-Chowdhury, Amit K.
    REAL-TIME IMAGE AND VIDEO PROCESSING 2017, 2017, 10223
  • [39] Video Summarization with Supervised Learning
    Basak, Jayanta
    Luthra, Varun
    Chaudhury, Santanu
    19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 863 - +
  • [40] Video Lecture Summarization System
    Agrawal, Sujal
    Tirpude, Shubhangi
    INTERNATIONAL JOURNAL OF NEXT-GENERATION COMPUTING, 2022, 13 (05): : 1091 - 1097