An Audio-video Summarization Scheme Based on Audio and Video Analysis

Cited by: 0
Authors
Furini, Marco [1]
Ghini, Vittorio [2]
Affiliations
[1] Univ Piemonte Orientale, Dept Comp Sci, I-15100 Alessandria, Italy
[2] Univ Bologna, Dept Comp Sci, I-40127 Bologna, Italy
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
The availability of video files on the Internet is growing at an exceptional speed, and in the near future video browsing will be a common activity. To facilitate this activity, it will be necessary to have a small clip for any given video. Currently, video skimming and video summarization techniques can reduce the temporal representation of a given video. However, most of these techniques do not include audio in the produced summaries. Here, we propose a mechanism that, using audio and video analysis, produces video summaries coupled with intelligible audio. Experimental results show that the summaries are substantially shorter than the originals (a reduction of up to 50%) and that the perceived video quality may be comparable to that of the original video (in terms of jerkiness). Consumer satisfaction has been investigated through mean opinion score (MOS) tests, and the results show that our summaries can be considered an alternative to the original videos.
Pages: 1209 / +
Page count: 2
Related papers
50 in total
  • [31] Audio to audio-video speech conversion with the help of phonetic knowledge integration
    Bothe, HH
    SMC '97 CONFERENCE PROCEEDINGS - 1997 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5: CONFERENCE THEME: COMPUTATIONAL CYBERNETICS AND SIMULATION, 1997, : 1632 - 1637
  • [32] Kalman filters for audio-video source localization
    Gehrig, T
    Nickel, K
    Ekenel, HK
    Klee, U
    McDonough, J
    2005 WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2005, : 118 - 121
  • [33] Performance Enhancement for Audio-Video Proxy Server
    Kanrar, Soumen
    Mandal, Niranjan Kumar
    PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON FRONTIERS OF INTELLIGENT COMPUTING: THEORY AND APPLICATIONS (FICTA) 2014, VOL 1, 2015, 327 : 605 - 613
  • [34] Multimodal speaker identification with audio-video processing
    Yemez, Y
    Kanak, A
    Erzin, E
    Tekalp, AM
    2003 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL 3, PROCEEDINGS, 2003, : 5 - 8
  • [35] Audio-Video detection of the active speaker in meetings
    Madrigal, Francisco
    Lerasle, Frederic
    Pibre, Lionel
    Ferrane, Isabelle
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 2536 - 2543
  • [36] COLLABORATIVE LEARNING TO GENERATE AUDIO-VIDEO JOINTLY
    Kurmi, Vinod K.
    Bajaj, Vipul
    Patro, Badri N.
    Venkatesh, K. S.
    Namboodiri, Vinay P.
    Jyothi, Preethi
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 4180 - 4184
  • [37] NO CAUSE FOR JUBILATION AT BERLIN AUDIO-VIDEO FAIR
    GOSCH, J
    ELECTRONICS, 1985, 58 (34): : 34 - &
  • [38] USING THE VOICE SPECTRUM FOR IMPROVED TRACKING OF PEOPLE IN A JOINT AUDIO-VIDEO SCHEME
    D'Arca, Eleonora
    Robertson, Neil M.
    Hopgood, James
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 3622 - 3626
  • [39] INTEL-IBMS AUDIO-VIDEO KERNEL
    DONOVAN, JW
    BYTE, 1991, 16 (13): : 177 - &
  • [40] AUDIO-VIDEO TECHNOLOGIES IN LEARNING SOCIAL PROBLEMS
    Pervova, Irina L.
    Kelasyev, Viacheslav N.
    6TH INTERNATIONAL CONFERENCE OF EDUCATION, RESEARCH AND INNOVATION (ICERI 2013), 2013, : 6948 - 6952