An Audio-video Summarization Scheme Based on Audio and Video Analysis

被引：0

作者：

Furini, Marco ^{[1
]}

Ghini, Vittorio ^{[2
]}

机构：

[1] Univ Piemonte Orientale, Dept Comp Sci, I-15100 Alessandria, Italy

[2] Univ Bologna, Dept Comp Sci, I-40127 Bologna, Italy

来源：

2006 3RD IEEE CONSUMER COMMUNICATIONS AND NETWORKING CONFERENCE, VOLS 1-3 | 2006年

关键词：

D O I：

暂无

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

The availability of video files in the Internet is growing at an exceptional speed and in the near future video browsing will be a common activity. To facilitate such activity it will be necessary to have a small clip for any given video. Currently, video skimming and video summarization techniques can reduce the temporal representation of a given video. However, most of these techniques do not include audio in the produced summaries. Here, we propose a mechanism that, using audio and video analysis, produces video summaries coupled with intelligible audio. Experimental results show that the summaries are largely reduced (up to 50%) and that the perceived video quality may he comparable to the one of the original video (in term of jerkiness). Consumers satisfaction has been investigated through MOS and results show that our summaries can be considered as an alternative to the original videos.

引用

页码：1209 / +

页数：2

共 50 条

[31] Audio to audio-video speech conversion with the help of phonetic knowledge integration
Bothe, HH
SMC '97 CONFERENCE PROCEEDINGS - 1997 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5: CONFERENCE THEME: COMPUTATIONAL CYBERNETICS AND SIMULATION, 1997, : 1632 - 1637
[32] Kalman filters for audio-video source localization
Gehrig, T
Nickel, K
Ekenel, HK
Klee, U
McDonough, J
2005 WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2005, : 118 - 121
[33] Performance Enhancement for Audio-Video Proxy Server
Kanrar, Soumen
Mandal, Niranjan Kumar
PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON FRONTIERS OF INTELLIGENT COMPUTING: THEORY AND APPLICATIONS (FICTA) 2014, VOL 1, 2015, 327 : 605 - 613
[34] Multimodal speaker identification with audio-video processing
Yemez, Y
Kanak, A
Erzin, E
Tekalp, AM
2003 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL 3, PROCEEDINGS, 2003, : 5 - 8
[35] Audio-Video detection of the active speaker in meetings
Madrigal, Francisco
Lerasle, Frederic
Pibre, Lionel
Ferrane, Isabelle
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 2536 - 2543
[36] COLLABORATIVE LEARNING TO GENERATE AUDIO-VIDEO JOINTLY
Kurmi, Vinod K.
Bajaj, Vipul
Patro, Badri N.
Venkatesh, K. S.
Namboodiri, Vinay P.
Jyothi, Preethi
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 4180 - 4184
[37] NO CAUSE FOR JUBILATION AT BERLIN AUDIO-VIDEO FAIR
GOSCH, J
ELECTRONICS, 1985, 58 (34): : 34 - &
[38] USING THE VOICE SPECTRUM FOR IMPROVED TRACKING OF PEOPLE IN A JOINT AUDIO-VIDEO SCHEME
D'Arca, Eleonora
Robertson, Neil M.
Hopgood, James
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 3622 - 3626
[39] INTEL-IBMS AUDIO-VIDEO KERNEL
DONOVAN, JW
BYTE, 1991, 16 (13): : 177 - &
[40] AUDIO-VIDEO TECHNOLOGIES IN LEARNING SOCIAL PROBLEMS
Pervova, Irina L.
Kelasyev, Viacheslav N.
6TH INTERNATIONAL CONFERENCE OF EDUCATION, RESEARCH AND INNOVATION (ICERI 2013), 2013, : 6948 - 6952

← 1 2 3 4 5 →