Audio visual cues for video indexing and retrieval

被引：0

作者：

Muneesawang, Paisarn ^{[1
]}

Amin, Tahir ^{[2
]}

Guan, Ling ^{[2
]}

机构：

[1] Dept. of Electrical and Computer Engineering, Naresuan University, Thailand

[2] Dept. of Electrical and Computer Engineering, Ryerson University, Toronto, Ont., Canada

来源：

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) | 2004年 / 3331卷

关键词：

Image retrieval - Video recording;

D O I：

10.1007/978-3-540-30541-5_79

中图分类号：

学科分类号：

摘要：

This paper studies content-based video retrieval using the combination of audio and visual features. The visual feature is extracted by an adaptive video indexing technique that places a strong emphasis on accurate characterization of spatio-temporal information within video clips. Audio feature is extracted by a statistical time-frequency analysis method that applies Laplacian mixture models to wavelet coefficients. The proposed joint audio-visual retrieval framework is highly flexible and scalable, and can be effectively applied to various types of video databases. © Springer-Verlag Berlin Heidelberg 2004.

引用

页码：642 / 649

共 50 条

[1] Audio visual cues for video indexing and retrieval
Muneesawang, P
Amin, T
Guan, L
ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2004, PT 1, PROCEEDINGS, 2004, 3331 : 642 - 649
[2] MPEG-7 audio-visual indexing test-bed for video retrieval
Gagnon, L
Foucher, S
Gouaillier, V
Brun, C
Brousseau, J
Boulianne, G
Osterrath, F
Chapdelaine, C
Dutrisac, J
St-Onge, F
Champagne, B
Lu, X
INTERNET IMAGING V, 2004, 5304 : 319 - 329
[3] Semantic indexing of multimedia using audio, text and visual cues
Iyengar, G
Nock, H
Neti, C
Franz, M
IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I AND II, PROCEEDINGS, 2002, : A369 - A372
[4] Indexing audio-visual sequences by joint audio and video processing
Saraceno, C
Leonardi, R
VSMM98: FUTUREFUSION - APPLICATION REALITIES FOR THE VIRTUAL AGE, VOLS 1 AND 2, 1998, : 686 - 691
[5] An Audio Indexing and Retrieval Approach using a Video Surveillance Ontology
Kazi Tani, Mohammed Yassine
Ghomari, Abdelghani
Dali Youcef, Lamia
Lablack, Adel
Bilasco, Ioan Marius
2017 COMPUTING CONFERENCE, 2017, : 258 - 261
[6] Bootstrapping Audio-Visual Video Segmentation by Strengthening Audio Cues
Chen, Tianxiang
Tan, Zhentao
Gong, Tao
Chu, Qi
Wu, Yue
Liu, Bin
Yu, Nenghai
Lu, Le
Ye, Jieping
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (03) : 2398 - 2409
[7] Video Description Generation using Audio and Visual Cues
Jin, Qin
Liang, Junwei
ICMR'16: PROCEEDINGS OF THE 2016 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2016, : 239 - 242
[8] Tennis video abstraction from audio and visual cues
Coldefy, F
Bouthemy, P
Betser, M
Gravier, G
2004 IEEE 6TH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 2004, : 163 - 166
[9] Semantic Indexing of Multimedia Content Using Visual, Audio, and Text Cues
W. H. Adams
Giridharan Iyengar
Ching-Yung Lin
Milind Ramesh Naphade
Chalapathy Neti
Harriet J. Nock
John R. Smith
EURASIP Journal on Advances in Signal Processing, 2003
[10] Semantic indexing of multimedia content using visual, audio, and text cues
Adams, W.H. (whadams@us.ibm.com), 1600, Hindawi Publishing Corporation (2003):

← 1 2 3 4 5 →