Audio visual cues for video indexing and retrieval

被引:0
|
作者
Muneesawang, Paisarn [1 ]
Amin, Tahir [2 ]
Guan, Ling [2 ]
机构
[1] Dept. of Electrical and Computer Engineering, Naresuan University, Thailand
[2] Dept. of Electrical and Computer Engineering, Ryerson University, Toronto, Ont., Canada
关键词
Image retrieval - Video recording;
D O I
10.1007/978-3-540-30541-5_79
中图分类号
学科分类号
摘要
This paper studies content-based video retrieval using the combination of audio and visual features. The visual feature is extracted by an adaptive video indexing technique that places a strong emphasis on accurate characterization of spatio-temporal information within video clips. Audio feature is extracted by a statistical time-frequency analysis method that applies Laplacian mixture models to wavelet coefficients. The proposed joint audio-visual retrieval framework is highly flexible and scalable, and can be effectively applied to various types of video databases. © Springer-Verlag Berlin Heidelberg 2004.
引用
收藏
页码:642 / 649
相关论文
共 50 条
  • [1] Audio visual cues for video indexing and retrieval
    Muneesawang, P
    Amin, T
    Guan, L
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2004, PT 1, PROCEEDINGS, 2004, 3331 : 642 - 649
  • [2] MPEG-7 audio-visual indexing test-bed for video retrieval
    Gagnon, L
    Foucher, S
    Gouaillier, V
    Brun, C
    Brousseau, J
    Boulianne, G
    Osterrath, F
    Chapdelaine, C
    Dutrisac, J
    St-Onge, F
    Champagne, B
    Lu, X
    INTERNET IMAGING V, 2004, 5304 : 319 - 329
  • [3] Semantic indexing of multimedia using audio, text and visual cues
    Iyengar, G
    Nock, H
    Neti, C
    Franz, M
    IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I AND II, PROCEEDINGS, 2002, : A369 - A372
  • [4] Indexing audio-visual sequences by joint audio and video processing
    Saraceno, C
    Leonardi, R
    VSMM98: FUTUREFUSION - APPLICATION REALITIES FOR THE VIRTUAL AGE, VOLS 1 AND 2, 1998, : 686 - 691
  • [5] An Audio Indexing and Retrieval Approach using a Video Surveillance Ontology
    Kazi Tani, Mohammed Yassine
    Ghomari, Abdelghani
    Dali Youcef, Lamia
    Lablack, Adel
    Bilasco, Ioan Marius
    2017 COMPUTING CONFERENCE, 2017, : 258 - 261
  • [6] Bootstrapping Audio-Visual Video Segmentation by Strengthening Audio Cues
    Chen, Tianxiang
    Tan, Zhentao
    Gong, Tao
    Chu, Qi
    Wu, Yue
    Liu, Bin
    Yu, Nenghai
    Lu, Le
    Ye, Jieping
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (03) : 2398 - 2409
  • [7] Video Description Generation using Audio and Visual Cues
    Jin, Qin
    Liang, Junwei
    ICMR'16: PROCEEDINGS OF THE 2016 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2016, : 239 - 242
  • [8] Tennis video abstraction from audio and visual cues
    Coldefy, F
    Bouthemy, P
    Betser, M
    Gravier, G
    2004 IEEE 6TH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 2004, : 163 - 166
  • [9] Semantic Indexing of Multimedia Content Using Visual, Audio, and Text Cues
    W. H. Adams
    Giridharan Iyengar
    Ching-Yung Lin
    Milind Ramesh Naphade
    Chalapathy Neti
    Harriet J. Nock
    John R. Smith
    EURASIP Journal on Advances in Signal Processing, 2003
  • [10] Semantic indexing of multimedia content using visual, audio, and text cues
    Adams, W.H. (whadams@us.ibm.com), 1600, Hindawi Publishing Corporation (2003):