Audio-Video based Segmentation and Classification Using SVM

Cited by: 0
Authors
Subashini, K. [1]
Palanivel, S.
Ramaligam, V. [2]
Affiliations
[1] Annamalai Univ, Chidambaram 608002, India
[2] Annamalai Univ, Dept Comp Sci & Engn, Chidambaram, India
Keywords
Support vector machines; Mel frequency cepstral coefficients; Color histogram; Audio classification; Video classification; Audio-video classification
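DOI
Not available
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract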
In this paper, we propose a method that combines audio and video for segmentation and classification. The objective of segmentation is to detect category change points, such as news followed by an advertisement. The classification system assigns the audio and video data to one of the predefined categories: news, advertisement, sports, serial, and movies. Automatic audio-video classification is useful for audio-video indexing and content-based audio-video retrieval. Mel frequency cepstral coefficients are used as acoustic features and color histograms are used as visual features for segmentation and classification. A support vector machine (SVM) is used for both segmentation and classification. Experiments on different genres show that the classification results are significant. The audio and video classification evidence is combined using the weighted sum rule for audio-video based segmentation and classification.
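The weighted sum rule mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the weights, the score normalization, or the segment length, so everything below is an assumption made for illustration: the function names (`fuse_weighted_sum`, `classify`, `change_points`), the default `audio_weight`, and the example scores are hypothetical, and the per-category scores are assumed to come from already trained audio and video classifiers and to lie on a comparable scale.

```python
# Illustrative sketch only; not the authors' implementation.
# Category names are taken from the abstract; all numbers are made up.
CATEGORIES = ["news", "advertisement", "sports", "serial", "movies"]


def fuse_weighted_sum(audio_scores, video_scores, audio_weight=0.5):
    """Combine per-category scores from two modalities with a weighted sum.

    audio_weight is a hypothetical parameter in [0, 1]; the video weight
    is taken as 1 - audio_weight so the two weights sum to one.
    """
    if len(audio_scores) != len(video_scores):
        raise ValueError("score vectors must have the same length")
    if not 0.0 <= audio_weight <= 1.0:
        raise ValueError("audio_weight must lie in [0, 1]")
    video_weight = 1.0 - audio_weight
    return [audio_weight * a + video_weight * v
            for a, v in zip(audio_scores, video_scores)]


def classify(fused_scores):
    """Return the category with the highest fused score."""
    best = max(range(len(fused_scores)), key=fused_scores.__getitem__)
    return CATEGORIES[best]


def change_points(labels):
    """Return the indices at which the predicted category changes."""
    return [i for i in range(1, len(labels)) if labels[i] != labels[i - 1]]


# Hypothetical scores for one segment: audio favors news, video favors
# advertisement. The chosen weight decides which modality dominates.
audio = [0.6, 0.3, 0.05, 0.03, 0.02]
video = [0.2, 0.7, 0.05, 0.03, 0.02]
print(classify(fuse_weighted_sum(audio, video, audio_weight=0.5)))
print(classify(fuse_weighted_sum(audio, video, audio_weight=0.8)))

# Hypothetical per-segment labels; a change of label marks a boundary.
print(change_points(["news", "news", "advertisement", "advertisement", "sports"]))
```

In this sketch a category change point is simply a position where consecutive segment labels differ. The paper may detect change points differently, since the abstract states only that an SVM is used for segmentation.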
Pages: 6