Variational Bayesian methods for audio indexing

被引:0
|
作者
Valente, F [1 ]
Wellekens, C [1 ]
机构
[1] Inst Eurecom, Sophia Antipolis, France
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we aim to investigate the use of Variational Bayesian methods for audio indexing purposes. Variational Bayesian (VB) techniques are approximated techniques for fully Bayesian learning. Contrarily to non Bayesian methods (e.g. Maximum Likelihood) or partially Bayesian criterion (e.g. Maximum a Posteriori), VB benefits from important model selection properties. VB learning is based on the Free Energy optimization; Free Energy can be used at the same time as an objective function and as a model selection criterion allowing simultaneous model learning/model selection. Here we explore the use of VB learning and VB model selection in a speaker clustering task comparing results with classical learning techniques (ML and MAP) and classical model selection criteria (BIC). Experiments are run on the evaluation data set NIST-1996 HUB-4 and results show that VB can outperform classical methods.
引用
收藏
页码:307 / 319
页数:13
相关论文
共 50 条
  • [31] Audio indexing:: primary components retrieval -: Robust classification in audio documents
    Pinquier, Julien
    Andre-Obrecht, Regine
    MULTIMEDIA TOOLS AND APPLICATIONS, 2006, 30 (03) : 313 - 330
  • [32] Wavelet-based indexing of audio data in audio/multimedia databases
    Subramanya, SR
    Youssef, A
    INTERNATIONAL WORKSHOP ON MULTI-MEDIA DATABASE MANAGEMENT SYSTEMS- PROCEEDINGS, 1998, : 46 - 53
  • [33] Indexing audio-visual sequences by joint audio and video processing
    Saraceno, C
    Leonardi, R
    VSMM98: FUTUREFUSION - APPLICATION REALITIES FOR THE VIRTUAL AGE, VOLS 1 AND 2, 1998, : 686 - 691
  • [34] CueVideo: Automated video/audio indexing and browsing
    Amir, A
    Srinivasan, S
    Ponceleon, D
    Petkovic, D
    SIGIR'99: PROCEEDINGS OF 22ND INTERNATIONAL CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 1999, : 326 - 326
  • [35] Robustness evaluation of the basic descriptors for audio indexing
    Essafi, Hassane
    Sayah, Salima
    Ouddan, Mohamed Amine
    12TH INTERNATIONAL MULTI-MEDIA MODELLING CONFERENCE PROCEEDINGS, 2006, : 369 - 376
  • [36] New word detection in audio-indexing
    Dharanipragada, S
    Roukos, S
    1997 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, PROCEEDINGS, 1997, : 551 - 557
  • [37] Using audio description for indexing moving images
    Turner, JM
    Colinet, EL
    KNOWLEDGE ORGANIZATION, 2004, 31 (04): : 222 - 230
  • [38] Audio indexing for efficient music information retrieval
    Karydis, I
    Nanopoulos, A
    Papadopoulos, AN
    Manolopoulos, Y
    11TH INTERNATIONAL MULTIMEDIA MODELLING CONFERENCE, PROCEEDINGS, 2005, : 22 - 29
  • [39] An unsupervised scheme for speaker indexing of audio databases
    Chen, Yanxiang
    Liu, Ming
    2009 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND INTELLIGENT SYSTEMS, PROCEEDINGS, VOL 3, 2009, : 90 - +
  • [40] Transcribing broadcast news for audio and video indexing
    Gauvain, JL
    Lamel, L
    Adda, G
    COMMUNICATIONS OF THE ACM, 2000, 43 (02) : 64 - 70