Variational Bayesian methods for audio indexing

被引:0
|
作者
Valente, F [1 ]
Wellekens, C [1 ]
机构
[1] Inst Eurecom, Sophia Antipolis, France
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we aim to investigate the use of Variational Bayesian methods for audio indexing purposes. Variational Bayesian (VB) techniques are approximated techniques for fully Bayesian learning. Contrarily to non Bayesian methods (e.g. Maximum Likelihood) or partially Bayesian criterion (e.g. Maximum a Posteriori), VB benefits from important model selection properties. VB learning is based on the Free Energy optimization; Free Energy can be used at the same time as an objective function and as a model selection criterion allowing simultaneous model learning/model selection. Here we explore the use of VB learning and VB model selection in a speaker clustering task comparing results with classical learning techniques (ML and MAP) and classical model selection criteria (BIC). Experiments are run on the evaluation data set NIST-1996 HUB-4 and results show that VB can outperform classical methods.
引用
收藏
页码:307 / 319
页数:13
相关论文
共 50 条
  • [21] Speech processing for audio indexing
    Lamel, Lori
    Gauvain, Jean-Luc
    ADVANCES IN NATURAL LANGUAGE PROCESSING, PROCEEDINGS, 2008, 5221 : 4 - 15
  • [22] Unsupervised Bayesian Surprise Detection in Spatial Audio with Convolutional Variational Autoencoder and LSTM Model
    Khah, Arman Nik
    Htun, Chitsein
    Prakash, Ravi
    PROCEEDINGS OF THE 2024 ACM INTERNATIONAL CONFERENCE ON INTERACTIVE MEDIA EXPERIENCES WORKSHOPS, IMXW 2024, 2024, : 116 - 121
  • [23] Variational Bayesian Methods for Stochastically Constrained System Design Problems
    Jaiswal, Prateek
    Honnappa, Harsh
    Rao, Vinayak A.
    SYMPOSIUM ON ADVANCES IN APPROXIMATE BAYESIAN INFERENCE, VOL 118, 2019, 118
  • [24] Audio indexing: primary components retrievalRobust classification in audio documents
    Julien Pinquier
    Régine André-Obrecht
    Multimedia Tools and Applications, 2006, 30 : 313 - 330
  • [25] Audio retrieval by latent perceptual indexing
    Sundaram, Shiva
    Narayanan, Shrikanth
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 49 - 52
  • [26] Audio indexing of Arabic broadcast news
    Billa, J
    Noamany, M
    Srivastava, A
    Liu, D
    Stone, R
    Xu, J
    Makhoul, J
    Kubala, F
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 5 - 8
  • [27] A morphological generator for the indexing of arabic audio
    Shaalan, KF
    Talhami, HE
    Kamel, IH
    PROCEEDINGS OF THE NINTH IASTED INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, 2005, : 307 - 312
  • [28] Bayesian and Variational Methods for Discontinuity Detection: Theory Overview and Performance Comparison
    Benciolini, Battista
    Reguzzoni, Mirko
    Venuti, Giovanna
    Vitti, Alfonso
    VII HOTINE-MARUSSI SYMPOSIUM ON MATHEMATICAL GEODESY, 2012, 137
  • [29] ESTIMATION OF NAVIGATION PERFORMANCE AND OFFSET BY THE EM ALGORITHM AND THE VARIATIONAL BAYESIAN METHODS
    Fujita, Masato
    ADVANCES AND APPLICATIONS IN STATISTICS, 2013, 35 (01) : 1 - 27
  • [30] HMM-based text segmentation using variational Bayes learning and its application to audio-visual indexing
    Koshinaka, Takafumi
    Okumura, Akitoshi
    Isotani, Ryosuke
    ELECTRONICS AND COMMUNICATIONS IN JAPAN PART II-ELECTRONICS, 2007, 90 (12): : 1 - 11