Bayesian Estimation of PLDA in the Presence of Noisy Training Labels, With Applications to Speaker Verification

Cited by: 3
Authors
Borgstrom, Bengt J. [1]
Affiliations
[1] MIT, Lincoln Lab, Lexington, MA 02420 USA
Keywords
Noise measurement; Estimation; Training; Labeling; Data models; Adaptation models; Bayes methods; Speaker verification; Probabilistic linear discriminant analysis; Noisy labels; Variational Bayes
DOI
10.1109/TASLP.2021.3130980
Chinese Library Classification (CLC) number
O42 [Acoustics]
Subject classification codes
070206; 082403
Abstract
This paper presents a Bayesian framework for estimating a Probabilistic Linear Discriminant Analysis (PLDA) model in the presence of noisy labels. True class labels are interpreted as latent random variables, which are transmitted through a noisy channel, and received as observed speaker labels. The labeling process is modeled as a Discrete Memoryless Channel (DMC). PLDA hyperparameters are interpreted as random variables, and their joint posterior distribution is derived using mean-field Variational Bayes, allowing maximum a posteriori (MAP) estimates of the PLDA model parameters to be determined. The proposed solution, referred to as VB-MAP, is presented as a general framework, but is studied in the context of speaker verification, and a variety of use cases are discussed. Specifically, VB-MAP can be used for PLDA estimation with unreliable labels, unsupervised PLDA estimation, and to infer the reliability of a PLDA training set. Experimental results show the proposed approach to provide significant performance improvements on a variety of NIST Speaker Recognition Evaluation (SRE) tasks, both for data sets with simulated mislabels, and for data sets with naturally occurring missing or unreliable labels.
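The label-noise model in the abstract can be illustrated with a minimal sketch. It is not the paper's VB-MAP algorithm: it covers only the idea of passing true labels through a Discrete Memoryless Channel and inverting the channel with Bayes' rule. The symmetric channel, the flip probability, the uniform prior, and all function names are assumptions made for illustration. The abstract describes true labels as latent variables within a PLDA model, so a full treatment would presumably also use the speaker embeddings, which this sketch omits.

```python
import random


def symmetric_dmc(num_classes, flip_prob):
    """Return channel[t][o] = P(observed = o | true = t).

    Assumption for illustration: a mislabel is equally likely to land on
    any of the other classes (a symmetric channel).
    """
    off_diagonal = flip_prob / (num_classes - 1)
    return [
        [1.0 - flip_prob if t == o else off_diagonal for o in range(num_classes)]
        for t in range(num_classes)
    ]


def transmit(true_labels, channel, rng):
    """Corrupt each label independently of all others (memoryless)."""
    classes = list(range(len(channel)))
    return [rng.choices(classes, weights=channel[t])[0] for t in true_labels]


def true_label_posterior(observed, channel, prior):
    """Bayes' rule: P(true = k | observed) is proportional to
    P(observed | true = k) * prior[k]."""
    joint = [channel[k][observed] * prior[k] for k in range(len(prior))]
    total = sum(joint)
    return [p / total for p in joint]


rng = random.Random(0)  # fixed seed so the simulation is repeatable
channel = symmetric_dmc(num_classes=4, flip_prob=0.2)

# Simulate a training set with mislabels.
true_labels = [rng.randrange(4) for _ in range(1000)]
noisy_labels = transmit(true_labels, channel, rng)
mislabel_rate = sum(t != o for t, o in zip(true_labels, noisy_labels)) / len(true_labels)

# Belief about the true class after observing label 2, under a uniform prior.
posterior = true_label_posterior(observed=2, channel=channel, prior=[0.25] * 4)
```

With a uniform prior the posterior simply mirrors the channel column for the observed label, so the observed class keeps probability `1 - flip_prob`. A non-uniform prior would shift that belief.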
Pages: 414-428
Number of pages: 15