Bayesian Estimation of PLDA in the Presence of Noisy Training Labels, With Applications to Speaker Verification

被引：3

作者：

Borgstrom, Bengt J. ^{[1
]}

机构：

[1] MIT, Lincoln Lab, Lexington, MA 02420 USA

来源：

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2022年 / 30卷

关键词：

Noise measurement; Estimation; Training; Labeling; Data models; Adaptation models; Bayes methods; Speaker verification; probabilistic linear discriminant analysis; noisy labels; variational bayes;

D O I：

10.1109/TASLP.2021.3130980

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This paperpresents a Bayesian framework for estimating a Probabilistic Linear Discriminant Analysis (PLDA) model in the presence of noisy labels. True class labels are interpreted as latent random variables, which are transmitted through a noisy channel, and received as observed speaker labels. The labeling process is modeled as a Discrete Memoryless Channel (DMC). PLDA hyperparameters are interpreted as random variables, and their joint posterior distribution is derived using mean-field Variational Bayes, allowing maximum a posteriori (MAP) estimates of the PLDA model parameters to be determined. The proposed solution, referred to as VB-MAP, is presented as a general framework, but is studied in the context of speaker verification, and a variety of use cases are discussed. Specifically, VB-MAP can be used for PLDA estimation with unreliable labels, unsupervised PLDA estimation, and to infer the reliability of a PLDA training set. Experimental results show the proposed approach to provide significant performance improvements on a variety of NIST Speaker Recognition Evaluation (SRE) tasks, both for data sets with simulated mislabels, and for data sets with naturally occurring missing or unreliable labels.

引用

页码：414 / 428

页数：15

共 27 条

[1] Bayesian Estimation of PLDA in the Presence of Noisy Training Labels, with Applications to Speaker Verification
Borgstrom, Bengt J.
IEEE/ACM Transactions on Audio Speech and Language Processing, 2022, 30 : 414 - 428
[2] BAYESIAN ESTIMATION OF PLDA WITH NOISY TRAINING LABELS, WITH APPLICATIONS TO SPEAKER VERIFICATION
Borgstrom, Bengt J.
Torres-Carrasquillo, Pedro
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7594 - 7598
[3] Robust Training for Speaker Verification against Noisy Labels
Fang, Zhihua
He, Liang
Ma, Hanhan
Guo, Xiaochen
Li, Lin
INTERSPEECH 2023, 2023, : 3192 - 3196
[4] Local Training in Speaker Verification for PLDA
Pahuja, Hunny
Ranjan, Priya
Ujlayan, Amit
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND AUTOMATION (ICCCA), 2017, : 1466 - 1469
[5] Unsupervised Bayesian Adaptation of PLDA for Speaker Verification
Borgstrorn, Bengt J.
INTERSPEECH 2021, 2021, : 1039 - 1043
[6] CONSTRAINED DISCRIMINATIVE PLDA TRAINING FOR SPEAKER VERIFICATION
Rohdin, Johan
Biswas, Sangeeta
Shinoda, Koichi
2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
[7] Gaussian PLDA for speaker verification and joint estimation
Xu, Yun-Fei
Yang, Hai
Zhou, Ruo-Hua
Yan, Yong-Hong
Zidonghua Xuebao/Acta Automatica Sinica, 2014, 40 (06): : 1068 - 1074
[8] Unsupervised Discriminative Training of PLDA for Domain Adaptation in Speaker Verification
Wang, Qiongqiong
Koshinaka, Takafumi
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 3727 - 3731
[9] MULTI-OBJECTIVE OPTIMIZATION TRAINING OF PLDA FOR SPEAKER VERIFICATION
He, Liang
Chen, Xianhong
Xu, Can
Liu, Jia
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6026 - 6030
[10] A Bayesian approach to the verification problem: Applications to speaker verification
Jiang, H
Deng, L
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (08): : 874 - 884

← 1 2 3 →