Duration Dependent Covariance Regularization in PLDA Modeling for Speaker Verification

被引：0

作者：

Cai, Weicheng ^{[2
,3
]}

Li, Ming ^{[1
,2
]}

Li, Lin ^{[4
]}

Hong, Qingyang ^{[4
]}

机构：

[1] Sun Yat Sen Univ, SYSU CMU Joint Inst Engn, Guangzhou, Guangdong, Peoples R China

[2] SYSU CMU Shunde Int Joint Res Inst, Shunde, Guangdong, Peoples R China

[3] Sun Yat Sen Univ, Sch Informat Sci & Technol, Guangzhou, Guangdong, Peoples R China

[4] Xiamen Univ, Sch Informat Sci & Technol, Xiamen, Peoples R China

来源：

16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5 | 2015年

关键词：

PLDA; covariance regularization; i-vector; speaker verification; duration; ROBUST;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this paper, we present a covariance regularized probabilistic linear discriminant analysis (CR-PLDA) model for text independent speaker verification. In the conventional simplified PLDA modeling, the covariance matrix used to capture the residual energies is globally shared for all i-vectors. However, we believe that the point estimated i-vectors from longer speech utterances may be more accurate and their corresponding co-variances in the PLDA modeling should be smaller. Similar to the inverse 0th order statistics weighted covariance in the i-vector model training, we propose a duration dependent normalized exponential term containing the duration normalizing factor mu and duration extent factor v to regularize the covariance in the PLDA modeling. Experimental results are reported on the NIST SRE 2010 common condition 5 female part task and the NIST 2014 i-vector machine learning challenge, respectively. For both tasks, the proposed covariance regularized PLDA system outperforms the baseline PLDA system by more than 13% relatively in terms of equal error rate (EER) and norm minDCF values.

引用

页码：1027 / 1031

页数：5

共 50 条

[1] PLDA FOR SPEAKER VERIFICATION WITH UTTERANCES OF ARBITRARY DURATION
Kenny, Patrick
Stafylakis, Themos
Ouellet, Pierre
Alam, Md Jahangir
Dumouchel, Pierre
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7649 - 7653
[2] PLDA Modeling in the Fishervoice Subspace for Speaker Verification
Zhong, Jinghua
Jiang, Weiwu
Rao, Wei
Mak, Man-Wai
Meng, Helen
15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 1130 - 1134
[3] SNR-Invariant PLDA Modeling for Robust Speaker Verification
Li, Na
Mak, Man-Wai
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2317 - 2321
[4] Neural PLDA Modeling for End-to-End Speaker Verification
Ramoji, Shreyas
Krishnan, Prashant
Ganapathy, Sriram
INTERSPEECH 2020, 2020, : 4333 - 4337
[5] INTRA-CLASS COVARIANCE ADAPTATION IN PLDA BACK-ENDS FOR SPEAKER VERIFICATION
Madikeri, Srikanth
Ferras, Marc
Motlicek, Petr
Dey, Subhadeep
2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5365 - 5369
[6] Nonparametrically trained PLDA for short duration i-vector speaker verification
Khosravani, Abbas
Homayounpour, Mohammad M.
COMPUTER SPEECH AND LANGUAGE, 2018, 52 : 105 - 122
[7] Dataset-Invariant Covariance Normalization for Out-domain PLDA Speaker Verification
Rahman, Md Hafizur
Kanagasundaram, Ahilan
Dean, David
Sridharan, Sridha
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1017 - 1021
[8] Local Training in Speaker Verification for PLDA
Pahuja, Hunny
Ranjan, Priya
Ujlayan, Amit
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND AUTOMATION (ICCCA), 2017, : 1466 - 1469
[9] PLDA Modeling in I-Vector and Supervector Space for Speaker Verification
Jiang, Ye
Lee, Kong Aik
Tang, Zhenmin
Ma, Bin
Larcher, Anthony
Li, Haizhou
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1678 - 1681
[10] PHONETICALLY-CONSTRAINED PLDA MODELING FOR TEXT-DEPENDENT SPEAKER VERIFICATION WITH MULTIPLE SHORT UTTERANCES
Larcher, Anthony
Lee, Kong Aik
Ma, Bin
Li, Haizhou
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7673 - 7677

← 1 2 3 4 5 →