Duration Dependent Covariance Regularization in PLDA Modeling for Speaker Verification

Cited by: 0
Authors
Cai, Weicheng [2, 3]
Li, Ming [1, 2]
Li, Lin [4 ]
Hong, Qingyang [4 ]
Affiliations
[1] Sun Yat Sen Univ, SYSU CMU Joint Inst Engn, Guangzhou, Guangdong, Peoples R China
[2] SYSU CMU Shunde Int Joint Res Inst, Shunde, Guangdong, Peoples R China
[3] Sun Yat Sen Univ, Sch Informat Sci & Technol, Guangzhou, Guangdong, Peoples R China
[4] Xiamen Univ, Sch Informat Sci & Technol, Xiamen, Peoples R China
Keywords
PLDA; covariance regularization; i-vector; speaker verification; duration; ROBUST
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
In this paper, we present a covariance-regularized probabilistic linear discriminant analysis (CR-PLDA) model for text-independent speaker verification. In conventional simplified PLDA modeling, the covariance matrix that captures the residual energies is shared globally across all i-vectors. However, we believe that point-estimated i-vectors from longer speech utterances may be more accurate, so their corresponding covariances in the PLDA model should be smaller. Similar to the inverse zeroth-order statistics weighted covariance in i-vector model training, we propose a duration-dependent normalized exponential term, containing the duration normalizing factor mu and the duration extent factor v, to regularize the covariance in PLDA modeling. Experimental results are reported on the female part of NIST SRE 2010 common condition 5 and on the NIST 2014 i-vector machine learning challenge. On both tasks, the proposed covariance-regularized PLDA system outperforms the baseline PLDA system by more than 13% relative in terms of equal error rate (EER) and norm minDCF values.
Pages: 1027-1031
Page count: 5
Related papers
  • [1] PLDA FOR SPEAKER VERIFICATION WITH UTTERANCES OF ARBITRARY DURATION
    Kenny, Patrick
    Stafylakis, Themos
    Ouellet, Pierre
    Alam, Md Jahangir
    Dumouchel, Pierre
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7649 - 7653
  • [2] PLDA Modeling in the Fishervoice Subspace for Speaker Verification
    Zhong, Jinghua
    Jiang, Weiwu
    Rao, Wei
    Mak, Man-Wai
    Meng, Helen
    15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 1130 - 1134
  • [3] SNR-Invariant PLDA Modeling for Robust Speaker Verification
    Li, Na
    Mak, Man-Wai
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2317 - 2321
  • [4] Neural PLDA Modeling for End-to-End Speaker Verification
    Ramoji, Shreyas
    Krishnan, Prashant
    Ganapathy, Sriram
    INTERSPEECH 2020, 2020, : 4333 - 4337
  • [5] INTRA-CLASS COVARIANCE ADAPTATION IN PLDA BACK-ENDS FOR SPEAKER VERIFICATION
    Madikeri, Srikanth
    Ferras, Marc
    Motlicek, Petr
    Dey, Subhadeep
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5365 - 5369
  • [6] Nonparametrically trained PLDA for short duration i-vector speaker verification
    Khosravani, Abbas
    Homayounpour, Mohammad M.
    COMPUTER SPEECH AND LANGUAGE, 2018, 52 : 105 - 122
  • [7] Dataset-Invariant Covariance Normalization for Out-domain PLDA Speaker Verification
    Rahman, Md Hafizur
    Kanagasundaram, Ahilan
    Dean, David
    Sridharan, Sridha
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1017 - 1021
  • [8] Local Training in Speaker Verification for PLDA
    Pahuja, Hunny
    Ranjan, Priya
    Ujlayan, Amit
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND AUTOMATION (ICCCA), 2017, : 1466 - 1469
  • [9] PLDA Modeling in I-Vector and Supervector Space for Speaker Verification
    Jiang, Ye
    Lee, Kong Aik
    Tang, Zhenmin
    Ma, Bin
    Larcher, Anthony
    Li, Haizhou
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1678 - 1681
  • [10] PHONETICALLY-CONSTRAINED PLDA MODELING FOR TEXT-DEPENDENT SPEAKER VERIFICATION WITH MULTIPLE SHORT UTTERANCES
    Larcher, Anthony
    Lee, Kong Aik
    Ma, Bin
    Li, Haizhou
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7673 - 7677