I-Vector DNN Scoring and Calibration for Noise Robust Speaker Verification

Cited by: 2
Authors
Tan, Zhili [1]
Mak, Man-Wai [1]
Affiliations
[1] Hong Kong Polytech Univ, Dept Elect & Informat Engn, Hong Kong, Peoples R China
Keywords
Deep learning; speaker verification; score calibration; multi-task learning; noise robustness; PLDA
DOI
10.21437/Interspeech.2017-656
Chinese Library Classification (CLC)
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
This paper proposes applying multi-task learning to train deep neural networks (DNNs) for calibrating the PLDA scores of speaker verification systems in noisy environments. To help the DNNs learn the main task (calibration), several auxiliary tasks were introduced, including predicting SNR and duration from i-vectors and classifying whether an i-vector pair belongs to the same speaker. The possibility of replacing the PLDA model with a DNN during the scoring stage is also explored. Evaluations on noise-contaminated speech suggest that the auxiliary tasks are important for the DNNs to learn the main calibration task and that the uncalibrated PLDA scores are an essential input to the DNNs. Without this input, the DNNs can only predict the score shifts accurately, suggesting that the PLDA model is indispensable.
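As a rough illustration of the multi-task idea in the abstract, the sketch below combines a main calibration loss with the three auxiliary losses (SNR prediction, duration prediction, same-speaker classification) into one weighted training objective. The abstract does not state the loss functions, the task weights, or the output format, so the squared-error and cross-entropy choices, the weight values, and all names here (`multitask_loss`, `mse`, `bce`) are assumptions made for illustration only, not the authors' implementation.

```python
import math


def mse(pred, target):
    """Squared error for a single regression output (assumed loss)."""
    return (pred - target) ** 2


def bce(logit, label):
    """Numerically stable binary cross-entropy computed from a raw logit."""
    return max(logit, 0.0) - logit * label + math.log1p(math.exp(-abs(logit)))


def multitask_loss(outputs, targets, weights):
    """Weighted sum of the main and auxiliary losses for one i-vector pair.

    All three arguments are dicts keyed by task name. The task names and
    the loss assigned to each task are hypothetical.
    """
    losses = {
        "calibration": mse(outputs["calibration"], targets["calibration"]),
        "snr": mse(outputs["snr"], targets["snr"]),
        "duration": mse(outputs["duration"], targets["duration"]),
        "same_speaker": bce(outputs["same_speaker"], targets["same_speaker"]),
    }
    total = sum(weights[name] * value for name, value in losses.items())
    return total, losses


# Made-up network outputs and targets for a single trial.
outputs = {"calibration": 1.5, "snr": 6.0, "duration": 10.0, "same_speaker": 0.0}
targets = {"calibration": 2.0, "snr": 6.0, "duration": 12.0, "same_speaker": 1.0}
weights = {"calibration": 1.0, "snr": 0.1, "duration": 0.1, "same_speaker": 0.5}

total, losses = multitask_loss(outputs, targets, weights)
print(f"total loss: {total:.4f}")
```

In a setup like this, the auxiliary terms act as regularizers on the shared layers: setting their weights to zero reduces the objective to single-task calibration, which is one way to probe how much the auxiliary tasks contribute.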
Pages: 1562-1566
Number of pages: 5