I-Vector DNN Scoring and Calibration for Noise Robust Speaker Verification

被引:2
|
作者
Tan, Zhili [1 ]
Mak, Man-Wai [1 ]
机构
[1] Hong Kong Polytech Univ, Dept Elect & Informat Engn, Hong Kong, Peoples R China
关键词
Deep learning; speaker verification; score calibration; multi-task learning; noise robustness; PLDA;
D O I
10.21437/Interspeech.2017-656
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes applying multi-task learning to train deep neural networks (DNNs) for calibrating the PLDA scores of speaker verification systems under noisy environments. To facilitate the DNNs to learn the main task (calibration). several auxiliary tasks were introduced, including the prediction of SNR and duration from i-vectors and classifying whether an i-vector pair belongs to the same speaker or not. The possibility of replacing the PLDA model by a DNN during the scoring stage is also explored. Evaluations on noise contaminated speech suggest that the auxiliary tasks are important for the DNNs to learn the main calibration task and that the uncalibrated PLDA scores are an essential input to the DNNs. Without this input, the DNNs can only predict the score shifts accurately. suggesting that the PLDA model is indispensable.
引用
收藏
页码:1562 / 1566
页数:5
相关论文
共 50 条
  • [21] Sparsity Analysis and Compensation for i-Vector Based Speaker Verification
    Li, Wei
    Fu, Tian Fan
    Zhu, Jie
    Chen, Ning
    SPEECH AND COMPUTER (SPECOM 2015), 2015, 9319 : 381 - 388
  • [22] An Adaptive i-Vector Extraction for Speaker Verification with Short Utterance
    Poddar, Arnab
    Sahidullah, Md
    Saha, Goutam
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2017, 2017, 10597 : 326 - 332
  • [23] Feature sparsity analysis for i-vector based speaker verification
    Li, Wei
    Fu, Tianfan
    You, Hanxu
    Zhu, Jie
    Chen, Ning
    SPEECH COMMUNICATION, 2016, 80 : 60 - 70
  • [24] Cosine Metric Learning for Speaker Verification in the i-Vector Space
    Bai, Zhong
    Zhang, Xiao-Lei
    Chen, Jingdong
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1126 - 1130
  • [25] Geometric Discriminant Analysis for I-vector Based Speaker Verification
    Xu, Can
    Chen, Xianhong
    He, Liang
    Liu, Jia
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 1636 - 1640
  • [26] Bayesian Principal Component Analysis for I-Vector Speaker Verification
    Rong Y.-F.
    Chen C.
    Chen D.-Y.
    He Y.-J.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2021, 49 (11): : 2186 - 2194
  • [27] WEIGHTED LDA TECHNIQUES FOR I-VECTOR BASED SPEAKER VERIFICATION
    Kanagasundaram, A.
    Dean, D.
    Vogt, R.
    McLaren, M.
    Sridharan, S.
    Mason, M.
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4781 - 4784
  • [28] Simplified supervised i-vector modeling with application to robust and efficient language identification and speaker verification
    Li, Ming
    Narayanan, Shrikanth
    COMPUTER SPEECH AND LANGUAGE, 2014, 28 (04): : 940 - 958
  • [29] PLDA Modeling in I-Vector and Supervector Space for Speaker Verification
    Jiang, Ye
    Lee, Kong Aik
    Tang, Zhenmin
    Ma, Bin
    Larcher, Anthony
    Li, Haizhou
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1678 - 1681
  • [30] PERFORMANCE OF I-VECTOR SPEAKER VERIFICATION AND THE DETECTION OF SYNTHETIC SPEECH
    McClanahan, Richard D.
    Stewart, Bryan
    De Leon, Phillip L.
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,