Sparsity Analysis and Compensation for i-Vector Based Speaker Verification

Cited by: 1
Authors
Li, Wei [1]
Fu, Tian Fan [2]
Zhu, Jie [1]
Chen, Ning [3]
Affiliations
[1] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China
[2] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn CSE, Shanghai 200240, Peoples R China
[3] East China Univ S&T, Sch Informat Sci & Engn, Shanghai 200237, Peoples R China
Source
Keywords
Speaker verification; i-vector; Phonetic sparsity; Adapted first order Baum-Welch statistics analysis (AFSA)
DOI
10.1007/978-3-319-23132-7_47
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In recent years, the i-vector framework has been shown to provide state-of-the-art performance in speaker verification. Most research focuses on compensating for the channel variability of i-vectors. In this paper, we show that when the duration of the enrollment or test utterance is limited, an i-vector based system may suffer from biased estimation. To address this problem, we propose an improved i-vector extraction algorithm, which we term Adapted First-order Baum-Welch Statistics Analysis (AFSA). The algorithm suppresses and compensates for the deviation of the first-order Baum-Welch statistics caused by phonetic sparsity and phonetic imbalance. Experiments on the NIST 2008 SRE data sets show a 10%-15% relative improvement over the baseline of a traditional i-vector based system.
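For readers unfamiliar with the quantities named in the abstract, the following is a minimal sketch of how zeroth- and centered first-order Baum-Welch statistics are conventionally computed against a diagonal-covariance GMM. It is not the AFSA algorithm, whose details this record does not give. The function name `baum_welch_stats`, the toy model sizes, and the synthetic data are all assumptions made for illustration.

```python
import numpy as np

def baum_welch_stats(frames, weights, means, variances):
    """Zeroth- and centered first-order statistics of `frames` against a
    diagonal-covariance GMM. Illustrative sketch, not the paper's AFSA.

    frames: (T, D); weights: (C,); means, variances: (C, D)
    """
    diff = frames[:, None, :] - means[None, :, :]                 # (T, C, D)
    log_gauss = -0.5 * (np.sum(diff ** 2 / variances, axis=2)
                        + np.sum(np.log(2 * np.pi * variances), axis=1))
    log_post = np.log(weights) + log_gauss                        # (T, C)
    # Normalize in the log domain to get per-frame component posteriors.
    log_post -= np.logaddexp.reduce(log_post, axis=1, keepdims=True)
    post = np.exp(log_post)
    n = post.sum(axis=0)                                          # zeroth order, (C,)
    f = post.T @ frames - n[:, None] * means                      # first order, (C, D)
    return n, f

# Hypothetical toy model: 8 components, 4-dimensional features.
rng = np.random.default_rng(0)
C, D, T = 8, 4, 20
weights = np.full(C, 1.0 / C)
means = rng.normal(scale=5.0, size=(C, D))
variances = np.ones((C, D))

# A short "utterance" whose frames are drawn around only two components,
# mimicking an utterance that covers few phonetic classes.
short = means[rng.integers(0, 2, size=T)] + rng.normal(size=(T, D))
n_short, f_short = baum_welch_stats(short, weights, means, variances)
print(np.round(n_short, 2))
```

In this toy setup the occupancy counts in `n_short` concentrate on the components the frames were drawn from, while the remaining components receive little or no mass. This is one way to picture the phonetic sparsity the abstract refers to.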
Pages: 381-388
Page count: 8
Related papers
50 items in total
  • [31] PLDA Modeling in I-Vector and Supervector Space for Speaker Verification
    Jiang, Ye
    Lee, Kong Aik
    Tang, Zhenmin
    Ma, Bin
    Larcher, Anthony
    Li, Haizhou
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1678 - 1681
  • [32] PERFORMANCE OF I-VECTOR SPEAKER VERIFICATION AND THE DETECTION OF SYNTHETIC SPEECH
    McClanahan, Richard D.
    Stewart, Bryan
    De Leon, Phillip L.
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [33] SPEAKER VERIFICATION USING SIMPLIFIED AND SUPERVISED I-VECTOR MODELING
    Li, Ming
    Tsiartas, Andreas
    Van Segbroeck, Maarten
    Narayanan, Shrikanth S.
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7199 - 7203
  • [34] Minimax i-vector extractor for short duration speaker verification
    Hautamaki, Ville
    Cheng, You-Chi
    Rajan, Padmanabhan
    Lee, Chin-Hui
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3675 - 3679
  • [35] Non-linear PLDA for i-Vector Speaker Verification
    Novoselov, Sergey
    Pekhovsky, Timur
    Kudashev, Oleg
    Mendelev, Valentin
    Prudnikov, Alexey
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 214 - 218
  • [36] Bayesian Distance Metric Learning on i-vector for Speaker Verification
    Fang, Xiao
    Dehak, Najim
    Glass, James
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2513 - 2517
  • [37] GMM and i-vector based speaker verification using speaker-specific-text for short utterances
    Bharathi, B.
    Nagarajan, T.
    2013 IEEE INTERNATIONAL CONFERENCE OF IEEE REGION 10 (TENCON), 2013,
  • [38] Using the conformal embedding analysis to compensate the channel effect in the i-vector based speaker verification system
    Boulkenafet, Z.
    Bengherabi, M.
    Nouali, O.
    Cheriet, M.
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE OF THE BIOMETRICS SPECIAL INTEREST GROUP (BIOSIG 2013), 2013,
  • [39] I-vector Based Speaker Gender Recognition
    Wang, Minghe
    Chen, Ying
    Tang, Zhenmin
    Zhang, Erhua
    2015 IEEE ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC), 2015, : 729 - 732
  • [40] Data selection for i-vector based automatic speaker verification anti-spoofing
    Hanilci, Cemal
    DIGITAL SIGNAL PROCESSING, 2018, 72 : 171 - 180