Sparsity Analysis and Compensation for i-Vector Based Speaker Verification

被引：1

作者：

Li, Wei ^{[1
]}

Fu, Tian Fan ^{[2
]}

Zhu, Jie ^{[1
]}

Chen, Ning ^{[3
]}

机构：

[1] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China

[2] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn CSE, Shanghai 200240, Peoples R China

[3] East China Univ S&T, Sch Informat Sci & Engn, Shanghai 200237, Peoples R China

来源：

SPEECH AND COMPUTER (SPECOM 2015) | 2015年 / 9319卷

关键词：

Speaker verification; i-vector; Phonetic sparsity; Adapted first order Baum-Welch statistics analysis (AFSA);

D O I：

10.1007/978-3-319-23132-7_47

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Over recent years, i-vector based framework has been proven to provide state-of-art performance in speaker verification. Most of the researches focus on compensating the channel variability of i-vector. In this paper we will give an analysis that in the case that the duration of enrollment or test utterance is limited, i-vector based system may suffer from biased estimation problem. In order to solve this problem, we propose an improved i-vector extraction algorithm which we term Adapted First order Baum-Welch Statistics Analysis (AFSA). This new algorithm suppresses and compensates the deviation of first order Baum-Welch statistics caused by phonetic sparsity and phonetic imbalance. Experiments were performed based on NIST 2008 SRE data sets, Experimental results show that 10%-15% relative improvement is achieved compared to the baseline of traditional i-vector based system.

引用

页码：381 / 388

页数：8

共 50 条

[31] PLDA Modeling in I-Vector and Supervector Space for Speaker Verification
Jiang, Ye
Lee, Kong Aik
Tang, Zhenmin
Ma, Bin
Larcher, Anthony
Li, Haizhou
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1678 - 1681
[32] PERFORMANCE OF I-VECTOR SPEAKER VERIFICATION AND THE DETECTION OF SYNTHETIC SPEECH
McClanahan, Richard D.
Stewart, Bryan
De Leon, Phillip L.
2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
[33] SPEAKER VERIFICATION USING SIMPLIFIED AND SUPERVISED I-VECTOR MODELING
Li, Ming
Tsiartas, Andreas
Van Segbroeck, Maarten
Narayanan, Shrikanth S.
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7199 - 7203
[34] Minimax i-vector extractor for short duration speaker verification
Hautamaki, Ville
Cheng, You-Chi
Rajan, Padmanabhan
Lee, Chin-Hui
14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3675 - 3679
[35] Non-linear PLDA for i-Vector Speaker Verification
Novoselov, Sergey
Pekhovsky, Timur
Kudashev, Oleg
Mendelev, Valentin
Prudnikov, Alexey
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 214 - 218
[36] Bayesian Distance Metric Learning on i-vector for Speaker Verification
Fang, Xiao
Dehak, Najim
Glass, James
14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2513 - 2517
[37] GMM and i-vector based speaker verification using speaker-specific-text for short utterances
Bharathi, B.
Nagarajan, T.
2013 IEEE INTERNATIONAL CONFERENCE OF IEEE REGION 10 (TENCON), 2013,
[38] Using the conformal embedding analysis to compensate the channel effect in the i-vector based speaker verification system
Boulkenafet, Z.
Bengherabi, M.
Nouali, O.
Cheriet, M.
PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE OF THE BIOMETRICS SPECIAL INTEREST GROUP (BIOSIG 2013), 2013,
[39] I-vector Based Speaker Gender Recognition
Wang, Minghe
Chen, Ying
Tang, Zhenmin
Zhang, Erhua
2015 IEEE ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC), 2015, : 729 - 732
[40] Data selection for i-vector based automatic speaker verification anti-spoofing
Hanilci, Cemal
DIGITAL SIGNAL PROCESSING, 2018, 72 : 171 - 180

← 1 2 3 4 5 →