Compensation of Intrinsic Variability with Factor Analysis Modeling for Robust Speaker Verification

Cited by: 0
Authors
Chen, Sheng [1 ]
Xu, Mingxing [1 ]
Affiliations
[1] Tsinghua Univ, Dept Comp Sci & Technol, Tsinghua Natl Lab Informat Sci & Technol, Key Lab Pervas Comp,Minist Educ, Beijing 100084, Peoples R China
Keywords
speaker verification; intrinsic variability; joint factor analysis; i-vector; LDA; WCCN; NAP
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The performance of speaker verification systems is adversely affected by intrinsic variability in real-world applications. In this paper, two factor analysis approaches, Joint Factor Analysis (JFA) and i-vector modeling, are used to address the effects of intrinsic variation for robust speaker verification. In the JFA approach, speaker variability and intrinsic variability are modeled with the speaker factors and session factors, respectively. In the i-vector framework, a low-dimensional space is defined to model the total variability, and intrinsic variation is compensated with a variety of techniques, including Linear Discriminant Analysis (LDA), Within-Class Covariance Normalization (WCCN) and Nuisance Attribute Projection (NAP). Experiments on an intrinsic variation corpus show that the JFA and i-vector approaches model intrinsic variability much better than the GMM-UBM paradigm. Compared with the GMM-UBM baseline system, relative reductions in Equal Error Rate (EER) of around 39.85% and 36.76% are obtained for the JFA and i-vector+LDA+WCCN speaker verification systems, respectively.
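The abstract reports its gains as relative reductions in EER. As a minimal sketch of how such a figure is usually derived (not the authors' evaluation code), the Python below estimates the EER from target and non-target trial scores and then computes a relative reduction against a baseline. The function names, the toy scores and the EER values 10.0 and 6.0 are hypothetical; the record does not give the absolute EERs of any system.

```python
def equal_error_rate(target_scores, nontarget_scores):
    """Approximate the EER: the operating point where the false acceptance
    rate (FAR) and false rejection rate (FRR) are closest to each other."""
    thresholds = sorted(set(target_scores) | set(nontarget_scores))
    best_gap, best_eer = None, None
    for t in thresholds:
        # A trial is accepted when its score is at or above the threshold.
        far = sum(s >= t for s in nontarget_scores) / len(nontarget_scores)
        frr = sum(s < t for s in target_scores) / len(target_scores)
        gap = abs(far - frr)
        if best_gap is None or gap < best_gap:
            best_gap, best_eer = gap, (far + frr) / 2
    return best_eer


def relative_reduction(baseline_eer, system_eer):
    """Relative EER reduction in percent, measured against the baseline."""
    return 100.0 * (baseline_eer - system_eer) / baseline_eer


# Hypothetical toy scores, for illustration only.
targets = [0.9, 0.8, 0.7, 0.4]
nontargets = [0.6, 0.3, 0.2, 0.1]
toy_eer = equal_error_rate(targets, nontargets)

# Hypothetical EERs in percent; they are not taken from the paper.
toy_reduction = relative_reduction(10.0, 6.0)
```

With real evaluations the threshold sweep is normally run over many thousands of trials, and the EER is often interpolated between neighbouring thresholds rather than taken at the closest discrete point as done here.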
Pages: 1574-1577
Number of pages: 4
Related papers
50 in total
  • [12] Within-Session Variability Modelling for Factor Analysis Speaker Verification
    Vogt, Robbie
    Pelecanos, Jason
    Scheffer, Nicolas
    Kajarekar, Sachin
    Sridharan, Sridha
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1531 - +
  • [13] Intrinsic Variation Robust Speaker Verification based on Sparse Representation
    Nie, Yi
    Xu, Mingxing
    Xianyu, Haishu
    2014 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2014,
  • [14] Cross-domain variation compensation for robust speaker verification
    Huang, Houjun
    Zhou, Ruohua
    Yan, Yonghong
    ELECTRONICS LETTERS, 2015, 51 (21) : 1706 - 1707
  • [15] Psychoacoustic Model Compensation for Robust Speaker Verification in Environmental Noise
    Panda, Ashish
    Srikanthan, Thambipillai
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (03): : 945 - 953
  • [16] Restoring the Residual Speaker Information in Total Variability Modeling for Speaker Verification
    Zhang, Ce
    Zheng, Rong
    Xu, Bo
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 132 - 135
  • [17] Maximum Likelihood Acoustic Factor Analysis Models for Robust Speaker Verification in Noise
    Hasan, Taufiq
    Hansen, John H. L.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (02) : 381 - 391
  • [18] MODELLING SPEAKER AND CHANNEL VARIABILITY USING DEEP NEURAL NETWORKS FOR ROBUST SPEAKER VERIFICATION
    Bhattacharya, Gautam
    Alam, Jahangir
    Kenny, Patrick
    Gupta, Vishwa
    2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016), 2016, : 192 - 198
  • [19] Factor Analysis Multi-Session Training Constraint in Session Compensation for Speaker Verification
    Matrouf, Driss
    Bonastre, Jean-Francois
    Mezaache, Salah Eddine
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1421 - 1424
  • [20] Session variability subspace projection based model compensation for speaker verification
    Deng, Jing
    Zheng, Thomas Fang
    Wu, Wenhu
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 57 - +