Duration Dependent Covariance Regularization in PLDA Modeling for Speaker Verification

Citations: 0
Authors
Cai, Weicheng [2, 3]
Li, Ming [1, 2]
Li, Lin [4]
Hong, Qingyang [4]
Affiliations
[1] Sun Yat Sen Univ, SYSU CMU Joint Inst Engn, Guangzhou, Guangdong, Peoples R China
[2] SYSU CMU Shunde Int Joint Res Inst, Shunde, Guangdong, Peoples R China
[3] Sun Yat Sen Univ, Sch Informat Sci & Technol, Guangzhou, Guangdong, Peoples R China
[4] Xiamen Univ, Sch Informat Sci & Technol, Xiamen, Peoples R China
Keywords
PLDA; covariance regularization; i-vector; speaker verification; duration; ROBUST;
DOI
Not available
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
In this paper, we present a covariance-regularized probabilistic linear discriminant analysis (CR-PLDA) model for text-independent speaker verification. In conventional simplified PLDA modeling, the covariance matrix that captures the residual energies is shared globally across all i-vectors. However, we believe that point-estimated i-vectors from longer speech utterances may be more accurate, so their corresponding covariances in the PLDA model should be smaller. Similar to the inverse zeroth-order-statistics-weighted covariance in i-vector model training, we propose a duration-dependent normalized exponential term, containing the duration normalizing factor mu and the duration extent factor v, to regularize the covariance in PLDA modeling. Experimental results are reported on the female part of NIST SRE 2010 common condition 5 and on the NIST 2014 i-vector machine learning challenge. For both tasks, the proposed covariance-regularized PLDA system outperforms the baseline PLDA system by more than 13% relative in terms of equal error rate (EER) and normalized minDCF values.
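The abstract does not give the exact form of the regularization term, so the following is only an illustrative sketch of the idea of shrinking a shared residual covariance for longer utterances. It assumes a power-law factor (duration / mu) ** (-v); that functional form, the function name regularized_covariance, and the numeric values are hypothetical and are not taken from the paper.

```python
import numpy as np


def regularized_covariance(sigma, duration, mu, v):
    """Scale a shared residual covariance by a duration-dependent factor.

    ASSUMPTION: the factor (duration / mu) ** (-v) is an illustrative guess
    at a "normalized exponential term"; the paper's actual formula may differ.
    """
    if duration <= 0 or mu <= 0:
        raise ValueError("duration and mu must be positive")
    scale = (duration / mu) ** (-v)
    return scale * np.asarray(sigma, dtype=float)


# Hypothetical values: a 3-dimensional identity covariance, mu = 40, v = 0.5.
sigma = np.eye(3)
short_utt = regularized_covariance(sigma, duration=10.0, mu=40.0, v=0.5)
long_utt = regularized_covariance(sigma, duration=160.0, mu=40.0, v=0.5)

# Under this assumed form, the longer utterance gets the smaller covariance.
print(short_utt[0, 0], long_utt[0, 0])
```

Under this assumed form, an utterance whose duration equals mu leaves the shared covariance unchanged, and v controls how strongly duration affects the scaling.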
Pages: 1027-1031
Number of pages: 5
Related papers
50 in total
  • [31] FACTORED COVARIANCE MODELING FOR TEXT-INDEPENDENT SPEAKER VERIFICATION
    Wang, Eryu
    Lee, Kong Aik
    Ma, Bin
    Li, Haizhou
    Guo, Wu
    Dai, Lirong
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4856 - 4859
  • [32] Text-independent speaker verification using covariance modeling
    Zilca, RD
    IEEE SIGNAL PROCESSING LETTERS, 2001, 8 (04) : 97 - 99
  • [33] Twin Model G-PLDA for Duration Mismatch Compensation in Text-Independent Speaker Verification
    Ma, Jianbo
    Sethu, Vidhyasaharan
    Ambikairajah, Eliathamby
    Lee, Kong Aik
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 1853 - 1857
  • [34] Duration and Pronunciation Conditioned Lexical Modeling for Speaker Verification
    Tur, Gokhan
    Shriberg, Elizabeth
    Stolcke, Andreas
    Kajarekar, Sachin
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2664 - 2667
  • [35] Domain mismatch modeling of out-domain i-vectors for PLDA speaker verification
    Rahman, Md Hafizur
    Himawan, Ivan
    Dean, David
    Sridharan, Sridha
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1581 - 1585
  • [36] Turkish Text-Dependent Speaker Verification using i-vector/PLDA Approach
    Hanilci, Cemal
    Celiktas, Havva
    2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [37] Unsupervised Discriminative Training of PLDA for Domain Adaptation in Speaker Verification
    Wang, Qiongqiong
    Koshinaka, Takafumi
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 3727 - 3731
  • [38] CHANNEL ADAPTATION OF PLDA FOR TEXT-INDEPENDENT SPEAKER VERIFICATION
    Chen, Liping
    Lee, Kong Aik
    Ma, Bin
    Guo, Wu
    Li, Haizhou
    Dai, Li Rong
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5251 - 5255
  • [39] Unifying Cosine and PLDA Back-ends for Speaker Verification
    Peng, Zhiyuan
    He, Xuanji
    Ding, Ke
    Lee, Tan
    Wan, Guanglu
    INTERSPEECH 2022, 2022, : 336 - 340
  • [40] Covariance Based Deep Feature for Text-Dependent Speaker Verification
    Wang, Shuai
    Dinkel, Heinrich
    Qian, Yanmin
    Yu, Kai
    INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING, 2018, 11266 : 231 - 242