Duration Dependent Covariance Regularization in PLDA Modeling for Speaker Verification

被引:0
|
作者
Cai, Weicheng [2 ,3 ]
Li, Ming [1 ,2 ]
Li, Lin [4 ]
Hong, Qingyang [4 ]
机构
[1] Sun Yat Sen Univ, SYSU CMU Joint Inst Engn, Guangzhou, Guangdong, Peoples R China
[2] SYSU CMU Shunde Int Joint Res Inst, Shunde, Guangdong, Peoples R China
[3] Sun Yat Sen Univ, Sch Informat Sci & Technol, Guangzhou, Guangdong, Peoples R China
[4] Xiamen Univ, Sch Informat Sci & Technol, Xiamen, Peoples R China
关键词
PLDA; covariance regularization; i-vector; speaker verification; duration; ROBUST;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we present a covariance regularized probabilistic linear discriminant analysis (CR-PLDA) model for text independent speaker verification. In the conventional simplified PLDA modeling, the covariance matrix used to capture the residual energies is globally shared for all i-vectors. However, we believe that the point estimated i-vectors from longer speech utterances may be more accurate and their corresponding co-variances in the PLDA modeling should be smaller. Similar to the inverse 0th order statistics weighted covariance in the i-vector model training, we propose a duration dependent normalized exponential term containing the duration normalizing factor mu and duration extent factor v to regularize the covariance in the PLDA modeling. Experimental results are reported on the NIST SRE 2010 common condition 5 female part task and the NIST 2014 i-vector machine learning challenge, respectively. For both tasks, the proposed covariance regularized PLDA system outperforms the baseline PLDA system by more than 13% relatively in terms of equal error rate (EER) and norm minDCF values.
引用
收藏
页码:1027 / 1031
页数:5
相关论文
共 50 条
  • [41] A TRANSFER LEARNING METHOD FOR PLDA-BASED SPEAKER VERIFICATION
    Hong, Qingyang
    Zhang, Jun
    Li, Lin
    Wan, Lihong
    Tong, Feng
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5455 - 5459
  • [42] An Iterative Framework for Unsupervised Learning in the PLDA based Speaker Verification
    Liu, Wenbo
    Yu, Zhiding
    Li, Ming
    2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 78 - +
  • [43] MULTI-OBJECTIVE OPTIMIZATION TRAINING OF PLDA FOR SPEAKER VERIFICATION
    He, Liang
    Chen, Xianhong
    Xu, Can
    Liu, Jia
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6026 - 6030
  • [44] DNN-Driven Mixture of PLDA for Robust Speaker Verification
    Li, Na
    Mak, Man-Wai
    Chien, Jen-Tzung
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (06) : 1371 - 1383
  • [45] Performance Evaluation of Mixtures of PLDA and Conventional PLDA for a Small-Set Speaker Verification System
    Wan, Qianhui
    Bouchard, Martin
    2017 IEEE 30TH CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE), 2017,
  • [46] Noise robust speaker verification via the fusion of SNR-independent and SNR-dependent PLDA
    Pang, Xiaomin
    Mak, Man-Wai
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2015, 18 (04) : 633 - 648
  • [47] Regularized Within-Class Precision Matrix Based PLDA in Text-Dependent Speaker Verification
    Yoon, Sung-Hyun
    Jeon, Jong-June
    Yu, Ha-Jin
    APPLIED SCIENCES-BASEL, 2020, 10 (18):
  • [48] Autonomous selection of i-vectors for PLDA modelling in speaker verification
    Biswas, Sangeeta
    Rohdin, Johan
    Shinoda, Koichi
    SPEECH COMMUNICATION, 2015, 72 : 32 - 46
  • [49] PLDA using Gaussian Restricted Boltzmann Machines with application to Speaker Verification
    Stafylakis, Themos
    Kenny, Patrick
    Senoussaoui, Mohammed
    Dumouchel, Pierre
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1690 - 1693
  • [50] Analysis of the Influence of Speech Corpora in the PLDA Verification in the Task of Speaker Recognition
    Machlica, Lukas
    Zajic, Zbynek
    TEXT, SPEECH AND DIALOGUE, TSD 2012, 2012, 7499 : 464 - 471