Duration Dependent Covariance Regularization in PLDA Modeling for Speaker Verification

被引:0
|
作者
Cai, Weicheng [2 ,3 ]
Li, Ming [1 ,2 ]
Li, Lin [4 ]
Hong, Qingyang [4 ]
机构
[1] Sun Yat Sen Univ, SYSU CMU Joint Inst Engn, Guangzhou, Guangdong, Peoples R China
[2] SYSU CMU Shunde Int Joint Res Inst, Shunde, Guangdong, Peoples R China
[3] Sun Yat Sen Univ, Sch Informat Sci & Technol, Guangzhou, Guangdong, Peoples R China
[4] Xiamen Univ, Sch Informat Sci & Technol, Xiamen, Peoples R China
关键词
PLDA; covariance regularization; i-vector; speaker verification; duration; ROBUST;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we present a covariance regularized probabilistic linear discriminant analysis (CR-PLDA) model for text independent speaker verification. In the conventional simplified PLDA modeling, the covariance matrix used to capture the residual energies is globally shared for all i-vectors. However, we believe that the point estimated i-vectors from longer speech utterances may be more accurate and their corresponding co-variances in the PLDA modeling should be smaller. Similar to the inverse 0th order statistics weighted covariance in the i-vector model training, we propose a duration dependent normalized exponential term containing the duration normalizing factor mu and duration extent factor v to regularize the covariance in the PLDA modeling. Experimental results are reported on the NIST SRE 2010 common condition 5 female part task and the NIST 2014 i-vector machine learning challenge, respectively. For both tasks, the proposed covariance regularized PLDA system outperforms the baseline PLDA system by more than 13% relatively in terms of equal error rate (EER) and norm minDCF values.
引用
收藏
页码:1027 / 1031
页数:5
相关论文
共 50 条
  • [21] FULL-COVARIANCE UBM AND HEAVY-TAILED PLDA IN I-VECTOR SPEAKER VERIFICATION
    Matejka, Pavel
    Glembek, Ondrej
    Castaldo, Fabio
    Alam, M. J.
    Plchot, Oldrich
    Kenny, Patrick
    Burget, Lukas
    Cernocky, Jan 'Honza'
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4828 - 4831
  • [22] Fast Scoring for Mixture of PLDA in I-Vector/PLDA Speaker Verification
    Mak, Man-Wai
    2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015, : 587 - 593
  • [23] Speaker Verification using Lasso based Sparse Total Variability Supervector with PLDA modeling
    Li, Ming
    Lu, Charley
    Wang, Anne
    Narayanan, Shrikanth
    2012 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2012,
  • [24] Speaker-Phrase-Specific Adaptation of PLDA Model for Improved Performance in Text-Dependent Speaker Verification
    Mohammad Azharuddin Laskar
    Chuya China Bhanja
    Rabul Hussain Laskar
    Circuits, Systems, and Signal Processing, 2021, 40 : 5127 - 5151
  • [25] Speaker-Phrase-Specific Adaptation of PLDA Model for Improved Performance in Text-Dependent Speaker Verification
    Laskar, Mohammad Azharuddin
    Bhanja, Chuya China
    Laskar, Rabul Hussain
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2021, 40 (10) : 5127 - 5151
  • [26] Subspace-constrained Supervector PLDA for Speaker Verification
    Garcia-Romero, Daniel
    McCree, Alan
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2478 - 2482
  • [27] Transfer learning for PLDA-based speaker verification
    Hong, Qingyang
    Li, Lin
    Zhang, Jun
    Wan, Lihong
    Guo, Huiyang
    SPEECH COMMUNICATION, 2017, 92 : 90 - 99
  • [28] DISCRIMINATIVE MULTI-DOMAIN PLDA FOR SPEAKER VERIFICATION
    Sholokhov, Alexey
    Kinnunen, Tomi
    Cumani, Sandro
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5030 - 5034
  • [29] DIFFUSION MAPS FOR PLDA-BASED SPEAKER VERIFICATION
    Barkan, Oren
    Aronowitz, Hagai
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7639 - 7643
  • [30] IMPROVING PLDA SPEAKER VERIFICATION WITH LIMITED DEVELOPMENT DATA
    Kanagasundaram, Ahilan
    Dean, David
    Sridharan, Sridha
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,