Duration Dependent Covariance Regularization in PLDA Modeling for Speaker Verification

Cited by: 0

Authors:

Cai, Weicheng [2,3]

Li, Ming [1,2]

Li, Lin [4]

Hong, Qingyang [4]

Affiliations:

[1] Sun Yat Sen Univ, SYSU CMU Joint Inst Engn, Guangzhou, Guangdong, Peoples R China

[2] SYSU CMU Shunde Int Joint Res Inst, Shunde, Guangdong, Peoples R China

[3] Sun Yat Sen Univ, Sch Informat Sci & Technol, Guangzhou, Guangdong, Peoples R China

[4] Xiamen Univ, Sch Informat Sci & Technol, Xiamen, Peoples R China

Source:

16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5 | 2015

Keywords:

PLDA; covariance regularization; i-vector; speaker verification; duration; ROBUST;

DOI:

Not available

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification code:

070206 ; 082403 ;

Abstract:

In this paper, we present a covariance regularized probabilistic linear discriminant analysis (CR-PLDA) model for text-independent speaker verification. In conventional simplified PLDA modeling, the covariance matrix used to capture the residual energies is shared globally across all i-vectors. However, we believe that i-vectors point-estimated from longer speech utterances may be more accurate, so their corresponding covariances in the PLDA model should be smaller. Similar to the inverse 0th-order statistics weighted covariance in i-vector model training, we propose a duration-dependent normalized exponential term, containing the duration normalizing factor mu and the duration extent factor v, to regularize the covariance in the PLDA model. Experimental results are reported on the NIST SRE 2010 common condition 5 female part task and on the NIST 2014 i-vector machine learning challenge. On both tasks, the proposed covariance regularized PLDA system outperforms the baseline PLDA system by more than 13% relative in terms of equal error rate (EER) and norm minDCF.
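The abstract does not give the regularization formula, so the following is only a minimal sketch of the general idea: the shared residual covariance is scaled by a factor that shrinks as utterance duration grows. The power-law form `(mu / duration) ** v`, the function names, and the toy values are all assumptions made for illustration; they are not taken from the paper and may differ from the authors' actual term.

```python
def duration_scale(duration, mu, v):
    """Hypothetical scale factor (an assumption, not the paper's formula).

    Equals 1.0 when duration == mu, is below 1.0 for longer utterances,
    and above 1.0 for shorter ones (for v > 0).
    """
    if duration <= 0 or mu <= 0:
        raise ValueError("duration and mu must be positive")
    return (mu / duration) ** v


def regularize_covariance(sigma, duration, mu, v):
    """Return a scaled copy of the covariance matrix `sigma` (list of rows)."""
    scale = duration_scale(duration, mu, v)
    return [[scale * entry for entry in row] for row in sigma]


# Toy 2x2 residual covariance, standing in for the matrix that a
# conventional simplified PLDA shares across all i-vectors.
shared_sigma = [[1.0, 0.2], [0.2, 1.0]]

# Illustrative durations (arbitrary units) with mu = 40 and v = 0.5.
short_sigma = regularize_covariance(shared_sigma, duration=10.0, mu=40.0, v=0.5)
long_sigma = regularize_covariance(shared_sigma, duration=160.0, mu=40.0, v=0.5)
```

Under these assumed values the short utterance receives a larger covariance than the long one, which matches the abstract's intuition that i-vectors from longer utterances are more reliable. In this sketch, mu sets the duration at which the shared covariance is left unchanged and v controls how strongly duration affects it.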

Pages: 1027-1031

Number of pages: 5
