CONSTRAINED DISCRIMINATIVE PLDA TRAINING FOR SPEAKER VERIFICATION

被引：0

作者：

Rohdin, Johan ^{[1
]}

Biswas, Sangeeta ^{[1
]}

Shinoda, Koichi ^{[1
]}

机构：

[1] Tokyo Inst Technol, Dept Comp Sci, Tokyo 152, Japan

来源：

2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2014年

关键词：

PLDA; discriminative training; speaker verification; i-vector;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Many studies have proven the effectiveness of discriminative training for speaker verification based on probabilistic linear discriminative analysis (PLDA) with i-vectors as features. Most of them directly optimize the log-likelihood ratio score function of the PLDA model instead of explicitly train the PLDA model. But this optimization process removes some of the constraints that normally are imposed on the PLDA log likelihood ratio score function. This may deteriorate the verification performance when the amount of training data is limited. In this paper, we first show two constraints which the score function should follow, and then we propose a new constrained discriminative training algorithm which keeps these constraints. Our experiments show that our method obtained significant improvements in the verification performance in the male trials of the telephone speaker verification tasks of NIST SRE08 and SRE10.

引用

页数：5

共 50 条

[31] CHANNEL ADAPTATION OF PLDA FOR TEXT-INDEPENDENT SPEAKER VERIFICATION
Chen, Liping
Lee, Kong Aik
Ma, Bin
Guo, Wu
Li, Haizhou
Dai, Li Rong
2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5251 - 5255
[32] Unifying Cosine and PLDA Back-ends for Speaker Verification
Peng, Zhiyuan
He, Xuanji
Ding, Ke
Lee, Tan
Wan, Guanglu
INTERSPEECH 2022, 2022, : 336 - 340
[33] Neural PLDA Modeling for End-to-End Speaker Verification
Ramoji, Shreyas
Krishnan, Prashant
Ganapathy, Sriram
INTERSPEECH 2020, 2020, : 4333 - 4337
[34] Duration Dependent Covariance Regularization in PLDA Modeling for Speaker Verification
Cai, Weicheng
Li, Ming
Li, Lin
Hong, Qingyang
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1027 - 1031
[35] A TRANSFER LEARNING METHOD FOR PLDA-BASED SPEAKER VERIFICATION
Hong, Qingyang
Zhang, Jun
Li, Lin
Wan, Lihong
Tong, Feng
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5455 - 5459
[36] An Iterative Framework for Unsupervised Learning in the PLDA based Speaker Verification
Liu, Wenbo
Yu, Zhiding
Li, Ming
2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 78 - +
[37] DNN-Driven Mixture of PLDA for Robust Speaker Verification
Li, Na
Mak, Man-Wai
Chien, Jen-Tzung
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (06) : 1371 - 1383
[38] Performance Evaluation of Mixtures of PLDA and Conventional PLDA for a Small-Set Speaker Verification System
Wan, Qianhui
Bouchard, Martin
2017 IEEE 30TH CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE), 2017,
[39] Maximum Model Distance Discriminative Training for Text-Independent Speaker Verification
Hong, Q. Y.
Kwong, S.
IECON 2004: 30TH ANNUAL CONFERENCE OF IEEE INDUSTRIAL ELECTRONICS SOCIETY, VOL 2, 2004, : 1769 - 1774
[40] Discriminative training for speaker identification
Hong, QY
Kwong, S
ELECTRONICS LETTERS, 2004, 40 (04) : 280 - 281

← 1 2 3 4 5 →