CONSTRAINED DISCRIMINATIVE PLDA TRAINING FOR SPEAKER VERIFICATION

被引：0

作者：

Rohdin, Johan ^{[1
]}

Biswas, Sangeeta ^{[1
]}

Shinoda, Koichi ^{[1
]}

机构：

[1] Tokyo Inst Technol, Dept Comp Sci, Tokyo 152, Japan

来源：

2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2014年

关键词：

PLDA; discriminative training; speaker verification; i-vector;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Many studies have proven the effectiveness of discriminative training for speaker verification based on probabilistic linear discriminative analysis (PLDA) with i-vectors as features. Most of them directly optimize the log-likelihood ratio score function of the PLDA model instead of explicitly train the PLDA model. But this optimization process removes some of the constraints that normally are imposed on the PLDA log likelihood ratio score function. This may deteriorate the verification performance when the amount of training data is limited. In this paper, we first show two constraints which the score function should follow, and then we propose a new constrained discriminative training algorithm which keeps these constraints. Our experiments show that our method obtained significant improvements in the verification performance in the male trials of the telephone speaker verification tasks of NIST SRE08 and SRE10.

引用

页数：5

共 50 条

[41] Autonomous selection of i-vectors for PLDA modelling in speaker verification
Biswas, Sangeeta
Rohdin, Johan
Shinoda, Koichi
SPEECH COMMUNICATION, 2015, 72 : 32 - 46
[42] PLDA Modeling in I-Vector and Supervector Space for Speaker Verification
Jiang, Ye
Lee, Kong Aik
Tang, Zhenmin
Ma, Bin
Larcher, Anthony
Li, Haizhou
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1678 - 1681
[43] PLDA using Gaussian Restricted Boltzmann Machines with application to Speaker Verification
Stafylakis, Themos
Kenny, Patrick
Senoussaoui, Mohammed
Dumouchel, Pierre
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1690 - 1693
[44] Analysis of the Influence of Speech Corpora in the PLDA Verification in the Task of Speaker Recognition
Machlica, Lukas
Zajic, Zbynek
TEXT, SPEECH AND DIALOGUE, TSD 2012, 2012, 7499 : 464 - 471
[45] Scoring of Large-Margin Embeddings for Speaker Verification: Cosine or PLDA?
Wang, Qiongqiong
Lee, Kong Aik
Liu, Tianchi
INTERSPEECH 2022, 2022, : 600 - 604
[46] Sparse kernel machines with empirical kernel maps for PLDA speaker verification
Rao, Wei
Mak, Man-Wai
COMPUTER SPEECH AND LANGUAGE, 2016, 38 : 104 - 121
[47] Supervized Mixture of PLDA Models for Cross-Channel Speaker Verification
Simonchik, Konstantin
Pekhovsky, Timur
Shulipa, Andrey
Afanasyev, Anton
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1682 - 1685
[48] Short Utterance Variance Modelling and Utterance Partitioning for PLDA Speaker Verification
Kanagasundaram, Ahilan
Dean, David
Sridharan, Sridha
Fookes, Clinton
Himawan, Ivan
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 1835 - 1838
[49] Non-linear PLDA for i-Vector Speaker Verification
Novoselov, Sergey
Pekhovsky, Timur
Kudashev, Oleg
Mendelev, Valentin
Prudnikov, Alexey
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 214 - 218
[50] Improving the PLDA based Speaker Verification in Limited Microphone Data Conditions
Kanagasundaram, A.
Dean, D.
Gonzalez-Dominguez, J.
Sridharan, S.
Ramos, D.
Gonzalez-Rodriguez, J.
14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3641 - 3645

← 1 2 3 4 5 →