CONSTRAINED DISCRIMINATIVE PLDA TRAINING FOR SPEAKER VERIFICATION

Cited by: 0
Authors
Rohdin, Johan [1]
Biswas, Sangeeta [1]
Shinoda, Koichi [1]
Affiliations
[1] Tokyo Inst Technol, Dept Comp Sci, Tokyo 152, Japan
Keywords
PLDA; discriminative training; speaker verification; i-vector
DOI
Not available
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Many studies have demonstrated the effectiveness of discriminative training for speaker verification based on probabilistic linear discriminant analysis (PLDA) with i-vectors as features. Most of them directly optimize the log-likelihood ratio score function of the PLDA model instead of explicitly training the PLDA model. However, this optimization removes some of the constraints that are normally imposed on the PLDA log-likelihood ratio score function, which may degrade verification performance when the amount of training data is limited. In this paper, we first present two constraints that the score function should satisfy, and then propose a new constrained discriminative training algorithm that preserves these constraints. In our experiments, the method yielded significant improvements in verification performance on the male trials of the telephone speaker verification tasks of NIST SRE08 and SRE10.
Pages: 5
Related papers
50 in total
  • [41] Autonomous selection of i-vectors for PLDA modelling in speaker verification
    Biswas, Sangeeta
    Rohdin, Johan
    Shinoda, Koichi
    SPEECH COMMUNICATION, 2015, 72 : 32 - 46
  • [42] PLDA Modeling in I-Vector and Supervector Space for Speaker Verification
    Jiang, Ye
    Lee, Kong Aik
    Tang, Zhenmin
    Ma, Bin
    Larcher, Anthony
    Li, Haizhou
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1678 - 1681
  • [43] PLDA using Gaussian Restricted Boltzmann Machines with application to Speaker Verification
    Stafylakis, Themos
    Kenny, Patrick
    Senoussaoui, Mohammed
    Dumouchel, Pierre
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1690 - 1693
  • [44] Analysis of the Influence of Speech Corpora in the PLDA Verification in the Task of Speaker Recognition
    Machlica, Lukas
    Zajic, Zbynek
    TEXT, SPEECH AND DIALOGUE, TSD 2012, 2012, 7499 : 464 - 471
  • [45] Scoring of Large-Margin Embeddings for Speaker Verification: Cosine or PLDA?
    Wang, Qiongqiong
    Lee, Kong Aik
    Liu, Tianchi
    INTERSPEECH 2022, 2022, : 600 - 604
  • [46] Sparse kernel machines with empirical kernel maps for PLDA speaker verification
    Rao, Wei
    Mak, Man-Wai
    COMPUTER SPEECH AND LANGUAGE, 2016, 38 : 104 - 121
  • [47] Supervized Mixture of PLDA Models for Cross-Channel Speaker Verification
    Simonchik, Konstantin
    Pekhovsky, Timur
    Shulipa, Andrey
    Afanasyev, Anton
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1682 - 1685
  • [48] Short Utterance Variance Modelling and Utterance Partitioning for PLDA Speaker Verification
    Kanagasundaram, Ahilan
    Dean, David
    Sridharan, Sridha
    Fookes, Clinton
    Himawan, Ivan
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 1835 - 1838
  • [49] Non-linear PLDA for i-Vector Speaker Verification
    Novoselov, Sergey
    Pekhovsky, Timur
    Kudashev, Oleg
    Mendelev, Valentin
    Prudnikov, Alexey
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 214 - 218
  • [50] Improving the PLDA based Speaker Verification in Limited Microphone Data Conditions
    Kanagasundaram, A.
    Dean, D.
    Gonzalez-Dominguez, J.
    Sridharan, S.
    Ramos, D.
    Gonzalez-Rodriguez, J.
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3641 - 3645