Robust model for speaker verification against session-dependent utterance variation

Citations: 0
Authors
Matsui, T [1]
Aikawa, K
Affiliations
[1] Inst Stat Math, Tokyo 1068569, Japan
[2] NTT Corp, NTT Commun Sci Labs, Tokyo 1008116, Japan
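Source
Keywords
speaker verification; speaker model; session dependent; utterance variation; handset dependent distortion
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract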
This paper investigates a new method for creating robust speaker models that cope with a speaker's inter-session variation in a continuous HMM-based speaker verification system. The method estimates session-independent parameters by decomposing inter-session variation into two distinct parts: session-dependent and session-independent. The parameters of the speaker models are estimated using the speaker adaptive training algorithm in conjunction with equalization of the session-dependent variation. The resulting models capture session-independent speaker characteristics more reliably than conventional models, and their discriminative power improves accordingly. Moreover, we have made the models more invariant to handset variation in a public switched telephone network (PSTN) by treating session-dependent variation and handset-dependent distortion separately. Text-independent speech data recorded from 20 speakers in seven sessions over 16 months was used to evaluate the new approach. The proposed method yields a 15% relative reduction in error rate. Compared with the widely used cepstral mean normalization, it yields a 24% relative reduction in error rate when the speaker models are recreated using speech data recorded in four or more sessions.
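As an illustration of the baseline mentioned in the abstract, the following is a minimal sketch of textbook cepstral mean normalization and of how a relative error-rate reduction is computed. It is not the authors' method or code: the function names, the toy feature vectors, and the error rates are all hypothetical and chosen only to make the arithmetic visible.

```python
from typing import List


def cepstral_mean_normalize(frames: List[List[float]]) -> List[List[float]]:
    """Subtract the per-utterance mean of each cepstral coefficient.

    A fixed channel (for example, a handset) adds a constant offset in the
    cepstral domain, so removing the mean cancels that offset.
    """
    if not frames:
        return []
    n = len(frames)
    dim = len(frames[0])
    means = [sum(frame[d] for frame in frames) / n for d in range(dim)]
    return [[frame[d] - means[d] for d in range(dim)] for frame in frames]


def relative_error_reduction(baseline: float, proposed: float) -> float:
    """Fraction by which the error rate drops relative to the baseline."""
    return (baseline - proposed) / baseline


# Hypothetical two-frame utterance with two cepstral coefficients per frame.
clean = [[1.0, 2.0], [3.0, 6.0]]
# The same utterance with a hypothetical constant channel offset added.
offset = [0.5, -1.0]
shifted = [[c + o for c, o in zip(frame, offset)] for frame in clean]

normalized_clean = cepstral_mean_normalize(clean)
normalized_shifted = cepstral_mean_normalize(shifted)

# Hypothetical error rates: 10.0% falling to 8.5% is a 15% relative reduction.
reduction = relative_error_reduction(10.0, 8.5)
```

In this sketch both normalized utterances coincide, which is the property that makes cepstral mean normalization a common baseline against channel distortion. It only removes a constant offset per utterance, whereas the abstract describes modeling session-dependent variation and handset-dependent distortion separately.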
Pages: 712-718
Page count: 7