Unsupervised intra-speaker variability compensation based on Gestalt and model adaptation in speaker verification with telephone speech

被引:2
|
作者
Yoma, Nestor Becerra [1 ]
Garreton, Claudio [1 ]
Molina, Carlos [1 ]
Huenupan, Fernando [1 ]
机构
[1] Univ Chile, Speech Proc & Transmiss Lab, Dept Elect Engn, Santiago, Chile
关键词
Text-dependent speaker verification; Feature compensation; Intra-speaker variability; Unsupervised model adaptation; Gestalt; Telephone speech; Limited enrolling data; Noise robustness; Speaker verification database in Spanish;
D O I
10.1016/j.specom.2007.11.005
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, an unsupervised intra-speaker variability compensation (ISVC) method based oil Gestalt is proposed to address the problem of limited enrolling data and noise robustness in text-dependent speaker verification (SV). Experiments with two databases show that: ISVC can lead to reductions in EER as high as 20% or 40% and ISCV provides reductions in the integral below the ROC curve between 30%, and 60%. Also, the observed improvements are independent of the number of enrolling utterances. In contrast to model adaptation methods, ISVC is memoryless with respect to previous verification attempts. As shown here, unsupervised model adaptation can lead to substantial improvements in EER but is highly dependent oil the sequence of client/impostor verification events. In adverse scenarios, such its massive impostor attacks and verification from alternated telephone line, unsupervised model adaptation might even provide reductions in verification accuracy when compared with the baseline system. In those cases, ISVC can even outperform adaptation schemes. It is worth emphasizing that ISVC and unsupervised model adaptation are compatible and the combination of both methods always improves the performance of model adaptation. The combination of both schemes can lead to improvements in EER its high its 34%. Due to the restrictions of commercially available databases for text-dependent SV research, the results presented here are based oil local databases in Spanish. By doing so, the visibility of research in Iberian Languages is highlighted. (C) 2007 Elsevier B.V. All rights reserved.
引用
收藏
页码:953 / 964
页数:12
相关论文
共 50 条
  • [31] Variable pronunciations reveal dynamic intra-speaker variation in speech planning
    Kilbourn-Ceron, Oriana
    Goldrick, Matthew
    PSYCHONOMIC BULLETIN & REVIEW, 2021, 28 (04) : 1365 - 1380
  • [32] Score-level Compensation of Extreme Speech Duration Variability in Speaker Verification
    Perez-Gomez, Sergio
    Ramos, Daniel
    Gonzalez-Dominguez, Javier
    Gonzalez-Rodriguez, Joaquin
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 374 - 377
  • [33] An Unsupervised Domain Adaptation Method Based on Distribution Alignment for Speaker Verification
    Gu, Qing
    Song, Yan
    Guo, Wu
    Ye, Zhongfu
    Dai, Lirong
    MAN-MACHINE SPEECH COMMUNICATION, NCMMSC 2024, 2025, 2312 : 359 - 369
  • [34] N-Best-based unsupervised speaker adaptation for speech recognition
    Matsui, T
    Furui, S
    COMPUTER SPEECH AND LANGUAGE, 1998, 12 (01): : 41 - 50
  • [35] Multimodal speech synthesis architecture for unsupervised speaker adaptation
    Hieu-Thi Luong
    Yamagishi, Junichi
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2494 - 2498
  • [36] Sequential Model Adaptation for Speaker Verification
    Wang, Jun
    Wang, Dong
    Wu, Xiaojun
    Zheng, Thomas Fang
    Tejedor, Javier
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2459 - 2463
  • [37] Model adaptation methods for speaker verification
    Mistretta, W
    Farrell, K
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 113 - 116
  • [38] Unsupervised Speaker Adaptation Using Speaker-Class Models for Lecture Speech Recognition
    Kosaka, Tetsuo
    Takeda, Yuui
    Ito, Takashi
    Kato, Masaharu
    Kohda, Masaki
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2010, E93D (09): : 2363 - 2369
  • [39] Generalized Variability Model for Speaker Verification
    Ma, Jianbo
    Sethu, Vidhyasaharan
    Ambikairajah, Eliathamby
    Lee, Kong Aik
    IEEE SIGNAL PROCESSING LETTERS, 2018, 25 (12) : 1775 - 1779
  • [40] Shouted and whispered speech compensation for speaker verification systems
    Prieto, Santi
    Ortega, Alfonso
    Lopez-Espejo, Ivan
    Lleida, Eduardo
    DIGITAL SIGNAL PROCESSING, 2022, 127