Automatic speaker recognition with crosslanguage speech material

被引:10
|
作者
Kuenzel, Hermann J. [1 ]
机构
[1] Univ Marburg, D-35032 Marburg, Germany
关键词
FORENSIC SPEAKER RECOGNITION; AUTOMATIC SPEAKER RECOGNITION; CROSS-LANGUAGE SPEECH MATERIAL; TRANSMISSION CHANNEL CHARACTERISTICS;
D O I
10.1558/ijsll.v20i1.21
中图分类号
DF [法律]; D9 [法律];
学科分类号
0301 ;
摘要
Automatic systems for forensic speaker recognition (FASR) claim to be largely independent of language based on the fact that feature vectors are composed of acoustic parameters that are derived from the resonance characteristics of vocal tract cavities. Yet a certain 'language gap' may remain which may deteriorate the performance of a system unless properly compensated. This forensic aspect of what may be called cross-language speaker recognition has not yet received due attention. Based on the most common forensic cross-language setting, the aim of this study was to assess the effect of language mismatch on the performance of a standard FASR system and compare its magnitude with the effect of other sources of mismatch on the same voice data. Using the automatic system Batvox 3 in an experiment with 75 bilingual speakers of seven languages and four kinds of transmission channels, it can be shown that, if speaker model and reference population are matched in terms of language, the remaining mismatch between speaker model and test sample can be neglected, since equal error rates (EERs) for same-language or cross-language comparisons are approximately the same, ranging from zero to 5.6%. Transmission of the speech data via landline telephone, GSM and, for part of the corpus, VoIP (using Skype) caused EERs to rise by less than 1% on average.
引用
收藏
页码:21 / 44
页数:24
相关论文
共 50 条
  • [41] Exploiting speech production information for automatic speech and speaker modeling and recognition - possibilities and new opportunities
    Ramanarayanan, Vikram
    Ghosh, Prasanta Kumar
    Lammert, Adam
    Narayanan, Shrikanth S.
    2012 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2012,
  • [42] Continuous speech recognition using an on-line speaker adaptation method based on automatic speaker clustering
    Zhang, W
    Nakagawa, S
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2003, E86D (03) : 464 - 473
  • [43] Evaluation of the usefulness of selected features of the speech signal for automatic speaker recognition systems
    Dobrowolski, Andrzej P.
    Majda, Ewelina
    PRZEGLAD ELEKTROTECHNICZNY, 2011, 87 (10): : 193 - 197
  • [44] AN INTRODUCTION TO SPEECH AND SPEAKER RECOGNITION
    PEACOCKE, RD
    GRAF, DH
    COMPUTER, 1990, 23 (08) : 26 - 33
  • [45] Automatic Speaker Recognition performance with matched and mismatched female bilingual speech data
    Nuttall, Bryony
    Harrison, Philip
    Hughes, Vincent
    INTERSPEECH 2023, 2023, : 601 - 605
  • [46] Detection of GSM speech coding for telephone call classification and automatic speaker recognition
    Dabrowski, Adam
    Drgas, Szymon
    Marciniak, Tomasz
    ICSES 2008 INTERNATIONAL CONFERENCE ON SIGNALS AND ELECTRONIC SYSTEMS, CONFERENCE PROCEEDINGS, 2008, : 415 - 418
  • [47] Speaker-Aware Multi-Task Learning for Automatic Speech Recognition
    Pironkov, Gueorgui
    Dupont, Stephane
    Dutoit, Thierry
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 2900 - 2905
  • [48] BiasHacker: Voice Command Disruption by Exploiting Speaker Biases in Automatic Speech Recognition
    Walker, Payton
    McClaran, Nathan
    Zheng, Zihao
    Saxena, Nitesh
    Gu, Guofei
    PROCEEDINGS OF THE 15TH ACM CONFERENCE ON SECURITY AND PRIVACY IN WIRELESS AND MOBILE NETWORKS (WISEC '22), 2022, : 119 - 124
  • [49] SPEAKER REINFORCEMENT USING TARGET SOURCE EXTRACTION FOR ROBUST AUTOMATIC SPEECH RECOGNITION
    Zorila, Catalin
    Doddipatla, Rama
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6297 - 6301
  • [50] ON COMBINING DNN AND GMM WITH UNSUPERVISED SPEAKER ADAPTATION FOR ROBUST AUTOMATIC SPEECH RECOGNITION
    Liu, Shilin
    Sim, Khe Chai
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,