A Large-Scale Open-Source Acoustic Simulator for Speaker Recognition

Cited by: 16
Authors
Ferras, Marc [1]
Madikeri, Srikanth [1]
Motlicek, Petr [1]
Dey, Subhadeep [1]
Bourlard, Herve [1]
Affiliations
[1] Idiap Res Inst, CH-1920 Martigny, Switzerland
Keywords
Codec; degraded speech; noise; robustness; simulation; speaker recognition
DOI
10.1109/LSP.2016.2537844
Chinese Library Classification
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification codes
0808; 0809
Abstract
The state-of-the-art speaker-recognition systems suffer from significant performance loss on degraded speech conditions and acoustic mismatch between enrolment and test phases. Past international evaluation campaigns, such as the NIST speaker recognition evaluation (SRE), have partly addressed these challenges in some evaluation conditions. This work aims at further assessing and compensating for the effect of a wide variety of speech-degradation processes on speaker-recognition performance. We present an open-source simulator generating degraded telephone, VoIP, and interview-speech recordings using a comprehensive list of narrow-band, wide-band, and audio codecs, together with a database of over 60 h of environmental noise recordings and over 100 impulse responses collected from publicly available data. We provide speaker-verification results obtained with an i-vector-based system using either a clean or degraded PLDA back-end on a NIST SRE subset of data corrupted by the proposed simulator. While error rates increase considerably under degraded speech conditions, large relative equal error rate (EER) reductions were observed when using a PLDA model trained with a large number of degraded sessions per speaker.
Pages: 527-531
Page count: 5
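
The abstract describes degrading speech with impulse responses and environmental noise before codec processing. The following is a minimal sketch of the first two stages only, not the authors' simulator: it convolves a signal with an impulse response and adds noise at a target signal-to-noise ratio. All function names, the toy sine "speech" signal, the 5-tap impulse response, and the 10 dB target are assumptions chosen for illustration; the codec stage is omitted.

```python
import math
import random


def power(x):
    """Mean power of a sequence of samples."""
    return sum(v * v for v in x) / len(x)


def convolve(signal, impulse_response):
    """Direct-form linear convolution, truncated to len(signal)."""
    out = [0.0] * len(signal)
    for n in range(len(signal)):
        acc = 0.0
        for k, h in enumerate(impulse_response):
            if n - k < 0:
                break
            acc += h * signal[n - k]
        out[n] = acc
    return out


def mix_at_snr(speech, noise, snr_db):
    """Scale noise so the speech-to-noise power ratio equals snr_db, then add it."""
    # Tile the noise so it covers the full speech length.
    tiled = [noise[i % len(noise)] for i in range(len(speech))]
    gain = math.sqrt(power(speech) / (power(tiled) * 10 ** (snr_db / 10)))
    scaled = [gain * v for v in tiled]
    mixed = [s + n for s, n in zip(speech, scaled)]
    return mixed, scaled


def degrade(speech, impulse_response, noise, snr_db):
    """Apply reverberation, then additive noise at the requested SNR."""
    reverberant = convolve(speech, impulse_response)
    noisy, scaled = mix_at_snr(reverberant, noise, snr_db)
    return reverberant, noisy, scaled


# Toy inputs (hypothetical): a 440 Hz tone at 8 kHz, a short decaying
# impulse response, and Gaussian noise standing in for a recording.
random.seed(0)
speech = [math.sin(2 * math.pi * 440 * t / 8000) for t in range(800)]
ir = [1.0, 0.0, 0.5, 0.0, 0.25]
noise = [random.gauss(0.0, 1.0) for _ in range(400)]

reverberant, noisy, scaled = degrade(speech, ir, noise, snr_db=10.0)
achieved_snr_db = 10 * math.log10(power(reverberant) / power(scaled))
```

Here the SNR is defined relative to the reverberant signal, so `achieved_snr_db` matches the requested value up to floating-point error. A real pipeline would read recorded noise and measured impulse responses from files and would typically use FFT-based convolution for long signals.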