A Large-Scale Open-Source Acoustic Simulator for Speaker Recognition

Cited by: 16
Authors
Ferras, Marc [1]
Madikeri, Srikanth [1]
Motlicek, Petr [1]
Dey, Subhadeep [1]
Bourlard, Herve [1]
Affiliation
[1] Idiap Res Inst, CH-1920 Martigny, Switzerland
Keywords
Codec; degraded speech; noise; robustness; simulation; speaker recognition
DOI
10.1109/LSP.2016.2537844
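Chinese Library Classification
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]
Discipline classification codes
0808; 0809
Abstract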
State-of-the-art speaker-recognition systems suffer significant performance loss under degraded speech conditions and under acoustic mismatch between the enrolment and test phases. Past international evaluation campaigns, such as the NIST speaker recognition evaluation (SRE), have partly addressed these challenges in some evaluation conditions. This work aims at further assessing and compensating for the effect of a wide variety of speech-degradation processes on speaker-recognition performance. We present an open-source simulator that generates degraded telephone, VoIP, and interview-speech recordings using a comprehensive list of narrow-band, wide-band, and audio codecs, together with a database of over 60 h of environmental noise recordings and over 100 impulse responses collected from publicly available data. We provide speaker-verification results obtained with an i-vector-based system using either a clean or a degraded PLDA back-end on a subset of NIST SRE data corrupted by the proposed simulator. While error rates increase considerably under degraded speech conditions, large relative equal error rate (EER) reductions were observed when using a PLDA model trained with a large number of degraded sessions per speaker.
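The kind of degradation the abstract describes (impulse responses plus environmental noise) can be illustrated with a minimal sketch. This is not the simulator's actual code: the function names (`convolve`, `mix_at_snr`, `degrade`), the order of the two stages, and the use of a target signal-to-noise ratio are all assumptions made for illustration, and the codec stage is omitted entirely.

```python
import math


def convolve(signal, impulse_response):
    """Linear convolution, truncated to the length of the input signal."""
    out = []
    for n in range(len(signal)):
        acc = 0.0
        for k, h in enumerate(impulse_response):
            if k > n:
                break
            acc += h * signal[n - k]
        out.append(acc)
    return out


def power(samples):
    """Mean squared amplitude of a sample sequence."""
    return sum(v * v for v in samples) / len(samples)


def mix_at_snr(speech, noise, snr_db):
    """Add noise to speech, scaled so the speech-to-noise power ratio is snr_db."""
    # Loop the noise recording if it is shorter than the speech.
    noise = [noise[i % len(noise)] for i in range(len(speech))]
    noise_power = power(noise)
    if noise_power == 0.0:
        raise ValueError("noise recording is silent")
    gain = math.sqrt(power(speech) / (noise_power * 10 ** (snr_db / 10)))
    return [s + gain * v for s, v in zip(speech, noise)]


def degrade(speech, impulse_response, noise, snr_db):
    """Hypothetical pipeline: reverberate first, then add noise at snr_db."""
    reverberant = convolve(speech, impulse_response)
    return mix_at_snr(reverberant, noise, snr_db)
```

A caller would pass sample lists for the speech, one impulse response, and one noise recording, for example `degrade(speech, ir, noise, snr_db=10.0)`. Pure-Python lists keep the sketch dependency-free; an array library would be the usual choice for real recordings.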
Pages: 527-531
Page count: 5