ASVspoof 2019: Spoofing Countermeasures for the Detection of Synthesized, Converted and Replayed Speech

被引:60
|
作者
Nautsch A. [1 ]
Wang X. [2 ]
Evans N. [1 ]
Kinnunen T.H. [3 ]
Vestman V. [3 ]
Todisco M. [1 ]
Delgado H. [4 ]
Sahidullah M. [5 ]
Yamagishi J. [6 ]
Lee K.A. [7 ]
机构
[1] Digital Security Department, EURECOM (Campus SophiaTech), Biot
[2] Digital Content and Media Sciences Research Division, National Institute of Informatics, Tokyo
[3] School of Computing, University of Eastern Finland (Joensuu Campus), Joensuu
[4] Department of Speech, Nuance Communications, Madrid
[5] Department of Multispeech, Universite de Lorraine, CNRS, Inria, LORIA, Nancy
[6] Yamagishi Laboratory, National Institute of Informatics, Tokyo
[7] Institute for Infocomm Research, A*STAR
基金
芬兰科学院; 日本科学技术振兴机构;
关键词
automatic speaker verification; countermeasures; presentation attack detection; speaker recognition; Spoofing;
D O I
10.1109/TBIOM.2021.3059479
中图分类号
学科分类号
摘要
The ASVspoof initiative was conceived to spearhead research in anti-spoofing for automatic speaker verification (ASV). This paper describes the third in a series of bi-annual challenges: ASVspoof 2019. With the challenge database and protocols being described elsewhere, the focus of this paper is on results and the top performing single and ensemble system submissions from 62 teams, all of which out-perform the two baseline systems, often by a substantial margin. Deeper analyses shows that performance is dominated by specific conditions involving either specific spoofing attacks or specific acoustic environments. While fusion is shown to be particularly effective for the logical access scenario involving speech synthesis and voice conversion attacks, participants largely struggled to apply fusion successfully for the physical access scenario involving simulated replay attacks. This is likely the result of a lack of system complementarity, while oracle fusion experiments show clear potential to improve performance. Furthermore, while results for simulated data are promising, experiments with real replay data show a substantial gap, most likely due to the presence of additive noise in the latter. This finding, among others, leads to a number of ideas for further research and directions for future editions of the ASVspoof challenge. © 2019 IEEE.
引用
收藏
页码:252 / 265
页数:13
相关论文
共 50 条
  • [21] Detecting Converted Speech and Natural Speech for anti-Spoofing Attack in Speaker Recognition
    Wu, Zhizheng
    Chng, Eng Siong
    Li, Haizhou
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1698 - 1701
  • [22] Phase perturbation improves channel robustness for speech spoofing countermeasures
    Zang, Yongyi
    Zhang, You
    Duan, Zhiyao
    INTERSPEECH 2023, 2023, : 3162 - 3166
  • [23] NEURAL MOS PREDICTION FOR SYNTHESIZED SPEECH USING MULTI-TASK LEARNING WITH SPOOFING DETECTION AND SPOOFING TYPE CLASSIFICATION
    Choi, Yeunju
    Jung, Youngmoon
    Kim, Hoirin
    2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT), 2021, : 462 - 469
  • [24] ASVspoof 2021: Towards Spoofed and Deepfake Speech Detection in the Wild
    Liu, Xuechen
    Wang, Xin
    Sahidullah, Md
    Patino, Jose
    Delgado, Hector
    Kinnunen, Tomi
    Todisco, Massimiliano
    Yamagishi, Junichi
    Evans, Nicholas
    Nautsch, Andreas
    Lee, Kong Aik
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 2507 - 2522
  • [25] ASVspoof 2019: Future Horizons in Spoofed and Fake Audio Detection
    Todisco, Massimiliano
    Wang, Xin
    Vestman, Ville
    Sahidullah, Md
    Delgado, Hector
    Nautsch, Andreas
    Yamagishi, Junichi
    Evans, Nicholas
    Kinnunen, Tomi H.
    Lee, Kong Aik
    INTERSPEECH 2019, 2019, : 1008 - 1012
  • [26] Speech spoofing countermeasures based on source voice analysis and machine learning techniques
    Rahmeni, Raoudha
    Ben Aicha, Anis
    Ben Ayed, Yassine
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KES 2019), 2019, 159 : 668 - 675
  • [27] Linear Frequency Residual Cepstral Features for Replay Spoof Detection on ASVSpoof 2019
    Gupta, Priyanka
    Patil, Hemant A.
    2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 349 - 353
  • [28] Human spoofing detection performance on degraded speech
    Terblanche, Camryn
    Harrison, Philip
    Gully, Amelia J.
    INTERSPEECH 2021, 2021, : 1738 - 1742
  • [29] On the contribution of the voice texture for speech spoofing detection
    Rahmeni, Raoudha
    Ben Aicha, Anis
    Ben Ayed, Yassine
    2019 19TH INTERNATIONAL CONFERENCE ON SCIENCES AND TECHNIQUES OF AUTOMATIC CONTROL AND COMPUTER ENGINEERING (STA), 2019, : 501 - 505
  • [30] Improving Replay Detection System with Channel Consistency DenseNeXt for the ASVspoof 2019 Challenge
    Zhang, Chao
    Cheng, Junjie
    Gu, Yanmei
    Wang, Huacan
    Ma, Jun
    Wang, Shaojun
    Xiao, Jing
    INTERSPEECH 2020, 2020, : 4596 - 4600