ASVspoof 2019: Spoofing Countermeasures for the Detection of Synthesized, Converted and Replayed Speech

被引:60
|
作者
Nautsch A. [1 ]
Wang X. [2 ]
Evans N. [1 ]
Kinnunen T.H. [3 ]
Vestman V. [3 ]
Todisco M. [1 ]
Delgado H. [4 ]
Sahidullah M. [5 ]
Yamagishi J. [6 ]
Lee K.A. [7 ]
机构
[1] Digital Security Department, EURECOM (Campus SophiaTech), Biot
[2] Digital Content and Media Sciences Research Division, National Institute of Informatics, Tokyo
[3] School of Computing, University of Eastern Finland (Joensuu Campus), Joensuu
[4] Department of Speech, Nuance Communications, Madrid
[5] Department of Multispeech, Universite de Lorraine, CNRS, Inria, LORIA, Nancy
[6] Yamagishi Laboratory, National Institute of Informatics, Tokyo
[7] Institute for Infocomm Research, A*STAR
基金
芬兰科学院; 日本科学技术振兴机构;
关键词
automatic speaker verification; countermeasures; presentation attack detection; speaker recognition; Spoofing;
D O I
10.1109/TBIOM.2021.3059479
中图分类号
学科分类号
摘要
The ASVspoof initiative was conceived to spearhead research in anti-spoofing for automatic speaker verification (ASV). This paper describes the third in a series of bi-annual challenges: ASVspoof 2019. With the challenge database and protocols being described elsewhere, the focus of this paper is on results and the top performing single and ensemble system submissions from 62 teams, all of which out-perform the two baseline systems, often by a substantial margin. Deeper analyses shows that performance is dominated by specific conditions involving either specific spoofing attacks or specific acoustic environments. While fusion is shown to be particularly effective for the logical access scenario involving speech synthesis and voice conversion attacks, participants largely struggled to apply fusion successfully for the physical access scenario involving simulated replay attacks. This is likely the result of a lack of system complementarity, while oracle fusion experiments show clear potential to improve performance. Furthermore, while results for simulated data are promising, experiments with real replay data show a substantial gap, most likely due to the presence of additive noise in the latter. This finding, among others, leads to a number of ideas for further research and directions for future editions of the ASVspoof challenge. © 2019 IEEE.
引用
收藏
页码:252 / 265
页数:13
相关论文
共 50 条
  • [1] ASVspoof 2019: A large-scale public database of synthetized, converted and replayed speech
    Wang, Xin
    Yamagishi, Junichi
    Todisco, Massimiliano
    Delgado, Hector
    Nautsch, Andreas
    Evans, Nicholas
    Sahidullah, Md
    Vestman, Ville
    Kinnunen, Tomi
    Lee, Kong Aik
    Juvela, Lauri
    Alku, Paavo
    Peng, Yu-Huai
    Hwang, Hsin-Te
    Tsao, Yu
    Wang, Hsin-Min
    Le Maguer, Sebastien
    Becker, Markus
    Henderson, Fergus
    Clark, Rob
    Zhang, Yu
    Wang, Quan
    Jia, Ye
    Onuma, Kai
    Mushika, Koji
    Kaned, Takashi
    Jiang, Yuan
    Liu, Li Juan
    Wu, Yi-Chiao
    Huang, Wen-Chin
    Toda, Tomoki
    Tanaka, Kou
    Kameoka, Hirokazu
    Steiner, Ingmar
    Matrouf, Driss
    Bonastre, Jean-Francois
    Govender, Avashna
    Ronanki, Srikanth
    Zhang, Jing-Xuan
    Ling, Zhen-Hua
    COMPUTER SPEECH AND LANGUAGE, 2020, 64
  • [2] ASVspoof 2019: a large-scale public database of synthetized, converted and replayed speech
    Wang X.
    Yamagishi J.
    Todisco M.
    Delgado H.
    Nautsch A.
    Evans N.
    Sahidullah M.
    Vestman V.
    Kinnunen T.
    Lee K.A.
    Juvela L.
    Alku P.
    Peng Y.-H.
    Hwang H.-T.
    Tsao Y.
    Wang H.-M.
    Maguer S.L.
    Becker M.
    Henderson F.
    Clark R.
    Zhang Y.
    Wang Q.
    Jia Y.
    Onuma K.
    Mushika K.
    Kaneda T.
    Jiang Y.
    Liu L.-J.
    Wu Y.-C.
    Huang W.-C.
    Toda T.
    Tanaka K.
    Kameoka H.
    Steiner I.
    Matrouf D.
    Bonastre J.-F.
    Govender A.
    Ronanki S.
    Zhang J.-X.
    Ling Z.-H.
    Wang, Xin (wangxin@nii.ac.jp), 1600, Academic Press (64):
  • [3] ASVspoof: The Automatic Speaker Verification Spoofing and Countermeasures Challenge
    Wu, Zhizheng
    Yamagishi, Junichi
    Kinnunen, Tomi
    Hanilci, Cemal
    Sahidullah, Mohammed
    Sizov, Aleksandr
    Evans, Nicholas
    Todisco, Massimiliano
    Delgado, Hector
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2017, 11 (04) : 588 - 604
  • [4] ASVspoof 2015: the First Automatic Speaker Verification Spoofing and Countermeasures Challenge
    Wu, Zhizheng
    Kinnunen, Tomi
    Evans, Nicholas
    Yamagishi, Junichi
    Hanilci, Cemal
    Sahidullah, Md
    Sizov, Aleksandr
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2037 - 2041
  • [5] Integrated Spoofing Countermeasures and Automatic Speaker Verification: an Evaluation on ASVspoof 2015
    Sahidullah, Md
    Delgado, Hector
    Todisco, Massimiliano
    Yu, Hong
    Kinnunen, Tomi
    Evans, Nicholas
    Tana, Zheng-Hua
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 1700 - 1704
  • [6] GENERALIZATION OF SPOOFING COUNTERMEASURES: A CASE STUDY WITH ASVSPOOF 2015 AND BTAS 2016 CORPORA
    Paul, Dipjyoti
    Sahidullah, Md
    Saha, Goutam
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 2047 - 2051
  • [7] SHNU Anti-spoofing Systems for ASVspoof 2019 Challenge
    Feng, Zhimin
    Tong, Qiqi
    Long, Yanhua
    Wei, Shuang
    Yang, Chunxia
    Zhang, Qiaozheng
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 548 - 552
  • [8] The SJTU Robust Anti-spoofing System for the ASVspoof 2019 Challenge
    Yang, Yexin
    Wang, Hongji
    Dinkel, Heinrich
    Chen, Zhengyang
    Wang, Shuai
    Qian, Yanmin
    Yu, Kai
    INTERSPEECH 2019, 2019, : 1038 - 1042
  • [9] A Comparative Study on Recent Neural Spoofing Countermeasures for Synthetic Speech Detection
    Wang, Xin
    Yamagishi, Junichi
    INTERSPEECH 2021, 2021, : 4259 - 4263
  • [10] The ASVspoof 2017 Challenge: Assessing the Limits of Replay Spoofing Attack Detection
    Kinnunen, Tomi
    Sahidullah, Md
    Delgado, Hector
    Todisco, Massimiliano
    Evans, Nicholas
    Yamagishi, Junichi
    Lee, Kong Aik
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2 - 6