ASVspoof 2019: Spoofing Countermeasures for the Detection of Synthesized, Converted and Replayed Speech

被引：60

作者：

Nautsch A. ^{[1
]}

Wang X. ^{[2
]}

Evans N. ^{[1
]}

Kinnunen T.H. ^{[3
]}

Vestman V. ^{[3
]}

Todisco M. ^{[1
]}

Delgado H. ^{[4
]}

Sahidullah M. ^{[5
]}

Yamagishi J. ^{[6
]}

Lee K.A. ^{[7
]}

机构：

[1] Digital Security Department, EURECOM (Campus SophiaTech), Biot

[2] Digital Content and Media Sciences Research Division, National Institute of Informatics, Tokyo

[3] School of Computing, University of Eastern Finland (Joensuu Campus), Joensuu

[4] Department of Speech, Nuance Communications, Madrid

[5] Department of Multispeech, Universite de Lorraine, CNRS, Inria, LORIA, Nancy

[6] Yamagishi Laboratory, National Institute of Informatics, Tokyo

[7] Institute for Infocomm Research, A*STAR

来源：

IEEE Transactions on Biometrics, Behavior, and Identity Science | 2021年 / 3卷 / 02期

基金：

芬兰科学院; 日本科学技术振兴机构;

关键词：

automatic speaker verification; countermeasures; presentation attack detection; speaker recognition; Spoofing;

D O I：

10.1109/TBIOM.2021.3059479

中图分类号：

学科分类号：

摘要：

The ASVspoof initiative was conceived to spearhead research in anti-spoofing for automatic speaker verification (ASV). This paper describes the third in a series of bi-annual challenges: ASVspoof 2019. With the challenge database and protocols being described elsewhere, the focus of this paper is on results and the top performing single and ensemble system submissions from 62 teams, all of which out-perform the two baseline systems, often by a substantial margin. Deeper analyses shows that performance is dominated by specific conditions involving either specific spoofing attacks or specific acoustic environments. While fusion is shown to be particularly effective for the logical access scenario involving speech synthesis and voice conversion attacks, participants largely struggled to apply fusion successfully for the physical access scenario involving simulated replay attacks. This is likely the result of a lack of system complementarity, while oracle fusion experiments show clear potential to improve performance. Furthermore, while results for simulated data are promising, experiments with real replay data show a substantial gap, most likely due to the presence of additive noise in the latter. This finding, among others, leads to a number of ideas for further research and directions for future editions of the ASVspoof challenge. © 2019 IEEE.

引用

页码：252 / 265

页数：13

共 50 条

[1] ASVspoof 2019: A large-scale public database of synthetized, converted and replayed speech
Wang, Xin
Yamagishi, Junichi
Todisco, Massimiliano
Delgado, Hector
Nautsch, Andreas
Evans, Nicholas
Sahidullah, Md
Vestman, Ville
Kinnunen, Tomi
Lee, Kong Aik
Juvela, Lauri
Alku, Paavo
Peng, Yu-Huai
Hwang, Hsin-Te
Tsao, Yu
Wang, Hsin-Min
Le Maguer, Sebastien
Becker, Markus
Henderson, Fergus
Clark, Rob
Zhang, Yu
Wang, Quan
Jia, Ye
Onuma, Kai
Mushika, Koji
Kaned, Takashi
Jiang, Yuan
Liu, Li Juan
Wu, Yi-Chiao
Huang, Wen-Chin
Toda, Tomoki
Tanaka, Kou
Kameoka, Hirokazu
Steiner, Ingmar
Matrouf, Driss
Bonastre, Jean-Francois
Govender, Avashna
Ronanki, Srikanth
Zhang, Jing-Xuan
Ling, Zhen-Hua
COMPUTER SPEECH AND LANGUAGE, 2020, 64
[2] ASVspoof 2019: a large-scale public database of synthetized, converted and replayed speech
Wang X.
Yamagishi J.
Todisco M.
Delgado H.
Nautsch A.
Evans N.
Sahidullah M.
Vestman V.
Kinnunen T.
Lee K.A.
Juvela L.
Alku P.
Peng Y.-H.
Hwang H.-T.
Tsao Y.
Wang H.-M.
Maguer S.L.
Becker M.
Henderson F.
Clark R.
Zhang Y.
Wang Q.
Jia Y.
Onuma K.
Mushika K.
Kaneda T.
Jiang Y.
Liu L.-J.
Wu Y.-C.
Huang W.-C.
Toda T.
Tanaka K.
Kameoka H.
Steiner I.
Matrouf D.
Bonastre J.-F.
Govender A.
Ronanki S.
Zhang J.-X.
Ling Z.-H.
Wang, Xin (wangxin@nii.ac.jp), 1600, Academic Press (64):
[3] ASVspoof: The Automatic Speaker Verification Spoofing and Countermeasures Challenge
Wu, Zhizheng
Yamagishi, Junichi
Kinnunen, Tomi
Hanilci, Cemal
Sahidullah, Mohammed
Sizov, Aleksandr
Evans, Nicholas
Todisco, Massimiliano
Delgado, Hector
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2017, 11 (04) : 588 - 604
[4] ASVspoof 2015: the First Automatic Speaker Verification Spoofing and Countermeasures Challenge
Wu, Zhizheng
Kinnunen, Tomi
Evans, Nicholas
Yamagishi, Junichi
Hanilci, Cemal
Sahidullah, Md
Sizov, Aleksandr
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2037 - 2041
[5] Integrated Spoofing Countermeasures and Automatic Speaker Verification: an Evaluation on ASVspoof 2015
Sahidullah, Md
Delgado, Hector
Todisco, Massimiliano
Yu, Hong
Kinnunen, Tomi
Evans, Nicholas
Tana, Zheng-Hua
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 1700 - 1704
[6] GENERALIZATION OF SPOOFING COUNTERMEASURES: A CASE STUDY WITH ASVSPOOF 2015 AND BTAS 2016 CORPORA
Paul, Dipjyoti
Sahidullah, Md
Saha, Goutam
2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 2047 - 2051
[7] SHNU Anti-spoofing Systems for ASVspoof 2019 Challenge
Feng, Zhimin
Tong, Qiqi
Long, Yanhua
Wei, Shuang
Yang, Chunxia
Zhang, Qiaozheng
2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 548 - 552
[8] The SJTU Robust Anti-spoofing System for the ASVspoof 2019 Challenge
Yang, Yexin
Wang, Hongji
Dinkel, Heinrich
Chen, Zhengyang
Wang, Shuai
Qian, Yanmin
Yu, Kai
INTERSPEECH 2019, 2019, : 1038 - 1042
[9] A Comparative Study on Recent Neural Spoofing Countermeasures for Synthetic Speech Detection
Wang, Xin
Yamagishi, Junichi
INTERSPEECH 2021, 2021, : 4259 - 4263
[10] The ASVspoof 2017 Challenge: Assessing the Limits of Replay Spoofing Attack Detection
Kinnunen, Tomi
Sahidullah, Md
Delgado, Hector
Todisco, Massimiliano
Evans, Nicholas
Yamagishi, Junichi
Lee, Kong Aik
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2 - 6

← 1 2 3 4 5 →