ASVspoof 2019: A large-scale public database of synthetized, converted and replayed speech

Authors
Wang, Xin [1]
Yamagishi, Junichi [1, 2]
Todisco, Massimiliano [3]
Delgado, Hector [3]
Nautsch, Andreas [3]
Evans, Nicholas [3]
Sahidullah, Md [4]
Vestman, Ville [5]
Kinnunen, Tomi [5]
Lee, Kong Aik [6]
Juvela, Lauri [7]
Alku, Paavo [7]
Peng, Yu-Huai [8]
Hwang, Hsin-Te [8]
Tsao, Yu [8]
Wang, Hsin-Min [8]
Le Maguer, Sebastien [9]
Becker, Markus [10]
Henderson, Fergus [10]
Clark, Rob [10]
Zhang, Yu [10]
Wang, Quan [10]
Jia, Ye [10]
Onuma, Kai [11]
Mushika, Koji [11]
Kaneda, Takashi [11]
Jiang, Yuan [12]
Liu, Li Juan [12]
Wu, Yi-Chiao [13]
Huang, Wen-Chin [13]
Toda, Tomoki [13]
Tanaka, Kou [14]
Kameoka, Hirokazu [14]
Steiner, Ingmar [15]
Matrouf, Driss [16]
Bonastre, Jean-Francois [16]
Govender, Avashna [2]
Ronanki, Srikanth [2, 17]
Zhang, Jing-Xuan [18]
Ling, Zhen-Hua [18]
Affiliations
[1] Natl Inst Informat, Chiyoda Ku, 2-1-2 Hitotsubashi, Tokyo, Japan
[2] Univ Edinburgh, Ctr Speech Technol Res, Edinburgh, Midlothian, Scotland
[3] EURECOM, Campus SophiaTech,450 Route Chappes, F-06410 Biot, France
[4] Univ Lorraine, CNRS, INRIA, LORIA, F-54000 Nancy, France
[5] Univ Eastern Finland, Joensuu Campus,Yliopistokatu 2, FI-80100 Joensuu, Finland
[6] NEC Corp Ltd, Minato Ku, 7-1,Shiba 5 Chome, Tokyo 1088001, Japan
[7] Aalto Univ, Rakentajanaukio 2 C, Aalto 00076, Finland
[8] Acad Sinica, 128,Sec 2,Acad Rd, Taipei, Taiwan
[9] Trinity Coll Dublin, Sch Engn, Sigmedia, ADAPT Ctr, Dublin, Ireland
[10] Google Inc, 1600 Amphitheatre Pkwy, Mountain View, CA 94043 USA
[11] HOYA, Shinjuku Ku, Shinjuku Pk Tower 35F,3-7-1 Nishi Shinjuku, Tokyo 1631035, Japan
[12] iFlytek Res, 666 Wangjiang West Rd, Hefei 230088, Peoples R China
[13] Nagoya Univ, Chikusa Ku, Furo Cho, Nagoya, Aichi 4648601, Japan
[14] NTT Commun Sci Labs, 3-1 Morinosato Wakamiya, Atsugi, Kanagawa 2430198, Japan
[15] audEERING GmbH, Friedrichshafener Str 1, D-82205 Gilching, Germany
[16] Avignon Univ, LIA, 339 Chemin Meinajaris, F-84911 Avignon, France
[17] Amazon, Seattle, WA USA
[18] Univ Sci & Technol China, 96 JinZhai Rd, Hefei 230026, Anhui, Peoples R China
Source
Funding
Academy of Finland; Science Foundation Ireland;
Keywords
SPEAKER; COUNTERMEASURES; VOCODER;
DOI
10.1016/j.csl.2020.101114
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Pages: 27
Related papers
50 results in total
  • [1] ASVspoof 2019: a large-scale public database of synthetized, converted and replayed speech
    Wang X.
    Yamagishi J.
    Todisco M.
    Delgado H.
    Nautsch A.
    Evans N.
    Sahidullah M.
    Vestman V.
    Kinnunen T.
    Lee K.A.
    Juvela L.
    Alku P.
    Peng Y.-H.
    Hwang H.-T.
    Tsao Y.
    Wang H.-M.
    Le Maguer S.
    Becker M.
    Henderson F.
    Clark R.
    Zhang Y.
    Wang Q.
    Jia Y.
    Onuma K.
    Mushika K.
    Kaneda T.
    Jiang Y.
    Liu L.-J.
    Wu Y.-C.
    Huang W.-C.
    Toda T.
    Tanaka K.
    Kameoka H.
    Steiner I.
    Matrouf D.
    Bonastre J.-F.
    Govender A.
    Ronanki S.
    Zhang J.-X.
    Ling Z.-H.
    Wang, Xin (wangxin@nii.ac.jp), 1600, Academic Press (64):
  • [2] ASVspoof 2019: Spoofing Countermeasures for the Detection of Synthesized, Converted and Replayed Speech
    Nautsch A.
    Wang X.
    Evans N.
    Kinnunen T.H.
    Vestman V.
    Todisco M.
    Delgado H.
    Sahidullah M.
    Yamagishi J.
    Lee K.A.
    IEEE Transactions on Biometrics, Behavior, and Identity Science, 2021, 3 (02): 252 - 265
  • [3] A Large-Scale Japanese Speech Database
    1600, (The International Society for Computers and Their Applications (ISCA)):
  • [4] ShEMO: a large-scale validated database for Persian speech emotion detection
    Nezami, Omid Mohamad
    Lou, Paria Jamshid
    Karami, Mansoureh
    LANGUAGE RESOURCES AND EVALUATION, 2019, 53 (01) : 1 - 16
  • [5] ShEMO: a large-scale validated database for Persian speech emotion detection
    Omid Mohamad Nezami
    Paria Jamshid Lou
    Mansoureh Karami
    Language Resources and Evaluation, 2019, 53 : 1 - 16
  • [6] Biological evaluation of large-scale synthetized superparamagnetic iron oxide nanoparticles
    Estevez, Manuel
    Cicuendez, Monica
    Crespo, Julian
    Acevedo, Claudio Fernandez
    Mateo, Tamara Oroz
    Leza, Amaia Rada
    Serrano, Juana
    Colilla, Montserrat
    Morales, Maria Del Puerto
    Gonzalez, Blanca
    Barba, Isabel Izquierdo
    Regis, Maria Vallet
    TISSUE ENGINEERING PART A, 2023, 29 (13-14)
  • [7] Video Violence Rating: A Large-Scale Public Database and A Multimodal Rating Model
    Xiang, Tao
    Pan, Hongyan
    Nan, Zhixiong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 8557 - 8568
  • [8] A SURVEY OF LARGE-SCALE DATABASE ISSUES
    ANTENUCCI, JC
    GIS-87 SAN FRANCISCO, VOL 3: INTO THE HANDS OF THE DECISION MAKER, 1988, : 17 - 21
  • [9] LARGE-SCALE TELECOMMUNICATIONS AND DATABASE STANDARDS
    LEFKON, RG
    DATA MANAGEMENT, 1987, 25 (05): 18 - 24
  • [10] MMsINC: a large-scale chemoinformatics database
    Masciocchi, Joel
    Frau, Gianfranco
    Fanton, Marco
    Sturlese, Mattia
    Floris, Matteo
    Pireddu, Luca
    Palla, Piergiorgio
    Cedrati, Fabian
    Rodriguez-Tome, Patricia
    Moro, Stefano
    NUCLEIC ACIDS RESEARCH, 2009, 37 : D284 - D290