ASVspoof 2019: A large-scale public database of synthetized, converted and replayed speech

被引:0
|
作者
Wang, Xin [1 ]
Yamagishi, Junichi [1 ,2 ]
Todisco, Massimiliano [3 ]
Delgado, Hector [3 ]
Nautsch, Andreas [3 ]
Evans, Nicholas [3 ]
Sahidullah, Md [4 ]
Vestman, Ville [5 ]
Kinnunen, Tomi [5 ]
Lee, Kong Aik [6 ]
Juvela, Lauri [7 ]
Alku, Paavo [7 ]
Peng, Yu-Huai [8 ]
Hwang, Hsin-Te [8 ]
Tsao, Yu [8 ]
Wang, Hsin-Min [8 ]
Le Maguer, Sebastien [9 ]
Becker, Markus [10 ]
Henderson, Fergus [10 ]
Clark, Rob [10 ]
Zhang, Yu [10 ]
Wang, Quan [10 ]
Jia, Ye [10 ]
Onuma, Kai [11 ]
Mushika, Koji [11 ]
Kaned, Takashi [11 ]
Jiang, Yuan [12 ]
Liu, Li Juan [12 ]
Wu, Yi-Chiao [13 ]
Huang, Wen-Chin [13 ]
Toda, Tomoki [13 ]
Tanaka, Kou [14 ]
Kameoka, Hirokazu [14 ]
Steiner, Ingmar [15 ]
Matrouf, Driss [16 ]
Bonastre, Jean-Francois [16 ]
Govender, Avashna [2 ]
Ronanki, Srikanth [2 ,17 ]
Zhang, Jing-Xuan [18 ]
Ling, Zhen-Hua [18 ]
机构
[1] Natl Inst Informat, Chiyoda Ku, 2-1-2 Hitotsubashi, Tokyo, Japan
[2] Univ Edinburgh, Ctr Speech Technol Res, Edinburgh, Midlothian, Scotland
[3] EURECOM, Campus SophiaTech,450 Route Chappes, F-06410 Biot, France
[4] Univ Lorraine, CNRS, INRIA, LORIA, F-54000 Nancy, France
[5] Univ Eastern Finland, Joensuu Campus,Yliopistokatu 2, FI-80100 Joensuu, Finland
[6] NEC Corp Ltd, Minato Ku, 7-1,Shiba 5 Chome, Tokyo 1088001, Japan
[7] Aalto Univ, Rakentajanaukio 2 C, Aalto 00076, Finland
[8] Acad Sinica, 128,Sec 2,Acad Rd, Taipei, Taiwan
[9] Trinity Coll Dublin, Sch Engn, Sigmedia, ADAPT Ctr, Dublin, Ireland
[10] Google Inc, 1600 Amphitheatre Pkwy, Mountain View, CA 94043 USA
[11] HOYA, Shinjuku Ku, Shinjuku Pk Tower 35F,3-7-1 Nishi Shinjuku, Tokyo 1631035, Japan
[12] iFlytek Res, 666 Wangjiang West Rd, Hefei 230088, Peoples R China
[13] Nagoya Univ, Chikusa Ku, Furo Cho, Nagoya, Aichi 4648601, Japan
[14] NTT Commun Sci Labs, 3-1 Morinosato Wakamiya, Atsugi, Kanagawa 2430198, Japan
[15] audEERING GmbH, Friedrichshafener Str 1, D-82205 Gilching, Germany
[16] Avignon Univ, LIA, 339 Chemin Meinajaris, F-84911 Avignon, France
[17] Amazon, Seattle, WA USA
[18] Univ Sci & Technol China, 96 JinZhai Rd, Hefei 230026, Anhui, Peoples R China
来源
基金
芬兰科学院; 爱尔兰科学基金会;
关键词
SPEAKER; COUNTERMEASURES; VOCODER;
D O I
10.1016/j.csi.2020.101114
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
引用
收藏
页数:27
相关论文
共 50 条
  • [41] Exploring Large-Scale Interactive Public Illustrations
    Thorn, Emily-Clare
    Rennick-Egglestone, Stefan
    Koleva, Boriana
    Preston, William
    Benford, Steve
    Quinn, Anthony
    Mortier, Richard
    DIS 2016: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON DESIGNING INTERACTIVE SYSTEMS, 2016, : 17 - 27
  • [42] THE SPEECHTRANSFORMER FOR LARGE-SCALE MANDARIN CHINESE SPEECH RECOGNITION
    Zhao, Yuanyuan
    Li, Jie
    Wang, Xiaorui
    Li, Yan
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 7095 - 7099
  • [43] LARGE-SCALE PUBLIC PROJECTS - THE PERSONAL CONNECTION
    GOLDENBERG, TY
    MEVORACH, B
    PUBLIC ADMINISTRATION AND DEVELOPMENT, 1991, 11 (01) : 57 - 65
  • [44] Analyzing Large-Scale Public Campaigns on Twitter
    Proskurnia, Julia
    Mavlyutov, Ruslan
    Prokofyev, Roman
    Aberer, Karl
    Cudre-Mauroux, Philippe
    SOCIAL INFORMATICS, PT II, 2016, 10047 : 225 - 243
  • [45] MLS: A Large-Scale Multilingual Dataset for Speech Research
    Pratap, Vineel
    Xu, Qiantong
    Sriram, Anuroop
    Synnaeve, Gabriel
    Collobert, Ronan
    INTERSPEECH 2020, 2020, : 2757 - 2761
  • [46] Multimodal and Multilingual Embeddings for Large-Scale Speech Mining
    Duquenne, Paul-Ambroise
    Gong, Hongyu
    Schwenk, Holger
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [47] Problems on large-scale speech corpus and the applications in TTS
    Zhang S.
    Liu L.
    Diao L.-H.
    Jisuanji Xuebao/Chinese Journal of Computers, 2010, 33 (04): : 687 - 696
  • [48] SpeechMatrix: A Large-Scale Mined Corpus of Multilingual Speech-to-Speech Translations
    Duquenne, Paul-Ambroise
    Gong, Hongyu
    Dong, Ning
    Du, Jingfei
    Lee, Ann
    Goswami, Vedanuj
    Wang, Changhan
    Pino, Juan
    Sagot, Benoit
    Schwenk, Holger
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 16251 - 16269
  • [49] Filling the gap between a large-scale database and Multimodal interactions
    Araki, Masahiro
    LARGE-SCALE KNOWLEDGE RESOURCES: CONSTRUCTION AND APPLICATION, 2008, 4938 : 179 - 185
  • [50] Mining basic active structures from a large-scale database
    Naoto Takada
    Norihito Ohmori
    Takashi Okada
    Journal of Cheminformatics, 5