Towards single integrated spoofing-aware speaker verification embeddings

被引:1
|
作者
Mun, Sung Hwan [1 ]
Shim, Hye-jin [2 ]
Tak, Hemlata [3 ]
Wang, Xin [4 ]
Liu, Xuechen [2 ,5 ]
Sahidullah, Md [6 ]
Jeong, Myeonghun [1 ]
Han, Min Hyun [1 ]
Todisco, Massimiliano [3 ]
Lee, Kong Aik [7 ]
Yamagishi, Junichi [4 ]
Evans, Nicholas [3 ]
Kinnunen, Tomi [2 ]
Kim, Nam Soo [1 ]
Jung, Jee-weon [8 ]
机构
[1] Seoul Natl Univ, Seoul, South Korea
[2] Univ Eastern Finland, Kuopio, Finland
[3] EURECOM, Biot, France
[4] Natl Inst Informat, Tokyo, Japan
[5] INRIA, Le Chesnay Rocquencourt, France
[6] TCG CREST, Kolkata, India
[7] ASTAR, Inst Infocomm Res, Singapore, Singapore
[8] Carnegie Mellon Univ, Pittsburgh, PA USA
来源
基金
芬兰科学院;
关键词
spoofing-aware speaker verification; speaker verification; anti-spoofing;
D O I
10.21437/Interspeech.2023-1402
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This study aims to develop a single integrated spoofing-aware speaker verification (SASV) embeddings that satisfy two aspects. First, rejecting non-target speakers' input as well as target speakers' spoofed inputs should be addressed. Second, competitive performance should be demonstrated compared to the fusion of automatic speaker verification (ASV) and countermeasure (CM) embeddings, which outperformed single embedding solutions by a large margin in the SASV2022 challenge. We analyze that the inferior performance of single SASV embeddings comes from insufficient amount of training data and distinct nature of ASV and CM tasks. To this end, we propose a novel framework that includes multi-stage training and a combination of loss functions. Copy synthesis, combined with several vocoders, is also exploited to address the lack of spoofed data. Experimental results show dramatic improvements, achieving an SASV-EER of 1.06% on the evaluation protocol of the SASV2022 challenge.
引用
收藏
页码:3989 / 3993
页数:5
相关论文
共 50 条
  • [31] ADVERSARIAL ATTACKS ON SPOOFING COUNTERMEASURES OF AUTOMATIC SPEAKER VERIFICATION
    Liu, Songxiang
    Wu, Haibin
    Lee, Hung-yi
    Meng, Helen
    2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019, : 312 - 319
  • [32] SPEAKER VERIFICATION USING SECURE BINARY EMBEDDINGS
    Portelo, Jose
    Raj, Bhiksha
    Boufounos, Petros
    Trancoso, Isabel
    Abad, Alberto
    2013 PROCEEDINGS OF THE 21ST EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2013,
  • [33] Preventing converted speech spoofing attacks in speaker verification
    Correia, M. J.
    Abad, A.
    Trancoso, I.
    2014 37TH INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2014, : 1320 - 1325
  • [34] ASVspoof: The Automatic Speaker Verification Spoofing and Countermeasures Challenge
    Wu, Zhizheng
    Yamagishi, Junichi
    Kinnunen, Tomi
    Hanilci, Cemal
    Sahidullah, Mohammed
    Sizov, Aleksandr
    Evans, Nicholas
    Todisco, Massimiliano
    Delgado, Hector
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2017, 11 (04) : 588 - 604
  • [35] Spoofing Speaker Verification System by Adversarial Examples Leveraging the Generalized Speaker Difference
    Luo, Hongwei
    Shen, Yijie
    Lin, Feng
    Xu, Guoai
    SECURITY AND COMMUNICATION NETWORKS, 2021, 2021
  • [36] TEXT ADAPTATION FOR SPEAKER VERIFICATION WITH SPEAKER-TEXT FACTORIZED EMBEDDINGS
    Yang, Yexin
    Wang, Shuai
    Gong, Xun
    Qian, Yanmin
    Yu, Kai
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6454 - 6458
  • [37] Deeply Fused Speaker Embeddings for Text-Independent Speaker Verification
    Bhattacharya, Gautam
    Alam, Jahangir
    Gupta, Vishwa
    Kenny, Patrick
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3588 - 3592
  • [38] SAS : A SPEAKER VERIFICATION SPOOFING DATABASE CONTAINING DIVERSE ATTACKS
    Wu, Zhizheng
    Khodabakhsh, Ali
    Demiroglu, Cenk
    Yamagishi, Junichi
    Saito, Daisuke
    Toda, Tomoki
    King, Simon
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4440 - 4444
  • [39] Deep Discriminative Embeddings for Duration Robust Speaker Verification
    Li, Na
    Tuo, Deyi
    Su, Dan
    Li, Zhifeng
    Yu, Dong
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2262 - 2266
  • [40] DoubleDeceiver: Deceiving the Speaker Verification System Protected by Spoofing Countermeasures
    Zhang, Mengao
    Xu, Ke
    Li, Hao
    Wang, Lei
    Fang, Chengfang
    Shi, Jie
    INTERSPEECH 2023, 2023, : 4014 - 4018