Towards single integrated spoofing-aware speaker verification embeddings

被引:1
|
作者
Mun, Sung Hwan [1 ]
Shim, Hye-jin [2 ]
Tak, Hemlata [3 ]
Wang, Xin [4 ]
Liu, Xuechen [2 ,5 ]
Sahidullah, Md [6 ]
Jeong, Myeonghun [1 ]
Han, Min Hyun [1 ]
Todisco, Massimiliano [3 ]
Lee, Kong Aik [7 ]
Yamagishi, Junichi [4 ]
Evans, Nicholas [3 ]
Kinnunen, Tomi [2 ]
Kim, Nam Soo [1 ]
Jung, Jee-weon [8 ]
机构
[1] Seoul Natl Univ, Seoul, South Korea
[2] Univ Eastern Finland, Kuopio, Finland
[3] EURECOM, Biot, France
[4] Natl Inst Informat, Tokyo, Japan
[5] INRIA, Le Chesnay Rocquencourt, France
[6] TCG CREST, Kolkata, India
[7] ASTAR, Inst Infocomm Res, Singapore, Singapore
[8] Carnegie Mellon Univ, Pittsburgh, PA USA
来源
基金
芬兰科学院;
关键词
spoofing-aware speaker verification; speaker verification; anti-spoofing;
D O I
10.21437/Interspeech.2023-1402
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This study aims to develop a single integrated spoofing-aware speaker verification (SASV) embeddings that satisfy two aspects. First, rejecting non-target speakers' input as well as target speakers' spoofed inputs should be addressed. Second, competitive performance should be demonstrated compared to the fusion of automatic speaker verification (ASV) and countermeasure (CM) embeddings, which outperformed single embedding solutions by a large margin in the SASV2022 challenge. We analyze that the inferior performance of single SASV embeddings comes from insufficient amount of training data and distinct nature of ASV and CM tasks. To this end, we propose a novel framework that includes multi-stage training and a combination of loss functions. Copy synthesis, combined with several vocoders, is also exploited to address the lack of spoofed data. Experimental results show dramatic improvements, achieving an SASV-EER of 1.06% on the evaluation protocol of the SASV2022 challenge.
引用
收藏
页码:3989 / 3993
页数:5
相关论文
共 50 条
  • [21] CONTENT-AWARE SPEAKER EMBEDDINGS FOR SPEAKER DIARISATION
    Sun, G.
    Liu, D.
    Zhang, C.
    Woodland, P. C.
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7168 - 7172
  • [22] On the Vulnerability of Speaker Verification to Realistic Voice Spoofing
    Ergunay, Serife Kucur
    Khoury, Elie
    Lazaridis, Alexandros
    Marcel, Sebastien
    2015 IEEE 7TH INTERNATIONAL CONFERENCE ON BIOMETRICS THEORY, APPLICATIONS AND SYSTEMS (BTAS 2015), 2015,
  • [23] Backend Ensemble for Speaker Verification and Spoofing Countermeasure
    Zhang, Li
    Li, Yue
    Zhao, Huan
    Wang, Qing
    Xie, Lei
    INTERSPEECH 2022, 2022, : 4381 - 4385
  • [24] Speaker-Aware Anti-spoofing
    Liu, Xuechen
    Sahidullah, Md
    Lee, Kong Aik
    Kinnunen, Tomi
    INTERSPEECH 2023, 2023, : 2498 - 2502
  • [25] Towards Generating Adversarial Examples on Combined Systems of Automatic Speaker Verification and Spoofing Countermeasure
    Zhang, Xingyu
    Zhang, Xiongwei
    Zou, Xia
    Liu, Haibo
    Sun, Meng
    SECURITY AND COMMUNICATION NETWORKS, 2022, 2022
  • [26] Deep Speaker Embeddings for Short-Duration Speaker Verification
    Bhattacharya, Gautam
    Alam, Jahangir
    Kenny, Patrick
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1517 - 1521
  • [27] Deep speaker embeddings for Speaker Verification: Review and experimental comparison
    Jakubec, Maros
    Jarina, Roman
    Lieskovska, Eva
    Kasak, Peter
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 127
  • [28] Ensemble Models for Spoofing Detection in Automatic Speaker Verification
    Chettri, Bhusan
    Stoller, Daniel
    Morfi, Veronica
    Ramirez, Marco A. Martinez
    Benetos, Emmanouil
    Sturm, Bob L.
    INTERSPEECH 2019, 2019, : 1018 - 1022
  • [29] Voice conversion and spoofing attack on speaker verification systems
    Wu, Zhizheng
    Li, Haizhou
    2013 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2013,
  • [30] Introduction to the Issue on Spoofing and Countermeasures for Automatic Speaker Verification
    Yamagishi, Junichi
    Kinnunen, Tomi H.
    Evans, Nicholas
    De Leon, Phillip
    Trancoso, Isabel
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2017, 11 (04) : 585 - 587