Towards single integrated spoofing-aware speaker verification embeddings

被引:1
|
作者
Mun, Sung Hwan [1 ]
Shim, Hye-jin [2 ]
Tak, Hemlata [3 ]
Wang, Xin [4 ]
Liu, Xuechen [2 ,5 ]
Sahidullah, Md [6 ]
Jeong, Myeonghun [1 ]
Han, Min Hyun [1 ]
Todisco, Massimiliano [3 ]
Lee, Kong Aik [7 ]
Yamagishi, Junichi [4 ]
Evans, Nicholas [3 ]
Kinnunen, Tomi [2 ]
Kim, Nam Soo [1 ]
Jung, Jee-weon [8 ]
机构
[1] Seoul Natl Univ, Seoul, South Korea
[2] Univ Eastern Finland, Kuopio, Finland
[3] EURECOM, Biot, France
[4] Natl Inst Informat, Tokyo, Japan
[5] INRIA, Le Chesnay Rocquencourt, France
[6] TCG CREST, Kolkata, India
[7] ASTAR, Inst Infocomm Res, Singapore, Singapore
[8] Carnegie Mellon Univ, Pittsburgh, PA USA
来源
基金
芬兰科学院;
关键词
spoofing-aware speaker verification; speaker verification; anti-spoofing;
D O I
10.21437/Interspeech.2023-1402
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This study aims to develop a single integrated spoofing-aware speaker verification (SASV) embeddings that satisfy two aspects. First, rejecting non-target speakers' input as well as target speakers' spoofed inputs should be addressed. Second, competitive performance should be demonstrated compared to the fusion of automatic speaker verification (ASV) and countermeasure (CM) embeddings, which outperformed single embedding solutions by a large margin in the SASV2022 challenge. We analyze that the inferior performance of single SASV embeddings comes from insufficient amount of training data and distinct nature of ASV and CM tasks. To this end, we propose a novel framework that includes multi-stage training and a combination of loss functions. Copy synthesis, combined with several vocoders, is also exploited to address the lack of spoofed data. Experimental results show dramatic improvements, achieving an SASV-EER of 1.06% on the evaluation protocol of the SASV2022 challenge.
引用
收藏
页码:3989 / 3993
页数:5
相关论文
共 50 条
  • [41] Feature selection based on CQCCs for automatic speaker verification spoofing
    Wang, Xianliang
    Xiao, Yanhong
    Zhu, Xuan
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 32 - 36
  • [42] Tandem Assessment of Spoofing Countermeasures and Automatic Speaker Verification: Fundamentals
    Kinnunen, Tomi
    Delgado, Hector
    Evans, Nicholas
    Lee, Kong Aik
    Vestman, Ville
    Nautsch, Andreas
    Todisco, Massimiliano
    Wang, Xin
    Sahidullah, Md
    Yamagishi, Junichi
    Reynolds, Douglas A.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 2195 - 2210
  • [43] An assessment of automatic speaker verification vulnerabilities to replay spoofing attacks
    Janicki, Artur
    Alegre, Federico
    Evans, Nicholas
    SECURITY AND COMMUNICATION NETWORKS, 2016, 9 (15) : 3030 - 3044
  • [44] Optimizing Tandem Speaker Verification and Anti-Spoofing Systems
    Kanervisto, Anssi
    Hautamaki, Ville
    Kinnunen, Tomi
    Yamagishi, Junichi
    IEEE/ACM Transactions on Audio Speech and Language Processing, 2022, 30 : 477 - 488
  • [45] Anti-spoofing Methods for Automatic Speaker Verification System
    Lavrentyeva, Galina
    Novoselov, Sergey
    Simonchik, Konstantin
    ANALYSIS OF IMAGES, SOCIAL NETWORKS AND TEXTS, AIST 2016, 2017, 661 : 172 - 184
  • [46] Optimizing Tandem Speaker Verification and Anti-Spoofing Systems
    Kanervisto, Anssi
    Hautamaki, Ville
    Kinnunen, Tomi
    Yamagishi, Junichi
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 477 - 488
  • [47] Spoofing Speaker Verification With Voice Style Transfer And Reconstruction Loss
    Thebaud, Thomas
    Le Lan, Gael
    Larcher, Anthony
    2021 IEEE INTERNATIONAL WORKSHOP ON INFORMATION FORENSICS AND SECURITY (WIFS), 2021, : 7 - 13
  • [48] SV-DeiT: Speaker Verification with DeiTCap Spoofing Detection
    Ranjan, Rishabh
    Vatsa, Mayank
    Singh, Richa
    2023 IEEE INTERNATIONAL JOINT CONFERENCE ON BIOMETRICS, IJCB, 2023,
  • [49] Optimizing a-DCF for Spoofing-Robust Speaker Verification
    Kurnaz, Oguzhan
    Mishra, Jagabandhu
    Kinnunen, Tomi H.
    Hanilci, Cemal
    IEEE SIGNAL PROCESSING LETTERS, 2025, 32 : 1081 - 1085
  • [50] Group-based speaker embeddings for text-independent speaker verification
    Jung, Youngmoon
    Eom, Youngsik
    Lee, Yeonghyeon
    Kim, Hoirin
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2021, 40 (05): : 496 - 502