Towards single integrated spoofing-aware speaker verification embeddings

被引：1

作者：

Mun, Sung Hwan ^{[1
]}

Shim, Hye-jin ^{[2
]}

Tak, Hemlata ^{[3
]}

Wang, Xin ^{[4
]}

Liu, Xuechen ^{[2
,5
]}

Sahidullah, Md ^{[6
]}

Jeong, Myeonghun ^{[1
]}

Han, Min Hyun ^{[1
]}

Todisco, Massimiliano ^{[3
]}

Lee, Kong Aik ^{[7
]}

Yamagishi, Junichi ^{[4
]}

Evans, Nicholas ^{[3
]}

Kinnunen, Tomi ^{[2
]}

Kim, Nam Soo ^{[1
]}

Jung, Jee-weon ^{[8
]}

机构：

[1] Seoul Natl Univ, Seoul, South Korea

[2] Univ Eastern Finland, Kuopio, Finland

[3] EURECOM, Biot, France

[4] Natl Inst Informat, Tokyo, Japan

[5] INRIA, Le Chesnay Rocquencourt, France

[6] TCG CREST, Kolkata, India

[7] ASTAR, Inst Infocomm Res, Singapore, Singapore

[8] Carnegie Mellon Univ, Pittsburgh, PA USA

来源：

INTERSPEECH 2023 | 2023年

基金：

芬兰科学院;

关键词：

spoofing-aware speaker verification; speaker verification; anti-spoofing;

D O I：

10.21437/Interspeech.2023-1402

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This study aims to develop a single integrated spoofing-aware speaker verification (SASV) embeddings that satisfy two aspects. First, rejecting non-target speakers' input as well as target speakers' spoofed inputs should be addressed. Second, competitive performance should be demonstrated compared to the fusion of automatic speaker verification (ASV) and countermeasure (CM) embeddings, which outperformed single embedding solutions by a large margin in the SASV2022 challenge. We analyze that the inferior performance of single SASV embeddings comes from insufficient amount of training data and distinct nature of ASV and CM tasks. To this end, we propose a novel framework that includes multi-stage training and a combination of loss functions. Copy synthesis, combined with several vocoders, is also exploited to address the lack of spoofed data. Experimental results show dramatic improvements, achieving an SASV-EER of 1.06% on the evaluation protocol of the SASV2022 challenge.

引用

页码：3989 / 3993

页数：5

共 50 条

[41] Feature selection based on CQCCs for automatic speaker verification spoofing
Wang, Xianliang
Xiao, Yanhong
Zhu, Xuan
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 32 - 36
[42] Tandem Assessment of Spoofing Countermeasures and Automatic Speaker Verification: Fundamentals
Kinnunen, Tomi
Delgado, Hector
Evans, Nicholas
Lee, Kong Aik
Vestman, Ville
Nautsch, Andreas
Todisco, Massimiliano
Wang, Xin
Sahidullah, Md
Yamagishi, Junichi
Reynolds, Douglas A.
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 2195 - 2210
[43] An assessment of automatic speaker verification vulnerabilities to replay spoofing attacks
Janicki, Artur
Alegre, Federico
Evans, Nicholas
SECURITY AND COMMUNICATION NETWORKS, 2016, 9 (15) : 3030 - 3044
[44] Optimizing Tandem Speaker Verification and Anti-Spoofing Systems
Kanervisto, Anssi
Hautamaki, Ville
Kinnunen, Tomi
Yamagishi, Junichi
IEEE/ACM Transactions on Audio Speech and Language Processing, 2022, 30 : 477 - 488
[45] Anti-spoofing Methods for Automatic Speaker Verification System
Lavrentyeva, Galina
Novoselov, Sergey
Simonchik, Konstantin
ANALYSIS OF IMAGES, SOCIAL NETWORKS AND TEXTS, AIST 2016, 2017, 661 : 172 - 184
[46] Optimizing Tandem Speaker Verification and Anti-Spoofing Systems
Kanervisto, Anssi
Hautamaki, Ville
Kinnunen, Tomi
Yamagishi, Junichi
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 477 - 488
[47] Spoofing Speaker Verification With Voice Style Transfer And Reconstruction Loss
Thebaud, Thomas
Le Lan, Gael
Larcher, Anthony
2021 IEEE INTERNATIONAL WORKSHOP ON INFORMATION FORENSICS AND SECURITY (WIFS), 2021, : 7 - 13
[48] SV-DeiT: Speaker Verification with DeiTCap Spoofing Detection
Ranjan, Rishabh
Vatsa, Mayank
Singh, Richa
2023 IEEE INTERNATIONAL JOINT CONFERENCE ON BIOMETRICS, IJCB, 2023,
[49] Optimizing a-DCF for Spoofing-Robust Speaker Verification
Kurnaz, Oguzhan
Mishra, Jagabandhu
Kinnunen, Tomi H.
Hanilci, Cemal
IEEE SIGNAL PROCESSING LETTERS, 2025, 32 : 1081 - 1085
[50] Group-based speaker embeddings for text-independent speaker verification
Jung, Youngmoon
Eom, Youngsik
Lee, Yeonghyeon
Kim, Hoirin
JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2021, 40 (05): : 496 - 502

← 1 2 3 4 5 →