Towards single integrated spoofing-aware speaker verification embeddings

Cited by: 1

Authors:

Mun, Sung Hwan [1]

Shim, Hye-jin [2]

Tak, Hemlata [3]

Wang, Xin [4]

Liu, Xuechen [2,5]

Sahidullah, Md [6]

Jeong, Myeonghun [1]

Han, Min Hyun [1]

Todisco, Massimiliano [3]

Lee, Kong Aik [7]

Yamagishi, Junichi [4]

Evans, Nicholas [3]

Kinnunen, Tomi [2]

Kim, Nam Soo [1]

Jung, Jee-weon [8]

Affiliations:

[1] Seoul Natl Univ, Seoul, South Korea

[2] Univ Eastern Finland, Kuopio, Finland

[3] EURECOM, Biot, France

[4] Natl Inst Informat, Tokyo, Japan

[5] INRIA, Le Chesnay Rocquencourt, France

[6] TCG CREST, Kolkata, India

[7] ASTAR, Inst Infocomm Res, Singapore, Singapore

[8] Carnegie Mellon Univ, Pittsburgh, PA USA

Source:

INTERSPEECH 2023 | 2023

Funding:

Academy of Finland;

Keywords:

spoofing-aware speaker verification; speaker verification; anti-spoofing;

DOI:

10.21437/Interspeech.2023-1402

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Subject classification code:

070206 ; 082403 ;

Abstract:

This study aims to develop single integrated spoofing-aware speaker verification (SASV) embeddings that satisfy two requirements. First, they should reject both non-target speakers' inputs and target speakers' spoofed inputs. Second, they should be competitive with the fusion of automatic speaker verification (ASV) and countermeasure (CM) embeddings, which outperformed single-embedding solutions by a large margin in the SASV2022 challenge. Our analysis suggests that the inferior performance of single SASV embeddings stems from an insufficient amount of training data and the distinct nature of the ASV and CM tasks. To address this, we propose a novel framework that includes multi-stage training and a combination of loss functions. Copy synthesis, combined with several vocoders, is also exploited to address the lack of spoofed data. Experimental results show dramatic improvements, achieving an SASV-EER of 1.06% on the evaluation protocol of the SASV2022 challenge.
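
The SASV-EER figure quoted in the abstract can be illustrated with a small sketch. It assumes the SASV2022 challenge convention, in which target trials form the positive class and both non-target and spoofed trials are pooled as negatives. The function names, the simple threshold sweep, and the toy scores are illustrative assumptions, not the authors' evaluation code.

```python
import numpy as np


def equal_error_rate(positive_scores, negative_scores):
    """Approximate the EER by sweeping every observed score as a threshold."""
    pos = np.asarray(positive_scores, dtype=float)
    neg = np.asarray(negative_scores, dtype=float)
    thresholds = np.sort(np.concatenate([pos, neg]))
    # A trial is accepted when its score is >= the threshold.
    far = np.array([(neg >= t).mean() for t in thresholds])  # false accepts
    frr = np.array([(pos < t).mean() for t in thresholds])   # false rejects
    idx = np.argmin(np.abs(far - frr))  # point where the two rates are closest
    return (far[idx] + frr[idx]) / 2.0


def sasv_eer(target, nontarget, spoof):
    """EER with target trials as positives and non-target + spoof as negatives."""
    negatives = np.concatenate(
        [np.asarray(nontarget, dtype=float), np.asarray(spoof, dtype=float)]
    )
    return equal_error_rate(target, negatives)


# Toy scores (hypothetical): one spoofed trial scores above one target trial.
print(sasv_eer([0.9, 0.8, 0.7, 0.6], [0.1, 0.2], [0.3, 0.65]))  # 0.25
```

Pooling the two negative classes means a single embedding is penalised whether it accepts an impostor or a spoofed utterance, which is the first requirement stated in the abstract.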

Pages: 3989-3993

Page count: 5
