Towards single integrated spoofing-aware speaker verification embeddings

Cited by: 1

Authors:

Mun, Sung Hwan [1]

Shim, Hye-jin [2]

Tak, Hemlata [3]

Wang, Xin [4]

Liu, Xuechen [2,5]

Sahidullah, Md [6]

Jeong, Myeonghun [1]

Han, Min Hyun [1]

Todisco, Massimiliano [3]

Lee, Kong Aik [7]

Yamagishi, Junichi [4]

Evans, Nicholas [3]

Kinnunen, Tomi [2]

Kim, Nam Soo [1]

Jung, Jee-weon [8]

Affiliations:

[1] Seoul Natl Univ, Seoul, South Korea

[2] Univ Eastern Finland, Kuopio, Finland

[3] EURECOM, Biot, France

[4] Natl Inst Informat, Tokyo, Japan

[5] INRIA, Le Chesnay Rocquencourt, France

[6] TCG CREST, Kolkata, India

[7] ASTAR, Inst Infocomm Res, Singapore, Singapore

[8] Carnegie Mellon Univ, Pittsburgh, PA USA

Source:

INTERSPEECH 2023 | 2023

Funding:

Academy of Finland;

Keywords:

spoofing-aware speaker verification; speaker verification; anti-spoofing;

DOI:

10.21437/Interspeech.2023-1402

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification code:

070206 ; 082403 ;

Abstract:

This study aims to develop single integrated spoofing-aware speaker verification (SASV) embeddings that satisfy two requirements. First, they should reject both non-target speakers' inputs and target speakers' spoofed inputs. Second, they should perform competitively with the fusion of automatic speaker verification (ASV) and countermeasure (CM) embeddings, which outperformed single-embedding solutions by a large margin in the SASV2022 challenge. Our analysis attributes the inferior performance of single SASV embeddings to an insufficient amount of training data and the distinct nature of the ASV and CM tasks. To address these issues, we propose a novel framework that includes multi-stage training and a combination of loss functions. Copy synthesis, combined with several vocoders, is also exploited to address the lack of spoofed data. Experimental results show dramatic improvements, achieving an SASV-EER of 1.06% on the evaluation protocol of the SASV2022 challenge.

Citation

Pages: 3989-3993

Page count: 5
