SV-DeiT: Speaker Verification with DeiTCap Spoofing Detection

Authors
Ranjan, Rishabh [1]
Vatsa, Mayank [1]
Singh, Richa [1]
Affiliations
[1] Indian Institute of Technology, Jodhpur, Rajasthan, India
DOI
10.1109/IJCB57857.2023.10449121
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
As automatic speech generation advances, real and fake samples have become harder to tell apart. In addition, current spoofing detection algorithms struggle on new and unseen test distributions. To address these challenges, this paper makes two contributions. First, inspired by the strong representation capabilities of transformer and capsule networks, we propose DeiTCap, a spoof detection network that operates on spectrogram audio features. The framework uses multi-head attention, sub-entities (capsules) in the audio domain, and a modified routing algorithm to identify capsule agreement. The proposed spoof detection algorithm is integrated into the spoofing-aware speaker recognition framework SV-DeiT. Second, we introduce TRADIF, a novel text-to-speech dataset created with cutting-edge transformer and diffusion models, to evaluate the generalizability of countermeasure systems. The proposed DeiTCap achieves an equal error rate (EER) of 1.08% on the evaluation set of the ASVSpoof2019 LA dataset. Moreover, the network holds up in cross-domain training and testing across two different datasets, which indicates robustness and versatility.
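The abstract reports its headline result as an equal error rate (EER), the operating point at which the false acceptance rate (spoofed audio accepted) equals the false rejection rate (bona fide audio rejected). The following is a minimal sketch of how an EER can be estimated from countermeasure scores. It assumes that a higher score means "more likely bona fide"; the function name `equal_error_rate` and the toy scores are hypothetical illustrations, not the authors' evaluation code or data.

```python
def equal_error_rate(bonafide_scores, spoof_scores):
    """Estimate the EER by sweeping every observed score as a threshold.

    Assumption: higher score = more likely bona fide, and a sample is
    accepted when its score is >= the threshold.
    """
    thresholds = sorted(set(bonafide_scores) | set(spoof_scores))
    best_gap, best_eer = None, None
    for t in thresholds:
        # False acceptance rate: spoofed samples that pass the threshold.
        far = sum(s >= t for s in spoof_scores) / len(spoof_scores)
        # False rejection rate: bona fide samples that fail the threshold.
        frr = sum(s < t for s in bonafide_scores) / len(bonafide_scores)
        gap = abs(far - frr)
        # Keep the threshold where the two error rates are closest.
        if best_gap is None or gap < best_gap:
            best_gap, best_eer = gap, (far + frr) / 2
    return best_eer


# Toy scores (hypothetical): one bona fide sample scores low,
# one spoofed sample scores high.
bonafide = [0.9, 0.8, 0.7, 0.35]
spoof = [0.1, 0.2, 0.3, 0.75]
eer = equal_error_rate(bonafide, spoof)
print(f"EER = {eer:.2%}")
```

On this toy data the two error rates meet at a threshold of 0.7, where one of four spoofed samples is accepted and one of four bona fide samples is rejected. Evaluation toolkits typically interpolate between thresholds rather than picking the closest observed one, so values on real data may differ slightly from this sketch.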
Pages: 10