SV-DeiT: Speaker Verification with DeiTCap Spoofing Detection

被引:0
|
作者
Ranjan, Rishabh [1 ]
Vatsa, Mayank [1 ]
Singh, Richa [1 ]
机构
[1] Indian Inst Technol, Jodhpur, Rajasthan, India
关键词
D O I
10.1109/IJCB57857.2023.10449121
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As advancements in automatic speech generation continue to progress, the ability to distinguish between real and fake samples has diminished. In addition, current spoofing detection algorithms struggle to perform well on new and unseen test distributions. To address these challenges, this paper presents two contributions. First, inspired by the success of transformer and capsule networks in high representation capabilities, we propose the DeiTCap spoof detection network on spectrogram audio features. This framework utilizes multi-head attention, sub-entities (capsules) in the audio domain and a modified routing algorithm to identify capsule agreement. The proposed spoof detection algorithm is integrated into the spoofing aware speaker recognition framework SV-DeiT. Second, we introduce a novel text-to-speech dataset TRADIF created with cutting-edge transformers and diffusion models to evaluate the generalizability of countermeasure systems. Our proposed DeiTCap achieves an EER of 1.08% on the evaluation set of the ASVSpoof2019 LA dataset. Moreover, the proposed network demonstrates strength in cross-domain training-testing with two different datasets, highlighting its robustness and versatility.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Integrated Spoofing Countermeasures and Automatic Speaker Verification: an Evaluation on ASVspoof 2015
    Sahidullah, Md
    Delgado, Hector
    Todisco, Massimiliano
    Yu, Hong
    Kinnunen, Tomi
    Evans, Nicholas
    Tana, Zheng-Hua
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 1700 - 1704
  • [42] Spoofing-Aware Speaker Verification by Multi-Level Fusion
    Wu, Haibin
    Meng, Lingwei
    Kang, Jiawen
    Li, Jinchao
    Li, Xu
    Wu, Xixin
    Lee, Hung-yi
    Meng, Helen
    INTERSPEECH 2022, 2022, : 4357 - 4361
  • [43] SASV 2022: The First Spoofing-Aware Speaker Verification Challenge
    Jung, Jee-weon
    Tak, Hemlata
    Shim, Hye-jin
    Heo, Hee-Soo
    Lee, Bong-Jin
    Chung, Soo-Whan
    Yu, Ha-Jin
    Evans, Nicholas
    Kinnunen, Tomi
    INTERSPEECH 2022, 2022, : 2893 - 2897
  • [44] Constant Q cepstral coefficients: A spoofing countermeasure for automatic speaker verification
    Todisco, Massimiliano
    Delgado, Hector
    Evans, Nicholas
    COMPUTER SPEECH AND LANGUAGE, 2017, 45 : 516 - 535
  • [45] ASVspoof 2015: the First Automatic Speaker Verification Spoofing and Countermeasures Challenge
    Wu, Zhizheng
    Kinnunen, Tomi
    Evans, Nicholas
    Yamagishi, Junichi
    Hanilci, Cemal
    Sahidullah, Md
    Sizov, Aleksandr
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2037 - 2041
  • [46] The CLIPS System for 2022 Spoofing-Aware Speaker Verification Challenge
    Lin, Jucai
    Chen, Tingwei
    Huang, Jingbiao
    Fang, Ruidong
    Yin, Jun
    Yin, Yuanping
    Shi, Wei
    Huang, Weizhen
    Mao, Yapeng
    INTERSPEECH 2022, 2022, : 4367 - 4370
  • [47] Compressed High Dimensional Features for Speaker Spoofing Detection
    Zhao, Yuanjun
    Togneri, Roberto
    Sreeram, Victor
    2017 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC 2017), 2017, : 569 - 572
  • [48] Replay spoofing detection system for automatic speaker verification using multi-task learning of noise classes
    Shim, Hye-Jin
    Jung, Jee-Weon
    Heo, Hee-Soo
    Yoon, Sung-Hyun
    Yu, Ha-Jin
    2018 CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI), 2018, : 172 - 176
  • [49] The GMM and I-Vector Systems Based on Spoofing Algorithms for Speaker Spoofing Detection
    Tang, Hui
    Lei, Zhenchun
    Huang, Zhongying
    Gan, Hailin
    Yu, Kun
    Yang, Yingen
    BIOMETRIC RECOGNITION (CCBR 2019), 2019, 11818 : 502 - 510
  • [50] Development of CRIM System for the Automatic Speaker Verification Spoofing and Countermeasures Challenge 2015
    Alam, Md Jahangir
    Kenny, Patrick
    Bhattacharya, Gautam
    Stafylakis, Themos
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2072 - 2076