Efficient Attention Branch Network with Combined Loss Function for Automatic Speaker Verification Spoof Detection

被引:0
|
作者
Amir Mohammad Rostami
Mohammad Mehdi Homayounpour
Ahmad Nickabadi
机构
[1] Amirkabir University of Technology,Department of Computer Engineering
关键词
Automatic speaker verification; Spoof detection; ASVspoof; Efficient attention branch network; Combined loss function; EfficientNet-A0;
D O I
暂无
中图分类号
学科分类号
摘要
Many endeavors have sought to develop countermeasure techniques as enhancements on Automatic Speaker Verification (ASV) systems, in order to make them more robust against spoof attacks. As evidenced by the latest ASVspoof 2019 countermeasure challenge, models currently deployed for the task of ASV are, at their best, devoid of suitable degrees of generalization to unseen attacks. A joint improvement of components of ASV spoof detection systems including the classifier, feature extraction phase, and model loss function may lead to a better detection of attacks by these systems. Accordingly, the present study proposes the Efficient Attention Branch Network (EABN) architecture with a combined loss function to address the model generalization to unseen attacks. The EABN is based on attention and perception branches. The attention branch provides an attention mask that improves the classification performance and at the same time is interpretable from a human point of view. The perception branch, is used for our main purpose which is spoof detection. The new EfficientNet-A0 architecture was optimized and employed for the perception branch, with nearly ten times fewer parameters and approximately seven times fewer floating-point operations than the SE-Res2Net50 as the best existing network. The proposed method on ASVspoof 2019 dataset achieved EER = 0.86% and t-DCF = 0.0239 in the Physical Access (PA) scenario using the logPowSpec as the input feature extraction method. Furthermore, using the LFCC feature, and the SE-Res2Net50 for the perception branch, the proposed model achieved EER = 1.89% and t-DCF = 0.507 in the Logical Access (LA) scenario, which to the best of our knowledge, is the best single system ASV spoofing countermeasure method.
引用
收藏
页码:4252 / 4270
页数:18
相关论文
共 50 条
  • [21] Generative and Discriminative Modelling of Linear Energy Sub-bands for Spoof Detection in Speaker Verification Systems
    Suvidha Rupesh Kumar
    B. Bharathi
    Circuits, Systems, and Signal Processing, 2022, 41 : 3811 - 3831
  • [22] Influence of Packet Loss on a Speaker Verification System over IP Network
    Polacky, Jozef
    Pocta, Peter
    Jarina, Roman
    PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE RADIOELEKTRONIKA (RADIOELEKTRONIKA 2016), 2016, : 390 - 394
  • [23] Verikube: Automatic and Efficient Verification for Container Network Policies
    Kang, Haney
    Shin, Seungwon
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2022, E105D (12) : 2131 - 2134
  • [24] SIMPLE ATTENTION MODULE BASED SPEAKER VERIFICATION WITH ITERATIVE NOISY LABEL DETECTION
    Qin, Xiaoyi
    Li, Na
    Weng, Chao
    Su, Dan
    Li, Ming
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6722 - 6726
  • [25] MHASAN: Multi-Head Angular Self Attention Network for Spoof Detection
    Hasan, Md
    Roy, Koushik
    Rupty, Labiba
    Hossain, Md. Sourave
    Sengupta, Shirshajit
    Taus, Shehzad Noor
    Mohammed, Nabeel
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 154 - 160
  • [26] Replay spoof detection for speaker verification system using magnitude-phase-instantaneous frequency and energy features
    Bharath, K. P.
    Kumar, M. Rajesh
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (27) : 39343 - 39366
  • [27] Semi-supervised speech activity detection with an application to automatic speaker verification
    Sholokhov, Alexey
    Sahidullah, Md
    Kinnunen, Tomi
    COMPUTER SPEECH AND LANGUAGE, 2018, 47 : 132 - 156
  • [28] LOGICAL ACCESS ATTACKS DETECTION THROUGH AUDIO FINGERPRINTING IN AUTOMATIC SPEAKER VERIFICATION
    Espin, Juan M.
    Font, R.
    Marin-Blazquez, Javier G.
    Esquembre, F.
    2018 IEEE 28TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2018,
  • [29] EMBANet: A flexible efficient multi-branch attention network
    Zu, Keke
    Zhang, Hu
    Zhang, Lei
    Lu, Jian
    Xu, Chen
    Chen, Hongyang
    Zheng, Yu
    NEURAL NETWORKS, 2025, 185
  • [30] Towards Generating Adversarial Examples on Combined Systems of Automatic Speaker Verification and Spoofing Countermeasure
    Zhang, Xingyu
    Zhang, Xiongwei
    Zou, Xia
    Liu, Haibo
    Sun, Meng
    SECURITY AND COMMUNICATION NETWORKS, 2022, 2022