DeepDet: YAMNet with BottleNeck Attention Module (BAM) for TTS synthesis detection

Authors
Rabbia Mahum
Aun Irtaza
Ali Javed
Haitham A. Mahmoud
Haseeb Hassan
Affiliations
[1] UET Taxila, Computer Science Department
[2] UET Taxila, Software Engineering Department
[3] King Saud University, Industrial Engineering Department, College of Engineering
[4] Shenzhen Technology University (SZTU), College of Big Data and Internet
Source
EURASIP Journal on Audio, Speech, and Music Processing, vol. 2024
Keywords
Deep learning; Spoofing detector; Fake speech detection
DOI
Not available
Abstract
Spoofed speech is becoming a serious threat to society as artificial intelligence techniques advance, so an automated spoofing detector that can be integrated into automatic speaker verification (ASV) systems is needed. In this study, we propose a novel and robust model, named DeepDet, based on a deep-layered architecture, to classify speech into two classes: spoofed and bona fide. DeepDet is an improved model based on Yet Another Mobile Network (YAMNet), employing a customized MobileNet combined with a bottleneck attention module (BAM). First, we convert audio into mel-spectrograms, which are time-frequency representations on the mel scale. Second, we train our deep-layered model on mel-spectrograms extracted from the Logical Access (LA) set of the ASVspoof 2019 dataset, which includes synthesized speech and voice conversion samples. Finally, we classify the audio using the trained binary classifier. More precisely, we exploit the layered architecture and guided attention to discern spoofed speech from bona fide samples. The proposed model employs depth-wise separable convolutions, which make it lighter than existing techniques. Furthermore, we conducted extensive experiments to assess the performance of the proposed model on the ASVspoof 2019 corpus. We attained an equal error rate (EER) of 0.042% on Logical Access (LA) attacks and 0.43% on Physical Access (PA) attacks. These results on the ASVspoof 2019 dataset indicate the effectiveness of DeepDet over existing spoofing detectors. Additionally, the proposed model is robust enough to identify unseen spoofed audio and to classify several attack types accurately.
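The equal error rate (EER) reported in the abstract is the operating point at which the false acceptance rate (spoofed audio accepted as bona fide) equals the false rejection rate (bona fide audio rejected). The abstract does not describe the authors' scoring protocol, so the following is only a minimal sketch of how an EER can be estimated from detector scores. It assumes that higher scores mean "more likely bona fide"; the function name `equal_error_rate` and the toy scores are hypothetical and are not taken from the paper.

```python
import numpy as np

def equal_error_rate(bonafide_scores, spoof_scores):
    """Estimate the EER from two score arrays (illustrative sketch only).

    Assumption: a higher score means the detector considers the
    utterance more likely to be bona fide.
    Returns (eer, threshold) at the point where FAR and FRR are closest.
    """
    bonafide = np.asarray(bonafide_scores, dtype=float)
    spoof = np.asarray(spoof_scores, dtype=float)

    # Candidate thresholds: every distinct observed score, ascending.
    thresholds = np.unique(np.concatenate([bonafide, spoof]))

    # False rejection rate: bona fide samples scored below the threshold.
    frr = np.array([(bonafide < t).mean() for t in thresholds])
    # False acceptance rate: spoofed samples scored at or above the threshold.
    far = np.array([(spoof >= t).mean() for t in thresholds])

    # The EER is taken where the two error curves are closest to each other.
    idx = int(np.argmin(np.abs(far - frr)))
    eer = (far[idx] + frr[idx]) / 2.0
    return eer, thresholds[idx]

# Toy scores (hypothetical, not results from the paper).
toy_bonafide = [0.9, 0.8, 0.7, 0.35]
toy_spoof = [0.1, 0.2, 0.3, 0.75]
toy_eer, toy_threshold = equal_error_rate(toy_bonafide, toy_spoof)
print(f"EER = {toy_eer:.2%} at threshold {toy_threshold}")
```

With this convention, an EER of 0.042% would mean that, at the balanced threshold, roughly 4 in 10,000 trials of each type are misclassified. Official ASVspoof evaluations use their own scoring tools, which may interpolate between thresholds and therefore differ slightly from this discrete estimate.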
Related papers
50 in total
  • [21] Pulmonary Nodule Detection Based on Convolutional Block Attention Module
    Wen, Chenrui
    Hong, Minjie
    Yang, Xinhao
    Jia, Juncheng
    PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 8583 - 8587
  • [22] A Helmet Detection Algorithm Based on Transformers with Deformable Attention Module
    Chen, Songle
    Sun, Hongbo
    Wu, Yuxin
    Shang, Lei
    Ruan, Xiukai
    CHINESE JOURNAL OF ELECTRONICS, 2025, 34 (01) : 229 - 241
  • [23] A Helmet Detection Algorithm Based on Transformers with Deformable Attention Module
    Songle Chen
    Hongbo Sun
    Yuxin Wu
    Lei Shang
    Xiukai Ruan
    Chinese Journal of Electronics, 2025, 34 (01) : 229 - 241
  • [24] Weakly supervised video anomaly detection with temporal attention module
    Song, Wonjoon
    Kim, Jonghyun
    Kim, Joongkyu
    2022 37TH INTERNATIONAL TECHNICAL CONFERENCE ON CIRCUITS/SYSTEMS, COMPUTERS AND COMMUNICATIONS (ITC-CSCC 2022), 2022, : 982 - 985
  • [25] Object Detection Network Based on Module Stack and Attention Mechanism
    Dou, Xinke
    Wang, Ting
    Shao, Shiliang
    Cao, Xianqing
    ELECTRONICS, 2023, 12 (17)
  • [26] Deep convolutional multi-informative metric correlation analysis with bottleneck attention module for face recognition in the wild
    Amrani, Moussa
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (22) : 62459 - 62487
  • [27] Hot spot detection algorithm of photovoltaic module based on attention mechanism
    Fan, Tao
    Sun, Tao
    Liu, Hu
    Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2022, 48 (07) : 1304 - 1313
  • [28] Research on improved YOLOx weed detection based on lightweight attention module
    Zhu, Huibin
    Zhang, Yuanyuan
    Mu, Danlei
    Bai, Lizhen
    Wu, Xian
    Zhuang, Hao
    Li, Hui
    CROP PROTECTION, 2024, 177
  • [29] A General Multiscale Pyramid Attention Module for Ship Detection in SAR Images
    Wang, Peng
    Chen, Yongkang
    Yang, Yi
    Chen, Ping
    Zhang, Gong
    Zhu, Daiyin
    Jie, Yongshi
    Jiang, Cheng
    Leung, Henry
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 2815 - 2827
  • [30] PEDESTRIAN DETECTION BASED ON SPATIAL ATTENTION MODULE FOR OUTDOOR VIDEO SURVEILLANCE
    Wang, Xiaoyan
    Hu, Hai-Miao
    Zhang, Yugui
    2019 IEEE FIFTH INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM 2019), 2019, : 247 - 251