Improved Speech Authenticity Detection in Chinese-English Bilingual Contexts

被引:0
|
作者
Tsai, Cheng-Yuan [1 ]
Chang, Sheng-Chain [2 ,3 ]
Hung, Chao-Hsiang [2 ,3 ]
Wang, Syu-Siang [2 ,3 ]
Fang, Shih-Hau [2 ,3 ,4 ]
机构
[1] Minist Justice Invest Bur, Forens Sci Div, New Taipei City 231, Taiwan
[2] Yuan Ze Univ, Dept Elect Engn, Taoyuan City 320, Taiwan
[3] Yuan Ze Univ, AI Res Ctr, Taoyuan City 320, Taiwan
[4] Natl Taiwan Normal Univ, Dept Elect Engn, Taipei City 106, Taiwan
关键词
acoustic signal processing; machine learning; forgery detection; audio tampering; signal analysis; deep learning; AUDIO;
D O I
10.3390/s24216807
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
The rapid evolution of voice technology has heightened the need for robust detection systems to distinguish between authentic and tampered speech. Recent competitions have significantly advanced the development of countermeasures against spoofing attacks. However, while advancements in detection technologies have been notable, existing methods often focus on a single type of tampering and language. Our contribution lies in developing an improved model that integrates an enhanced ResNet architecture with an LSTM to improve the detection of tampered audio, particularly in challenging multilingual scenarios. In the experiments, we built a hybrid dataset from self-recording Chinese speech and public VCTK2 English samples, enhanced the ResNet model generalization capabilities, and evaluated our approach using the bilingual dataset. Experiment results demonstrate that the proposed approach achieves a superior performance with an equal error rate of 11.62%, even in the face of bilingual conditions, and, more importantly, outperforms the leading models from ASVSpoof 2021 and ADD 2022 competitions. We also employed advanced tampering techniques, including CycleGAN voice conversion and auto splicing, to simulate real-world tampering scenarios and verify the effectiveness of the proposed approach.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] BVTED: A Specialized Bilingual (Chinese-English) Dataset for Vulnerability Triple Extraction Tasks
    Liu, Kai
    Wang, Yi
    Ding, Zhaoyun
    Li, Aiping
    Zhang, Weiming
    APPLIED SCIENCES-BASEL, 2024, 14 (16):
  • [42] On Chinese-English Dictionaries
    吴景荣
    外国语(上海外国语学院学报), 1991, (04) : 66 - 76
  • [43] An eye-movement database of bilingual language control for Chinese-English bilinguals
    Wang, Tao
    Wang, Yue
    Hu, Haibo
    Wang, Xing
    Chen, Shengdong
    Yang, Yiming
    SCIENTIFIC DATA, 2025, 12 (01)
  • [44] EVIDENCE FOR THE ROLE OF FREQUENCY IN THE ACQUISITION OF LEXICALIZATION PATTERNS OF CHINESE-ENGLISH BILINGUAL CHILDREN
    Nicoladis, Elena
    Yin, Hui
    JOURNAL OF CHINESE LINGUISTICS, 2010, 38 (02) : 288 - 322
  • [45] Is the Chinese number-naming system transparent? Evidence from Chinese-English bilingual children
    Rasmussen, C
    Ho, E
    Nicoladis, E
    Leung, J
    Bisanz, J
    CANADIAN JOURNAL OF EXPERIMENTAL PSYCHOLOGY-REVUE CANADIENNE DE PSYCHOLOGIE EXPERIMENTALE, 2006, 60 (01): : 60 - 67
  • [46] The Chinese-English Dictionary
    Huang, Jin-hong
    INTERNATIONAL JOURNAL OF LEXICOGRAPHY, 2024,
  • [47] Testing to prevent bad translation: Brand name conversions in Chinese-English contexts
    Kum, Doreen
    Lee, Yih Hwai
    Qiu, Cheng
    JOURNAL OF BUSINESS RESEARCH, 2011, 64 (06) : 594 - 600
  • [48] Chinese And English Bilingual Scene Text Detection
    Sha, Yuan
    Shi, Ping
    You, Jian
    Bao, Xiaojie
    Fu, Sizhe
    Zeng, Guoxiang
    2017 IEEE 3RD INFORMATION TECHNOLOGY AND MECHATRONICS ENGINEERING CONFERENCE (ITOEC), 2017, : 499 - 503
  • [49] Phonological and morphological literacy skills in English and Chinese: A cross-linguistic neuroimaging comparison of Chinese-English bilingual and monolingual English children
    Zhang, Kehui
    Sun, Xin
    Yu, Chi-Lin
    Eggleston, Rachel L.
    Marks, Rebecca A.
    Nickerson, Nia
    Caruso, Valeria C.
    Hu, Xiao-Su
    Tardif, Twila
    Chou, Tai-Li
    Booth, James R.
    Kovelman, Ioulia
    HUMAN BRAIN MAPPING, 2023, 44 (13) : 4812 - 4829
  • [50] The production of referring expressions in oral narratives of Chinese-English bilingual speakers and monolingual peers
    Chen, Liang
    Lei, Jianghua
    CHILD LANGUAGE TEACHING & THERAPY, 2013, 29 (01): : 41 - 55