Improved Speech Authenticity Detection in Chinese-English Bilingual Contexts

Authors
Tsai, Cheng-Yuan [1]
Chang, Sheng-Chain [2, 3]
Hung, Chao-Hsiang [2, 3]
Wang, Syu-Siang [2, 3]
Fang, Shih-Hau [2, 3, 4]
Affiliations
[1] Minist Justice Invest Bur, Forens Sci Div, New Taipei City 231, Taiwan
[2] Yuan Ze Univ, Dept Elect Engn, Taoyuan City 320, Taiwan
[3] Yuan Ze Univ, AI Res Ctr, Taoyuan City 320, Taiwan
[4] Natl Taiwan Normal Univ, Dept Elect Engn, Taipei City 106, Taiwan
Keywords
acoustic signal processing; machine learning; forgery detection; audio tampering; signal analysis; deep learning; AUDIO
DOI
10.3390/s24216807
Chinese Library Classification number
O65 [Analytical Chemistry]
Discipline classification code
070302; 081704
Abstract
The rapid evolution of voice technology has heightened the need for robust systems that distinguish authentic speech from tampered speech. Recent competitions have significantly advanced countermeasures against spoofing attacks. However, existing detection methods often focus on a single type of tampering and a single language. Our contribution is an improved model that integrates an enhanced ResNet architecture with an LSTM to better detect tampered audio, particularly in challenging multilingual scenarios. In the experiments, we built a hybrid dataset from self-recorded Chinese speech and public VCTK2 English samples, enhanced the generalization capability of the ResNet model, and evaluated our approach on this bilingual dataset. Experimental results show that the proposed approach achieves an equal error rate of 11.62% under bilingual conditions and, more importantly, outperforms the leading models from the ASVspoof 2021 and ADD 2022 competitions. We also employed advanced tampering techniques, including CycleGAN voice conversion and automatic splicing, to simulate real-world tampering scenarios and verify the effectiveness of the proposed approach.
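The abstract reports its headline result as an equal error rate (EER), the operating point at which the false-positive and false-negative rates coincide. The sketch below shows one common way to estimate that metric from detector scores. It is an illustration only, not the authors' evaluation code: the function name `equal_error_rate`, the convention that higher scores mean "more likely tampered", and the label encoding (1 = tampered, 0 = authentic) are all assumptions made for this example.

```python
def equal_error_rate(scores, labels):
    """Estimate the EER and the threshold where it occurs.

    Assumed convention (hypothetical, not taken from the paper):
    higher score = more likely tampered; label 1 = tampered, 0 = authentic.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]  # tampered samples
    neg = [s for s, y in zip(scores, labels) if y == 0]  # authentic samples
    if not pos or not neg:
        raise ValueError("need at least one sample of each class")

    best = None
    # Sweep every distinct score as a candidate decision threshold.
    for t in sorted(set(scores)):
        # Authentic speech wrongly flagged as tampered.
        fpr = sum(s >= t for s in neg) / len(neg)
        # Tampered speech wrongly accepted as authentic.
        fnr = sum(s < t for s in pos) / len(pos)
        gap = abs(fpr - fnr)
        # Keep the threshold where the two error rates are closest.
        if best is None or gap < best[0]:
            best = (gap, (fpr + fnr) / 2, t)
    return best[1], best[2]


# Usage with made-up scores (illustrative data only).
eer, threshold = equal_error_rate([0.1, 0.6, 0.4, 0.9], [0, 0, 1, 1])
print(f"EER = {eer:.2%} at threshold {threshold}")
```

With a finite test set the two error curves rarely cross exactly, so this sketch averages them at the closest point; interpolating between neighboring thresholds is another common choice.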
Pages: 15