Towards Visualizing and Detecting Audio Adversarial Examples for Automatic Speech Recognition

被引:2
|
作者
Zong, Wei [1 ]
Chow, Yang-Wai [1 ]
Susilo, Willy [1 ]
机构
[1] Univ Wollongong, Sch Comp & Informat Technol, Inst Cybersecur & Cryptol, Wollongong, NSW, Australia
关键词
Adversarial machine learning; Adversarial example; Anomaly detection; Visualization;
D O I
10.1007/978-3-030-90567-5_27
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Automatic speech recognition (ASR) systems are now ubiquitous in many commonly used applications, as various commercial products rely on ASR techniques, which are increasingly based on machine learning, to transcribe voice commands into text for further processing. However, audio adversarial examples (AEs) have emerged as a serious security threat, as they have been shown to be able to fool ASR models into producing incorrect results. Although there are proposed methods to defend against audio AEs, the intrinsic properties of audio AEs compared with benign audio have not been well studied. In this paper, we show that the machine learning decision boundary patterns around audio AEs and benign audio are fundamentally different. In addition, using dimensionality reduction techniques, we show that these different patterns can be distinguished visually in 2D space. Based on dimensionality reduction results, this paper also demonstrates that it is feasible to detect previously unknown audio AEs using anomaly detection methods.
引用
收藏
页码:531 / 549
页数:19
相关论文
共 50 条
  • [41] DOMPTEUR: Taming Audio Adversarial Examples
    Eisenhofer, Thorsten
    Schoenherr, Lea
    Frank, Joel
    Speckemeier, Lars
    Kolossa, Dorothea
    Holz, Thorsten
    PROCEEDINGS OF THE 30TH USENIX SECURITY SYMPOSIUM, 2021, : 2309 - 2326
  • [42] Creating Simple Adversarial Examples for Speech Recognition Deep Neural Networks
    Redden, Nathaniel
    Bernard, Ben
    Straub, Jeremy
    2019 IEEE 16TH INTERNATIONAL CONFERENCE ON MOBILE AD HOC AND SENSOR SYSTEMS WORKSHOPS (MASSW 2019), 2019, : 58 - 62
  • [43] Towards Automatic Assessment of Aphasia Speech Using Automatic Speech Recognition Techniques
    Qin, Ying
    Lee, Tan
    Kong, Anthony Pak Hin
    Law, Sam Po
    2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,
  • [44] An audio-visual corpus for speech perception and automatic speech recognition (L)
    Cooke, Martin
    Barker, Jon
    Cunningham, Stuart
    Shao, Xu
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2006, 120 (05): : 2421 - 2424
  • [45] Indonesian Audio-Visual Speech Corpus for Multimodal Automatic Speech Recognition
    Maulana, Muhammad Rizki Aulia Rahman
    Fanany, Mohamad Ivan
    2017 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE AND INFORMATION SYSTEMS (ICACSIS), 2017, : 381 - 385
  • [46] Hear No Evil: Towards Adversarial Robustness of Automatic Speech Recognition via Multi-Task Learning
    Das, Nilaksh
    Chau, Duen Horng
    INTERSPEECH 2022, 2022, : 3839 - 3843
  • [47] Improving Language Modeling with an Adversarial Critic for Automatic Speech Recognition
    Zhang, Yike
    Zhang, Pengyuan
    Yan, Yonghong
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3348 - 3352
  • [48] Towards multilingual interoperability in automatic speech recognition
    Adda-Decker, M
    SPEECH COMMUNICATION, 2001, 35 (1-2) : 5 - 20
  • [49] Towards the Improvement of Automatic Recognition of Dysarthric Speech
    Tolba, Hesham
    EL Torgoman, Ahmed S.
    2009 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, VOL 1, 2009, : 277 - +
  • [50] Effective Adversarial Sample Detection for Securing Automatic Speech Recognition
    Lin, Chih-Yang
    Wang, Yan-Zhang
    Lin, Shou-Kuan
    Farady, Isack
    Jan, Yih-Kuen
    Lin, Wei-Yang
    2024 IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE, AVSS 2024, 2024,