Training Augmentation with Adversarial Examples for Robust Speech Recognition

被引:18
|
作者
Sun, Sining [1 ]
Yeh, Ching-Feng [2 ]
Ostendorf, Mari [3 ]
Hwang, Mei-Yuh [2 ]
Xie, Lei [1 ]
机构
[1] Northwestern Polytech Univ, Sch Comp Sci, Xian, Shaanxi, Peoples R China
[2] Mobvoi AI Lab, Seattle, WA USA
[3] Univ Washington, Dept Elect Engn, Seattle, WA 98195 USA
基金
中国国家自然科学基金;
关键词
robust speech recognition; adversarial examples; FGSM; data augmentation; teacher-student model; ADAPTATION;
D O I
10.21437/Interspeech.2018-1247
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper explores the use of adversarial examples in training speech recognition systems to increase robustness of deep neural network acoustic models. During training, the fast gradient sign method is used to generate adversarial examples augmenting the original training data. Different from conventional data augmentation based on data transformations, the examples are dynamically generated based on current acoustic model parameters. We assess the impact of adversarial data augmentation in experiments on the Aurora-4 and CHiME-4 single-channel tasks, showing improved robustness against noise and channel variation. Further improvement is obtained when combining adversarial examples with teacher/student training, leading to a 23% relative word error rate reduction on Aurora-4.
引用
收藏
页码:2404 / 2408
页数:5
相关论文
共 50 条
  • [21] Improving Speech Emotion Recognition With Adversarial Data Augmentation Network
    Yi, Lu
    Mak, Man-Wai
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (01) : 172 - 184
  • [22] A robust adversarial attack against speech recognition with UAP
    Qin, Ziheng
    Zhang, Xianglong
    Li, Shujun
    HIGH-CONFIDENCE COMPUTING, 2023, 3 (01):
  • [23] ROBUST SPEECH RECOGNITION USING GENERATIVE ADVERSARIAL NETWORKS
    Sriram, Anuroop
    Jun, Heewoo
    Gaur, Yashesh
    Satheesh, Sanjeev
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5639 - 5643
  • [24] EXPLORING SPEECH ENHANCEMENT WITH GENERATIVE ADVERSARIAL NETWORKS FOR ROBUST SPEECH RECOGNITION
    Donahue, Chris
    Li, Bo
    Prabhavalkar, Rohit
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5024 - 5028
  • [25] Efficient Training of Robust Decision Trees Against Adversarial Examples
    Vos, Daniel
    Verwer, Sicco
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139 : 7599 - 7608
  • [26] Boosting Adversarial Training Using Robust Selective Data Augmentation
    Rasheed, Bader
    Khattak, Asad Masood
    Khan, Adil
    Protasov, Stanislav
    Ahmad, Muhammad
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2023, 16 (01)
  • [27] Boosting Adversarial Training Using Robust Selective Data Augmentation
    Bader Rasheed
    Asad Masood Khattak
    Adil Khan
    Stanislav Protasov
    Muhammad Ahmad
    International Journal of Computational Intelligence Systems, 16
  • [28] Robust Recognition of Conversational Telephone Speech via Multi-condition Training and Data Augmentation
    Malek, Jiri
    Zdansky, Jindrich
    Cerva, Petr
    TEXT, SPEECH, AND DIALOGUE (TSD 2018), 2018, 11107 : 324 - 333
  • [29] CONTEXTUAL SPEECH RECOGNITION WITH DIFFICULT NEGATIVE TRAINING EXAMPLES
    Alon, Uri
    Pundak, Golan
    Sainath, Tara N.
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6440 - 6444
  • [30] Towards Visualizing and Detecting Audio Adversarial Examples for Automatic Speech Recognition
    Zong, Wei
    Chow, Yang-Wai
    Susilo, Willy
    INFORMATION SECURITY AND PRIVACY, ACISP 2021, 2021, 13083 : 531 - 549