Training Augmentation with Adversarial Examples for Robust Speech Recognition

Cited by: 18
Authors
Sun, Sining [1]
Yeh, Ching-Feng [2]
Ostendorf, Mari [3]
Hwang, Mei-Yuh [2]
Xie, Lei [1]
Affiliations
[1] Northwestern Polytech Univ, Sch Comp Sci, Xian, Shaanxi, Peoples R China
[2] Mobvoi AI Lab, Seattle, WA USA
[3] Univ Washington, Dept Elect Engn, Seattle, WA 98195 USA
Funding
National Natural Science Foundation of China
Keywords
robust speech recognition; adversarial examples; FGSM; data augmentation; teacher-student model; adaptation
DOI
10.21437/Interspeech.2018-1247
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper explores the use of adversarial examples in training speech recognition systems to increase robustness of deep neural network acoustic models. During training, the fast gradient sign method is used to generate adversarial examples augmenting the original training data. Different from conventional data augmentation based on data transformations, the examples are dynamically generated based on current acoustic model parameters. We assess the impact of adversarial data augmentation in experiments on the Aurora-4 and CHiME-4 single-channel tasks, showing improved robustness against noise and channel variation. Further improvement is obtained when combining adversarial examples with teacher/student training, leading to a 23% relative word error rate reduction on Aurora-4.
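The augmentation loop described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: a toy softmax-linear classifier stands in for the DNN acoustic model, and the function names (`fgsm`, `train`), the perturbation size `eps`, the learning rate, and the choice to update on the concatenated clean and adversarial mini-batch are all assumptions made for illustration. The only parts taken from the abstract are the fast gradient sign step, x_adv = x + eps * sign(dL/dx), and the fact that the adversarial examples are regenerated from the current model parameters at every training step.

```python
import numpy as np

def softmax(z):
    z = z - z.max(axis=1, keepdims=True)  # subtract row max for numerical stability
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def cross_entropy(W, b, x, y):
    """Mean cross-entropy of the toy classifier on inputs x with labels y."""
    p = softmax(x @ W + b)
    return -np.mean(np.log(p[np.arange(len(y)), y] + 1e-12))

def grads(W, b, x, y):
    """Gradients of the mean cross-entropy w.r.t. W, b and the inputs x."""
    d = softmax(x @ W + b)
    d[np.arange(len(y)), y] -= 1.0        # dL/dlogits for each example
    gW = x.T @ d / len(y)
    gb = d.mean(axis=0)
    gx = d @ W.T / len(y)                 # gradient w.r.t. the input features
    return gW, gb, gx

def fgsm(W, b, x, y, eps):
    """Fast gradient sign method: step of size eps along the sign of dL/dx."""
    _, _, gx = grads(W, b, x, y)
    return x + eps * np.sign(gx)

def train(x, y, n_classes, eps=0.1, lr=0.5, steps=200, batch=32, seed=0):
    """SGD with adversarial augmentation (hypothetical hyperparameters)."""
    rng = np.random.default_rng(seed)
    W = np.zeros((x.shape[1], n_classes))
    b = np.zeros(n_classes)
    for _ in range(steps):
        idx = rng.choice(len(x), size=batch, replace=False)
        xb, yb = x[idx], y[idx]
        # Adversarial examples are generated from the *current* parameters,
        # so they change as the model changes.
        x_adv = fgsm(W, b, xb, yb, eps)
        # Assumption: one update on clean + adversarial data with shared labels.
        x_aug = np.concatenate([xb, x_adv])
        y_aug = np.concatenate([yb, yb])
        gW, gb, _ = grads(W, b, x_aug, y_aug)
        W -= lr * gW
        b -= lr * gb
    return W, b
```

In this sketch every adversarial example stays within `eps` of its clean counterpart in each feature dimension, which is what distinguishes it from a fixed, model-independent data transformation.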
Pages: 2404-2408
Page count: 5