Training Augmentation with Adversarial Examples for Robust Speech Recognition

Cited by: 18

Authors:

Sun, Sining [1]

Yeh, Ching-Feng [2]

Ostendorf, Mari [3]

Hwang, Mei-Yuh [2]

Xie, Lei [1]

Affiliations:

[1] Northwestern Polytech Univ, Sch Comp Sci, Xian, Shaanxi, Peoples R China

[2] Mobvoi AI Lab, Seattle, WA USA

[3] Univ Washington, Dept Elect Engn, Seattle, WA 98195 USA

Source:

19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES | 2018

Funding:

National Natural Science Foundation of China;

Keywords:

robust speech recognition; adversarial examples; FGSM; data augmentation; teacher-student model; ADAPTATION;

DOI:

10.21437/Interspeech.2018-1247

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper explores the use of adversarial examples in training speech recognition systems to increase robustness of deep neural network acoustic models. During training, the fast gradient sign method is used to generate adversarial examples augmenting the original training data. Different from conventional data augmentation based on data transformations, the examples are dynamically generated based on current acoustic model parameters. We assess the impact of adversarial data augmentation in experiments on the Aurora-4 and CHiME-4 single-channel tasks, showing improved robustness against noise and channel variation. Further improvement is obtained when combining adversarial examples with teacher/student training, leading to a 23% relative word error rate reduction on Aurora-4.
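The fast gradient sign method named in the abstract perturbs each input by a small step in the direction of the sign of the loss gradient with respect to that input, using the model's current parameters. The following is a minimal sketch of that idea, not the authors' implementation: a linear softmax classifier stands in for the deep neural network acoustic model so that the input gradient can be written analytically, and the function names, the 40-dimensional features, `eps`, `lr`, and the choice to concatenate clean and adversarial examples in one batch are all assumptions made for illustration.

```python
import numpy as np

def softmax(z):
    # Numerically stable softmax over the last axis.
    z = z - z.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def cross_entropy(W, b, x, y):
    # Mean cross-entropy of a linear softmax model (stand-in for a DNN).
    p = softmax(x @ W + b)
    return -np.log(p[np.arange(len(y)), y]).mean()

def output_error(W, b, x, y):
    # Gradient of the mean loss w.r.t. the logits: (p - onehot(y)) / N.
    p = softmax(x @ W + b)
    p[np.arange(len(y)), y] -= 1.0
    return p / len(y)

def fgsm(W, b, x, y, eps):
    # x_adv = x + eps * sign(dL/dx), computed with the current W and b.
    grad_x = output_error(W, b, x, y) @ W.T
    return x + eps * np.sign(grad_x)

def augmented_step(W, b, x, y, eps, lr):
    # Hypothetical training step: regenerate adversarial examples from the
    # current parameters, then take one SGD step on clean + adversarial data.
    x_adv = fgsm(W, b, x, y, eps)
    x_all = np.concatenate([x, x_adv])
    y_all = np.concatenate([y, y])
    err = output_error(W, b, x_all, y_all)
    return W - lr * (x_all.T @ err), b - lr * err.sum(axis=0)

# Toy data: 8 frames of 40-dimensional features, 10 classes (all assumed).
rng = np.random.default_rng(0)
W = rng.normal(size=(40, 10))
b = np.zeros(10)
x = rng.normal(size=(8, 40))
y = rng.integers(0, 10, size=8)

x_adv = fgsm(W, b, x, y, eps=0.1)
clean_loss = cross_entropy(W, b, x, y)
adv_loss = cross_entropy(W, b, x_adv, y)
print(f"clean loss {clean_loss:.4f}, adversarial loss {adv_loss:.4f}")
```

Because `fgsm` is called inside `augmented_step`, the adversarial examples change as the parameters change, which is the contrast the abstract draws with augmentation based on fixed data transformations. How the paper weights or schedules the clean and adversarial terms is not stated in this record, so the equal mixing above should be read as a placeholder.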


Pages: 2404-2408

Page count: 5
