Training Augmentation with Adversarial Examples for Robust Speech Recognition

Cited by: 18

Authors:

Sun, Sining [1]

Yeh, Ching-Feng [2]

Ostendorf, Mari [3]

Hwang, Mei-Yuh [2]

Xie, Lei [1]

Affiliations:

[1] Northwestern Polytech Univ, Sch Comp Sci, Xian, Shaanxi, Peoples R China

[2] Mobvoi AI Lab, Seattle, WA USA

[3] Univ Washington, Dept Elect Engn, Seattle, WA 98195 USA

Source:

19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES | 2018

Funding:

National Natural Science Foundation of China;

Keywords:

robust speech recognition; adversarial examples; FGSM; data augmentation; teacher-student model; ADAPTATION;

DOI:

10.21437/Interspeech.2018-1247

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper explores the use of adversarial examples in training speech recognition systems to increase robustness of deep neural network acoustic models. During training, the fast gradient sign method is used to generate adversarial examples augmenting the original training data. Different from conventional data augmentation based on data transformations, the examples are dynamically generated based on current acoustic model parameters. We assess the impact of adversarial data augmentation in experiments on the Aurora-4 and CHiME-4 single-channel tasks, showing improved robustness against noise and channel variation. Further improvement is obtained when combining adversarial examples with teacher/student training, leading to a 23% relative word error rate reduction on Aurora-4.
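The fast gradient sign method mentioned in the abstract perturbs each input by a small step in the direction of the sign of the loss gradient with respect to that input, using the model's current parameters. The following is a minimal sketch of that idea, not the authors' implementation: it assumes a linear softmax classifier as a stand-in for the deep neural network acoustic model, and the names `fgsm`, `eps`, the feature dimension (40), and the class count (5) are all hypothetical choices made for illustration.

```python
import numpy as np

def softmax(z):
    # Numerically stable softmax over the class axis.
    z = z - z.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def cross_entropy(W, b, x, y):
    # Per-example cross-entropy loss of a linear softmax classifier.
    p = softmax(x @ W + b)
    return -np.log(p[np.arange(len(y)), y] + 1e-12)

def fgsm(W, b, x, y, eps):
    # Gradient of the loss w.r.t. the input for a linear softmax model:
    # d loss / d x = (p - onehot(y)) @ W.T
    p = softmax(x @ W + b)
    onehot = np.eye(W.shape[1])[y]
    grad_x = (p - onehot) @ W.T
    # FGSM step: move each feature by eps in the sign of its gradient.
    return x + eps * np.sign(grad_x)

# Toy data (assumed shapes): 8 frames, 40-dim features, 5 classes.
rng = np.random.default_rng(0)
x = rng.normal(size=(8, 40))
y = rng.integers(0, 5, size=8)
W = rng.normal(scale=0.1, size=(40, 5))  # "current" model parameters
b = np.zeros(5)
eps = 0.1

x_adv = fgsm(W, b, x, y, eps)
loss_clean = cross_entropy(W, b, x, y)
loss_adv = cross_entropy(W, b, x_adv, y)

# Augmented mini-batch: clean examples plus their adversarial versions.
x_aug = np.concatenate([x, x_adv])
y_aug = np.concatenate([y, y])
```

Because the perturbation is computed from `W` and `b` at the time of the call, regenerating `x_adv` after each parameter update yields examples that track the current model, which is the contrast the abstract draws with fixed, transformation-based augmentation.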


Pages: 2404-2408

Page count: 5
