A Survey of Adversarial Defenses and Robustness in NLP

Cited by: 36
Authors:
Goyal, Shreya [1 ]
Doddapaneni, Sumanth [1 ]
Khapra, Mitesh M. [1 ]
Ravindran, Balaraman [1 ]
Affiliations:
[1] Indian Institute of Technology Madras, Bhupat & Jyoti Mehta School of Biosciences, Robert Bosch Centre for Data Science and AI, Chennai 600036, Tamil Nadu, India
Keywords:
adversarial attacks; adversarial defenses; perturbations; NLP; deep neural networks; computer vision; attacks
DOI: 10.1145/3593042
Chinese Library Classification: TP301 [Theory and Methods]
Subject Classification Code: 081202
Abstract:
In the past few years, it has become increasingly evident that deep neural networks are not resilient enough to withstand adversarial perturbations in input data, leaving them vulnerable to attack. Various authors have proposed strong adversarial attacks for computer vision and Natural Language Processing (NLP) tasks. As a response, many defense mechanisms have also been proposed to prevent these networks from failing. Defending neural networks against adversarial attacks matters because it ensures that a model's predictions remain unchanged even when the input data is perturbed. Several methods for adversarial defense in NLP have been proposed, catering to different NLP tasks such as text classification, named entity recognition, and natural language inference. Some of these methods not only defend neural networks against adversarial attacks but also act as a regularization mechanism during training, preventing the model from overfitting. This survey reviews the methods proposed for adversarial defenses in NLP over the past few years by introducing a novel taxonomy. It also highlights the fragility of advanced deep neural networks in NLP and the challenges involved in defending them.
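To make the robustness criterion in the abstract concrete, below is a minimal, self-contained sketch, not taken from the surveyed paper: a greedy word-substitution attack against a toy sentiment classifier, where the model counts as robust on an input if no synonym swap changes its prediction. The classifier, the SYNONYMS table, and all function names here are illustrative assumptions.

```python
# Minimal sketch of the robustness criterion: a model is robust at input x
# if its prediction is unchanged under small, meaning-preserving
# perturbations of x. Everything below is a toy placeholder, not a method
# from the surveyed paper.

# Hypothetical synonym table used to generate candidate perturbations.
SYNONYMS = {
    "good": ["fine", "decent"],
    "bad": ["poor", "terrible"],
}

def toy_sentiment(text: str) -> str:
    """Toy bag-of-words classifier standing in for a real NLP model."""
    positive = {"good", "fine", "great"}
    return "pos" if any(t in positive for t in text.lower().split()) else "neg"

def word_substitution_attack(text, model):
    """Greedy synonym swap: return the first perturbed input that flips
    the model's prediction, or None if the model resists this attack."""
    original = model(text)
    tokens = text.split()
    for i, tok in enumerate(tokens):
        for syn in SYNONYMS.get(tok.lower(), []):
            perturbed = " ".join(tokens[:i] + [syn] + tokens[i + 1:])
            if model(perturbed) != original:
                return perturbed  # adversarial example found
    return None

if __name__ == "__main__":
    x = "the movie was good"
    print(word_substitution_attack(x, toy_sentiment))
    # -> "the movie was decent": a one-word swap flips "pos" to "neg",
    #    so the toy model fails the robustness criterion on this input.
```

The defenses the survey covers (e.g., adversarial training) aim to make the model's prediction on such semantics-preserving perturbations match its prediction on the original input.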
Pages: 39