Improving adversarial robustness of deep neural networks by using semantic information

Cited by: 11
Authors
Wang, Lina [1 ]
Chen, Xingshu [1 ,2 ]
Tang, Rui [1 ]
Yue, Yawei [1 ]
Zhu, Yi [2 ]
Zeng, Xuemei [2 ]
Wang, Wei [2 ]
Affiliations
[1] Sichuan Univ, Sch Cyber Sci & Engn, Chengdu 610065, Peoples R China
[2] Sichuan Univ, Cyber Sci Res Inst, Chengdu 610065, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Adversarial robustness; Semantic information; Region adversarial training; Targeted universal perturbations; Classification
DOI
10.1016/j.knosys.2021.107141
CLC Classification
TP18 [Artificial intelligence theory]
Subject Classification
081104; 0812; 0835; 1405
Abstract
The vulnerability of deep neural networks (DNNs) to adversarial attacks, in which deliberately perturbed inputs mislead state-of-the-art classifiers into confident misclassifications, raises concerns about the robustness of DNNs. Adversarial training, the main heuristic method for improving adversarial robustness and the first line of defense against such attacks, requires many sample-by-sample computations to enlarge the training set and is usually not strong enough to protect an entire network. This paper offers a new perspective on adversarial robustness, shifting the focus from the network as a whole to the critical region near the decision boundary corresponding to a given class. From this perspective, we propose a method to generate a single, image-agnostic adversarial perturbation that carries semantic information indicating the directions to the fragile parts of the decision boundary and causes inputs to be misclassified as a specified target. We call adversarial training based on such perturbations "region adversarial training" (RAT); it resembles classical adversarial training but is distinguished by reinforcing the semantic information missing in the relevant regions. Experimental results on the MNIST and CIFAR-10 datasets show that this approach greatly improves adversarial robustness even when only a very small subset of the training data is used; moreover, it defends against the fast gradient sign method, universal perturbations, projected gradient descent, and Carlini-Wagner attacks, whose patterns differ completely from those encountered by the model during retraining.
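As the abstract describes it, the method has two stages: craft one targeted, image-agnostic perturbation per class that encodes the direction toward the fragile part of the decision boundary, then retrain the network on inputs shifted by those perturbations but paired with their original labels. The PyTorch sketch below shows roughly how such a pipeline could be wired together; the function names, the L-infinity budget `eps`, the Adam/SGD optimizers, and the per-class perturbation pool are illustrative assumptions on my part, not the authors' published algorithm.

```python
import random

import torch
import torch.nn as nn
import torch.nn.functional as F
from torch.utils.data import DataLoader, TensorDataset


def targeted_universal_perturbation(model, loader, target, eps=0.1, lr=0.01, epochs=5):
    """Craft a single image-agnostic perturbation pushing any input toward `target`."""
    model.eval()
    x0, _ = next(iter(loader))
    # One (1, C, H, W) tensor broadcast over every batch -- image-agnostic by construction.
    delta = torch.zeros_like(x0[:1], requires_grad=True)
    opt = torch.optim.Adam([delta], lr=lr)
    for _ in range(epochs):
        for x, _ in loader:
            opt.zero_grad()
            y_t = torch.full((x.size(0),), target, dtype=torch.long)
            # Pull every sample toward the chosen target class.
            loss = F.cross_entropy(model(x + delta), y_t)
            loss.backward()
            opt.step()
            with torch.no_grad():
                delta.clamp_(-eps, eps)  # keep the perturbation inside an L-inf ball
    return delta.detach()


def region_adversarial_training(model, loader, deltas, lr=1e-3, epochs=5):
    """Fine-tune on perturbed inputs paired with their clean labels (RAT-style)."""
    model.train()
    opt = torch.optim.SGD(model.parameters(), lr=lr, momentum=0.9)
    for _ in range(epochs):
        for x, y in loader:
            delta = random.choice(deltas)  # sample one per-class perturbation per batch
            opt.zero_grad()
            # Correct labels on perturbed inputs restore the missing semantics.
            loss = F.cross_entropy(model(x + delta), y)
            loss.backward()
            opt.step()
    return model


if __name__ == "__main__":
    # Toy demo on random MNIST-shaped data; in the paper a small subset of the
    # real training set plays this role.
    model = nn.Sequential(nn.Flatten(), nn.Linear(28 * 28, 64), nn.ReLU(), nn.Linear(64, 10))
    data = TensorDataset(torch.rand(256, 1, 28, 28), torch.randint(0, 10, (256,)))
    loader = DataLoader(data, batch_size=64)
    deltas = [targeted_universal_perturbation(model, loader, t, epochs=1) for t in range(10)]
    model = region_adversarial_training(model, loader, deltas, epochs=1)
```

Note that the retraining loop keeps the clean labels rather than the attacker's target: under this reading, the perturbation marks where the decision region is fragile, and training against it with correct labels is what "reinforces the missing semantic information" near that boundary.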
Pages: 14
Related Papers
50 records in total
  • [31] Verifying Attention Robustness of Deep Neural Networks Against Semantic Perturbations
    Munakata, Satoshi
    Urban, Caterina
    Yokoyama, Haruki
    Yamamoto, Koji
    Munakata, Kazuki
    NASA FORMAL METHODS, NFM 2023, 2023, 13903 : 37 - 61
  • [32] Verifying Attention Robustness of Deep Neural Networks against Semantic Perturbations
    Munakata, Satoshi
    Urban, Caterina
    Yokoyama, Haruki
    Yamamoto, Koji
    Munakata, Kazuki
2022 29TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE, APSEC, 2022: 560 - 561
  • [33] Improving adversarial robustness by learning shared information
    Yu, Xi
    Smedemark-Margulies, Niklas
    Aeron, Shuchin
    Koike-Akino, Toshiaki
    Moulin, Pierre
    Brand, Matthew
    Parsons, Kieran
    Wang, Ye
    PATTERN RECOGNITION, 2023, 134
  • [34] Parseval Networks: Improving Robustness to Adversarial Examples
    Cisse, Moustapha
    Bojanowski, Piotr
    Grave, Edouard
    Dauphin, Yann
    Usunier, Nicolas
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [35] Improving Error Related Potential Classification by using Generative Adversarial Networks and Deep Convolutional Neural Networks
    Gao, Chenguang
    Li, Zhao
    Ora, Hiroki
    Miyake, Yoshihiro
2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020: 2468 - 2476
  • [36] Adversarial Robustness Certification for Bayesian Neural Networks
    Wicker, Matthew
    Platzer, Andre
    Laurenti, Luca
    Kwiatkowska, Marta
    FORMAL METHODS, PT I, FM 2024, 2025, 14933 : 3 - 28
  • [37] Detecting Adversarial Examples on Deep Neural Networks With Mutual Information Neural Estimation
    Gao, Song
    Wang, Ruxin
    Wang, Xiaoxuan
    Yu, Shui
    Dong, Yunyun
    Yao, Shaowen
    Zhou, Wei
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2023, 20 (06) : 5168 - 5181
  • [38] On the Robustness of Bayesian Neural Networks to Adversarial Attacks
    Bortolussi, Luca
    Carbone, Ginevra
    Laurenti, Luca
    Patane, Andrea
    Sanguinetti, Guido
    Wicker, Matthew
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024: 1 - 14
  • [39] Robustness Against Adversarial Attacks in Neural Networks Using Incremental Dissipativity
    Aquino, Bernardo
    Rahnama, Arash
    Seiler, Peter
    Lin, Lizhen
    Gupta, Vijay
    IEEE CONTROL SYSTEMS LETTERS, 2022, 6 : 2341 - 2346
  • [40] Benchmarking robustness of deep neural networks in semantic segmentation of fluorescence microscopy images
    Zhong, Liqun
    Li, Lingrui
    Yang, Ge
BMC BIOINFORMATICS, 2024, 25 (01)