Robust Deep Learning Models against Semantic-Preserving Adversarial Attack

被引：1

作者：

Zhao, Yunce ^{[1
,2
]}

Gao, Dashan ^{[1
,3
]}

Yao, Yinghua ^{[1
,2
]}

Zhang, Zeqi ^{[4
]}

Mao, Bifei ^{[4
]}

Yao, Xin ^{[1
]}

机构：

[1] SUSTech, Dept CSE, Shenzhen, Peoples R China

[2] Univ Technol Sydney, Sydney, NSW, Australia

[3] HKUST, Hong Kong, Peoples R China

[4] Huawei Technol Co Ltd, Shenzhen, Peoples R China

来源：

2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN | 2023年

基金：

中国国家自然科学基金;

关键词：

Adversarial Examples; Natural Perturbation; Adversarial Perturbation; Robustness;

D O I：

10.1109/IJCNN54540.2023.10191198

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep learning models can be fooled by small l(p)-norm adversarial perturbations and natural perturbations in terms of attributes. Although the robustness against each perturbation has been explored, it remains a challenge to address the robustness against joint perturbations effectively. In this paper, we study the robustness of deep learning models against joint perturbations by proposing a novel attack mechanism named Semantic-Preserving Adversarial (SPA) attack, which can then be used to enhance adversarial training. Specifically, we introduce an attribute manipulator to generate natural and human-comprehensible perturbations and a noise generator to generate diverse adversarial noises. Based on such combined noises, we optimize both the attribute value and the diversity variable to generate jointlyperturbed samples. For robust training, we adversarially train the deep learning model against the generated joint perturbations. Empirical results on four benchmarks show that the SPA attack causes a larger performance decline with small l1 norm-ball constraints compared to existing approaches. Furthermore, our SPA-enhanced training outperforms existing defense methods against such joint perturbations.

引用

页数：8

共 50 条

[1] Semantic-Preserving Adversarial Text Attacks
Yang, Xinghao
Gong, Yongshun
Liu, Weifeng
Bailey, James
Tao, Dacheng
Liu, Wei
IEEE TRANSACTIONS ON SUSTAINABLE COMPUTING, 2023, 8 (04): : 583 - 595
[2] Robust Adversarial Objects against Deep Learning Models
Tsai, Tzungyu
Yang, Kaichen
Ho, Tsung-Yi
Jin, Yier
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 954 - 962
[3] Unsupervised Semantic-Preserving Adversarial Hashing for Image Search
Deng, Cheng
Yang, Erkun
Liu, Tongliang
Li, Jie
Liu, Wei
Tao, Dacheng
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (08) : 4032 - 4044
[4] A Theoretical Insight Into the Effect of Loss Function for Deep Semantic-Preserving Learning
Akbari, Ali
Awais, Muhammad
Bashar, Manijeh
Kittler, Josef
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (01) : 119 - 133
[5] BadNL: Backdoor Attacks against NLP Models with Semantic-preserving Improvements
Chen, Xiaoyi
Salem, Ahmed
Chen, Dingfan
Backes, Michael
Ma, Shiqing
Shen, Qingni
Wu, Zhonghai
Zhang, Yang
37TH ANNUAL COMPUTER SECURITY APPLICATIONS CONFERENCE, ACSAC 2021, 2021, : 554 - 569
[6] Flexible Semantic-Preserving Flattening of Hierarchical Component Models
Leveque, Thomas
Carlson, Jan
Sentilles, Severine
Borde, Etienne
2011 37TH EUROMICRO CONFERENCE ON SOFTWARE ENGINEERING AND ADVANCED APPLICATIONS (SEAA 2011), 2011, : 31 - 38
[7] Robust Roadside Physical Adversarial Attack Against Deep Learning in Lidar Perception Modules
Yang, Kaichen
Tsai, Tzungyu
Yu, Honggang
Panoff, Max
Ho, Tsung-Yi
Jin, Yier
ASIA CCS'21: PROCEEDINGS OF THE 2021 ACM ASIA CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2021, : 349 - 362
[8] Deep learning models for electrocardiograms are susceptible to adversarial attack
Han, Xintian
Hu, Yuxuan
Foschini, Luca
Chinitz, Larry
Jankelson, Lior
Ranganath, Rajesh
NATURE MEDICINE, 2020, 26 (03) : 360 - +
[9] Deep learning models for electrocardiograms are susceptible to adversarial attack
Xintian Han
Yuxuan Hu
Luca Foschini
Larry Chinitz
Lior Jankelson
Rajesh Ranganath
Nature Medicine, 2020, 26 : 360 - 363
[10] SEMANTIC-PRESERVING METRIC LEARNING FOR VIDEO-TEXT RETRIEVAL
Choo, Sungkwon
Ha, Seong Jong
Lee, Joonsoo
2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 2388 - 2392

← 1 2 3 4 5 →