Robust Deep Learning Models against Semantic-Preserving Adversarial Attack

被引:1
|
作者
Zhao, Yunce [1 ,2 ]
Gao, Dashan [1 ,3 ]
Yao, Yinghua [1 ,2 ]
Zhang, Zeqi [4 ]
Mao, Bifei [4 ]
Yao, Xin [1 ]
机构
[1] SUSTech, Dept CSE, Shenzhen, Peoples R China
[2] Univ Technol Sydney, Sydney, NSW, Australia
[3] HKUST, Hong Kong, Peoples R China
[4] Huawei Technol Co Ltd, Shenzhen, Peoples R China
来源
2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN | 2023年
基金
中国国家自然科学基金;
关键词
Adversarial Examples; Natural Perturbation; Adversarial Perturbation; Robustness;
D O I
10.1109/IJCNN54540.2023.10191198
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning models can be fooled by small l(p)-norm adversarial perturbations and natural perturbations in terms of attributes. Although the robustness against each perturbation has been explored, it remains a challenge to address the robustness against joint perturbations effectively. In this paper, we study the robustness of deep learning models against joint perturbations by proposing a novel attack mechanism named Semantic-Preserving Adversarial (SPA) attack, which can then be used to enhance adversarial training. Specifically, we introduce an attribute manipulator to generate natural and human-comprehensible perturbations and a noise generator to generate diverse adversarial noises. Based on such combined noises, we optimize both the attribute value and the diversity variable to generate jointlyperturbed samples. For robust training, we adversarially train the deep learning model against the generated joint perturbations. Empirical results on four benchmarks show that the SPA attack causes a larger performance decline with small l1 norm-ball constraints compared to existing approaches. Furthermore, our SPA-enhanced training outperforms existing defense methods against such joint perturbations.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Semantic-Preserving Adversarial Text Attacks
    Yang, Xinghao
    Gong, Yongshun
    Liu, Weifeng
    Bailey, James
    Tao, Dacheng
    Liu, Wei
    IEEE TRANSACTIONS ON SUSTAINABLE COMPUTING, 2023, 8 (04): : 583 - 595
  • [2] Robust Adversarial Objects against Deep Learning Models
    Tsai, Tzungyu
    Yang, Kaichen
    Ho, Tsung-Yi
    Jin, Yier
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 954 - 962
  • [3] Unsupervised Semantic-Preserving Adversarial Hashing for Image Search
    Deng, Cheng
    Yang, Erkun
    Liu, Tongliang
    Li, Jie
    Liu, Wei
    Tao, Dacheng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (08) : 4032 - 4044
  • [4] A Theoretical Insight Into the Effect of Loss Function for Deep Semantic-Preserving Learning
    Akbari, Ali
    Awais, Muhammad
    Bashar, Manijeh
    Kittler, Josef
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (01) : 119 - 133
  • [5] BadNL: Backdoor Attacks against NLP Models with Semantic-preserving Improvements
    Chen, Xiaoyi
    Salem, Ahmed
    Chen, Dingfan
    Backes, Michael
    Ma, Shiqing
    Shen, Qingni
    Wu, Zhonghai
    Zhang, Yang
    37TH ANNUAL COMPUTER SECURITY APPLICATIONS CONFERENCE, ACSAC 2021, 2021, : 554 - 569
  • [6] Flexible Semantic-Preserving Flattening of Hierarchical Component Models
    Leveque, Thomas
    Carlson, Jan
    Sentilles, Severine
    Borde, Etienne
    2011 37TH EUROMICRO CONFERENCE ON SOFTWARE ENGINEERING AND ADVANCED APPLICATIONS (SEAA 2011), 2011, : 31 - 38
  • [7] Robust Roadside Physical Adversarial Attack Against Deep Learning in Lidar Perception Modules
    Yang, Kaichen
    Tsai, Tzungyu
    Yu, Honggang
    Panoff, Max
    Ho, Tsung-Yi
    Jin, Yier
    ASIA CCS'21: PROCEEDINGS OF THE 2021 ACM ASIA CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2021, : 349 - 362
  • [8] Deep learning models for electrocardiograms are susceptible to adversarial attack
    Han, Xintian
    Hu, Yuxuan
    Foschini, Luca
    Chinitz, Larry
    Jankelson, Lior
    Ranganath, Rajesh
    NATURE MEDICINE, 2020, 26 (03) : 360 - +
  • [9] Deep learning models for electrocardiograms are susceptible to adversarial attack
    Xintian Han
    Yuxuan Hu
    Luca Foschini
    Larry Chinitz
    Lior Jankelson
    Rajesh Ranganath
    Nature Medicine, 2020, 26 : 360 - 363
  • [10] SEMANTIC-PRESERVING METRIC LEARNING FOR VIDEO-TEXT RETRIEVAL
    Choo, Sungkwon
    Ha, Seong Jong
    Lee, Joonsoo
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 2388 - 2392