Interpreting Adversarial Examples in Deep Learning: A Review

Cited by: 29
Authors
Han, Sicong [1]
Lin, Chenhao [1]
Shen, Chao [1]
Wang, Qian [2]
Guan, Xiaohong [1]
Affiliations
[1] Xi An Jiao Tong Univ, 28 Xianning West Rd, Xian 710049, Shaanxi, Peoples R China
[2] Wuhan Univ, 299 Bayi Rd, Wuhan 430072, Hubei, Peoples R China
Funding
China Postdoctoral Science Foundation; National Natural Science Foundation of China
Keywords
Deep learning; adversarial example; interpretability; adversarial robustness
DOI
10.1145/3594869
Chinese Library Classification number
TP301 [Theory, methods]
Discipline classification code
081202
Abstract
Deep learning technology is increasingly being applied in safety-critical scenarios but has recently been found to be susceptible to imperceptible adversarial perturbations. This raises a serious concern regarding the adversarial robustness of deep neural network (DNN)-based applications. Accordingly, various adversarial attacks and defense approaches have been proposed. However, current studies implement different types of attacks and defenses with certain assumptions. There is still a lack of full theoretical understanding and interpretation of adversarial examples. Instead of reviewing technical progress in adversarial attacks and defenses, this article presents a framework consisting of three perspectives to discuss recent works focusing on theoretically explaining adversarial examples comprehensively. In each perspective, various hypotheses are further categorized and summarized into several subcategories and introduced systematically. To the best of our knowledge, this study is the first to concentrate on surveying existing research on adversarial examples and adversarial robustness from the interpretability perspective. By drawing on the reviewed literature, this survey characterizes current problems and challenges that need to be addressed and highlights potential future research directions to further investigate adversarial examples.
Pages: 38
Related papers
50 items in total
  • [21] Towards Interpreting and Utilizing Symmetry Property in Adversarial Examples
    Mei, Shibin
    Zhao, Chenglong
    Ni, Bingbing
    Yuan, Shengchao
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 8, 2023, : 9126 - 9133
  • [22] Plausible Counterfactuals: Auditing Deep Learning Classifiers with Realistic Adversarial Examples
    Barredo-Arrieta, Alejandro
    Del Ser, Javier
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [23] Impact of Adversarial Examples on Deep Learning Models for Biomedical Image Segmentation
    Ozbulak, Utku
    Van Messem, Arnout
    De Neve, Wesley
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2019, PT II, 2019, 11765 : 300 - 308
  • [24] Metamorphic Detection of Adversarial Examples in Deep Learning Models With Affine Transformations
    Mekala, Rohan Reddy
    Magnusson, Gudjon Einar
    Porter, Adam
    Lindvall, Mikael
    Diep, Madeline
    2019 IEEE/ACM 4TH INTERNATIONAL WORKSHOP ON METAMORPHIC TESTING (MET 2019), 2019, : 55 - 62
  • [25] Experiments on Adversarial Examples for Deep Learning Model Using Multimodal Sensors
    Kurniawan, Ade
    Ohsita, Yuichi
    Murata, Masayuki
    SENSORS, 2022, 22 (22)
  • [26] ADVERSARIAL-PLAYGROUND: A Visualization Suite Showing How Adversarial Examples Fool Deep Learning
    Norton, Andrew P.
    Qi, Yanjun
    2017 IEEE SYMPOSIUM ON VISUALIZATION FOR CYBER SECURITY (VIZSEC), 2017,
  • [27] Feature-Based Adversarial Training for Deep Learning Models Resistant to Transferable Adversarial Examples
    Ryu, Gwonsang
    Choi, Daeseon
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2022, E105D (05) : 1039 - 1049
  • [28] Using Adversarial Examples to Bypass Deep Learning Based URL Detection System
    Chen, Wencheng
    Zeng, Yi
    Qiu, Meikang
    4TH IEEE INTERNATIONAL CONFERENCE ON SMART CLOUD (SMARTCLOUD 2019) / 3RD INTERNATIONAL SYMPOSIUM ON REINFORCEMENT LEARNING (ISRL 2019), 2019, : 128 - 130
  • [29] Is Deep Learning Safe for Robot Vision? Adversarial Examples against the iCub Humanoid
    Melis, Marco
    Demontis, Ambra
    Biggio, Battista
    Brown, Gavin
    Fumera, Giorgio
    Roli, Fabio
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, : 751 - 759
  • [30] Minimal Adversarial Examples for Deep Learning on 3D Point Clouds
    Kim, Jaeyeon
    Hua, Binh-Son
    Duc Thanh Nguyen
    Yeung, Sai-Kit
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 7777 - 7786