UNMASK: Adversarial Detection and Defense Through Robust Feature Alignment

被引:15
|
作者
Freitas, Scott [1 ]
Chen, Shang-Tse [2 ]
Wang, Zijie J. [1 ]
Chau, Duen Horng [1 ]
机构
[1] Georgia Tech, Atlanta, GA 30332 USA
[2] Natl Taiwan Univ, Taipei, Taiwan
关键词
deep learning; adversarial defense; robust features; adversarial detection;
D O I
10.1109/BigData50022.2020.9378303
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent research has demonstrated that deep learning architectures are vulnerable to adversarial attacks, highlighting the vital need for defensive techniques to detect and mitigate these attacks before they occur. We present UNMASK, an adversarial detection and defense framework based on robust feature alignment. UNMASK combats adversarial attacks by extracting robust features (e.g., beak, wings, eyes) from an image (e.g., "bird") and comparing them to the expected features of the classification. For example, if the extracted features for a "bird" image are wheel, saddle and frame, the model may be under attack. UNMASK detects such attacks and defends the model by rectifying the misclassification, re-classifying the image based on its robust features. Our extensive evaluation shows that UNMASK detects up to 96.75% of attacks, and defends the model by correctly classifying up to 93% of adversarial images produced by the current strongest attack, Projected Gradient Descent, in the gray-box setting. UNMASK provides significantly better protection than adversarial training across 8 attack vectors, averaging 31.18% higher accuracy. We open source the code repository and data with this paper: https://github.com/safreita1/unmask.
引用
收藏
页码:1081 / 1088
页数:8
相关论文
共 50 条
  • [21] Defense Against Adversarial Attacks Using Feature Scattering-based Adversarial Training
    Zhang, Haichao
    Wang, Jianyu
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [22] Boosting adversarial robustness via feature refinement, suppression, and alignment
    Wu, Yulun
    Guo, Yanming
    Chen, Dongmei
    Yu, Tianyuan
    Xiao, Huaxin
    Guo, Yuanhao
    Bai, Liang
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (03) : 3213 - 3233
  • [23] Boosting adversarial robustness via feature refinement, suppression, and alignment
    Yulun Wu
    Yanming Guo
    Dongmei Chen
    Tianyuan Yu
    Huaxin Xiao
    Yuanhao Guo
    Liang Bai
    Complex & Intelligent Systems, 2024, 10 : 3213 - 3233
  • [24] Adversarial Alignment for Source Free Object Detection
    Chu, Qiaosong
    Li, Shuyan
    Chen, Guangyi
    Li, Kai
    Li, Xiu
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 1, 2023, : 452 - 460
  • [25] Adversarial feature distribution alignment for semi-supervised learning
    Mayer, Christoph
    Paul, Matthieu
    Timofte, Radu
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2021, 202
  • [26] Identity- and Pose-Robust Facial Expression Recognition through Adversarial Feature Learning
    Wang, Can
    Wang, Shangfei
    Liang, Guang
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 238 - 246
  • [27] Salient feature extractor for adversarial defense on deep neural networks
    Chen, Ruoxi
    Chen, Jinyin
    Zheng, Haibin
    Xuan, Qi
    Ming, Zhaoyan
    Jiang, Wenrong
    Cui, Chen
    INFORMATION SCIENCES, 2022, 600 : 118 - 143
  • [28] Detection and Defense of Topological Adversarial Attacks on Graphs
    Zhang, Yingxue
    Regol, Florence
    Pal, Soumyasundar
    Khan, Sakif
    Ma, Liheng
    Coates, Mark
    24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130
  • [29] Adversarial Defense on Harmony: Reverse Attack for Robust AI Models Against Adversarial Attacks
    Kim, Yebon
    Jung, Jinhyo
    Kim, Hyunjun
    So, Hwisoo
    Ko, Yohan
    Shrivastava, Aviral
    Lee, Kyoungwoo
    Hwang, Uiwon
    IEEE ACCESS, 2024, 12 : 176485 - 176497
  • [30] Towards Robust Detection of Adversarial Examples
    Pang, Tianyu
    Du, Chao
    Dong, Yinpeng
    Zhu, Jun
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31