UNMASK: Adversarial Detection and Defense Through Robust Feature Alignment

被引：15

作者：

Freitas, Scott ^{[1
]}

Chen, Shang-Tse ^{[2
]}

Wang, Zijie J. ^{[1
]}

Chau, Duen Horng ^{[1
]}

机构：

[1] Georgia Tech, Atlanta, GA 30332 USA

[2] Natl Taiwan Univ, Taipei, Taiwan

来源：

2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA) | 2020年

关键词：

deep learning; adversarial defense; robust features; adversarial detection;

D O I：

10.1109/BigData50022.2020.9378303

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recent research has demonstrated that deep learning architectures are vulnerable to adversarial attacks, highlighting the vital need for defensive techniques to detect and mitigate these attacks before they occur. We present UNMASK, an adversarial detection and defense framework based on robust feature alignment. UNMASK combats adversarial attacks by extracting robust features (e.g., beak, wings, eyes) from an image (e.g., "bird") and comparing them to the expected features of the classification. For example, if the extracted features for a "bird" image are wheel, saddle and frame, the model may be under attack. UNMASK detects such attacks and defends the model by rectifying the misclassification, re-classifying the image based on its robust features. Our extensive evaluation shows that UNMASK detects up to 96.75% of attacks, and defends the model by correctly classifying up to 93% of adversarial images produced by the current strongest attack, Projected Gradient Descent, in the gray-box setting. UNMASK provides significantly better protection than adversarial training across 8 attack vectors, averaging 31.18% higher accuracy. We open source the code repository and data with this paper: https://github.com/safreita1/unmask.

引用

页码：1081 / 1088

页数：8

共 50 条

[21] Defense Against Adversarial Attacks Using Feature Scattering-based Adversarial Training
Zhang, Haichao
Wang, Jianyu
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
[22] Boosting adversarial robustness via feature refinement, suppression, and alignment
Wu, Yulun
Guo, Yanming
Chen, Dongmei
Yu, Tianyuan
Xiao, Huaxin
Guo, Yuanhao
Bai, Liang
COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (03) : 3213 - 3233
[23] Boosting adversarial robustness via feature refinement, suppression, and alignment
Yulun Wu
Yanming Guo
Dongmei Chen
Tianyuan Yu
Huaxin Xiao
Yuanhao Guo
Liang Bai
Complex & Intelligent Systems, 2024, 10 : 3213 - 3233
[24] Adversarial Alignment for Source Free Object Detection
Chu, Qiaosong
Li, Shuyan
Chen, Guangyi
Li, Kai
Li, Xiu
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 1, 2023, : 452 - 460
[25] Adversarial feature distribution alignment for semi-supervised learning
Mayer, Christoph
Paul, Matthieu
Timofte, Radu
COMPUTER VISION AND IMAGE UNDERSTANDING, 2021, 202
[26] Identity- and Pose-Robust Facial Expression Recognition through Adversarial Feature Learning
Wang, Can
Wang, Shangfei
Liang, Guang
PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 238 - 246
[27] Salient feature extractor for adversarial defense on deep neural networks
Chen, Ruoxi
Chen, Jinyin
Zheng, Haibin
Xuan, Qi
Ming, Zhaoyan
Jiang, Wenrong
Cui, Chen
INFORMATION SCIENCES, 2022, 600 : 118 - 143
[28] Detection and Defense of Topological Adversarial Attacks on Graphs
Zhang, Yingxue
Regol, Florence
Pal, Soumyasundar
Khan, Sakif
Ma, Liheng
Coates, Mark
24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130
[29] Adversarial Defense on Harmony: Reverse Attack for Robust AI Models Against Adversarial Attacks
Kim, Yebon
Jung, Jinhyo
Kim, Hyunjun
So, Hwisoo
Ko, Yohan
Shrivastava, Aviral
Lee, Kyoungwoo
Hwang, Uiwon
IEEE ACCESS, 2024, 12 : 176485 - 176497
[30] Towards Robust Detection of Adversarial Examples
Pang, Tianyu
Du, Chao
Dong, Yinpeng
Zhu, Jun
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31

← 1 2 3 4 5 →