Improving transferable adversarial attack for vision transformers via global attention and local drop

被引：3

作者：

Li, Tuo ^{[1
]}

Han, Yahong ^{[1
]}

机构：

[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China

来源：

MULTIMEDIA SYSTEMS | 2023年 / 29卷 / 06期

关键词：

Adversarial examples; Vision transformer; Transferability; Self-attention;

D O I：

10.1007/s00530-023-01157-z

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Vision Transformers (ViTs) have been a new paradigm in several computer vision tasks, yet they are susceptible to adversarial examples. Recent studies show it is difficult to transfer adversarial examples generated by ViTs to other models. Existing methods have poor transferability because they do not target the specific structural characteristics (e.g., self-attention and patch-embedding) of ViTs. To address this problem and further boost transferability, we propose a method, namely Global Attention and Local Drop (GALD), to boost the transferability of adversarial examples from ViTs to other models, including ViTs and convolutional neural networks (CNNs). Specifically, our method contains two parts: Global Attention Guidance (GAG) and Drop Patch (DP). The GAG improves the attention representation in shallow layers by adding global guidance attention to every layer except the final layer of ViTs. Therefore, the perturbations could focus on the object regions. DP randomly drops some patches in every iteration to diversify the input patterns and mitigate overfitting of adversarial examples to the surrogate model. Experiments show that adversarial examples generated by our method own the best transferability to black-box models with unknown structures. Code is available at Link.

引用

页码：3467 / 3480

页数：14

共 50 条

[21] Understanding and improving adversarial transferability of vision transformers and convolutional neural networks
Chen, Zhiyu
Xu, Chi
Lv, Huanhuan
Liu, Shangdong
Ji, Yimu
INFORMATION SCIENCES, 2023, 648
[22] Towards the Transferable Reversible Adversarial Example via Distribution-Relevant Attack
Tian, Zhuo
Zhou, Xiaoyi
Xing, Fan
Zhao, Ruiyang
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT XI, 2025, 15041 : 292 - 305
[23] Transferable Structural Sparse Adversarial Attack Via Exact Group Sparsity Training
Di Ming
Ren, Peng
Wang, Yunlong
Feng, Xin
2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 24696 - 24705
[24] Improving adversarial robustness of medical imaging systems via adding global attention noise
Dai, Yinyao
Qian, Yaguan
Lu, Fang
Wang, Bin
Gu, Zhaoquan
Wang, Wei
Wan, Jian
Zhang, Yanchun
COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 164
[25] Inheritance Attention Matrix-Based Universal Adversarial Perturbations on Vision Transformers
Hu, Haoqi
Lu, Xiaofeng
Zhang, Xinpeng
Zhang, Tianxing
Sun, Guangling
IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 1923 - 1927
[26] Decision-based Black-box Attack Against Vision Transformers via Patch-wise Adversarial Removal
Shi, Yucheng
Han, Yahong
Tan, Yu-an
Kuang, Xiaohui
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[27] Attentional Feature Erase: Towards task-wise transferable adversarial attack on cloud vision APIs
Cheng, Bo
Lu, Yantao
Li, Yilan
You, Tao
Zhang, Peng
DISPLAYS, 2024, 82
[28] Improving Transferable Targeted Adversarial Attack for Object Detection Using RCEN Framework and Logit Loss Optimization
Ding, Zhiyi
Sun, Lei
Mao, Xiuqing
Dai, Leyu
Ding, Ruiyang
CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 80 (03): : 4387 - 4412
[29] Unraveling Attention via Convex Duality: Analysis and Interpretations of Vision Transformers
Sahiner, Arda
Ergen, Tolga
Ozturkler, Batu
Pauly, John
Mardani, Morteza
Pilanci, Mert
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022, : 19050 - 19088
[30] Omnidirectional image quality assessment with local-global vision transformers
Tofighi, Nafiseh Jabbari
Elfkir, Mohamed Hedi
Imamoglu, Nevrez
Ozcinar, Cagri
Erdem, Aykut
Erdem, Erkut
IMAGE AND VISION COMPUTING, 2024, 148

← 1 2 3 4 5 →