Improving transferable adversarial attack for vision transformers via global attention and local drop

被引:3
|
作者
Li, Tuo [1 ]
Han, Yahong [1 ]
机构
[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China
关键词
Adversarial examples; Vision transformer; Transferability; Self-attention;
D O I
10.1007/s00530-023-01157-z
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Vision Transformers (ViTs) have been a new paradigm in several computer vision tasks, yet they are susceptible to adversarial examples. Recent studies show it is difficult to transfer adversarial examples generated by ViTs to other models. Existing methods have poor transferability because they do not target the specific structural characteristics (e.g., self-attention and patch-embedding) of ViTs. To address this problem and further boost transferability, we propose a method, namely Global Attention and Local Drop (GALD), to boost the transferability of adversarial examples from ViTs to other models, including ViTs and convolutional neural networks (CNNs). Specifically, our method contains two parts: Global Attention Guidance (GAG) and Drop Patch (DP). The GAG improves the attention representation in shallow layers by adding global guidance attention to every layer except the final layer of ViTs. Therefore, the perturbations could focus on the object regions. DP randomly drops some patches in every iteration to diversify the input patterns and mitigate overfitting of adversarial examples to the surrogate model. Experiments show that adversarial examples generated by our method own the best transferability to black-box models with unknown structures. Code is available at Link.
引用
收藏
页码:3467 / 3480
页数:14
相关论文
共 50 条
  • [21] Understanding and improving adversarial transferability of vision transformers and convolutional neural networks
    Chen, Zhiyu
    Xu, Chi
    Lv, Huanhuan
    Liu, Shangdong
    Ji, Yimu
    INFORMATION SCIENCES, 2023, 648
  • [22] Towards the Transferable Reversible Adversarial Example via Distribution-Relevant Attack
    Tian, Zhuo
    Zhou, Xiaoyi
    Xing, Fan
    Zhao, Ruiyang
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT XI, 2025, 15041 : 292 - 305
  • [23] Transferable Structural Sparse Adversarial Attack Via Exact Group Sparsity Training
    Di Ming
    Ren, Peng
    Wang, Yunlong
    Feng, Xin
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 24696 - 24705
  • [24] Improving adversarial robustness of medical imaging systems via adding global attention noise
    Dai, Yinyao
    Qian, Yaguan
    Lu, Fang
    Wang, Bin
    Gu, Zhaoquan
    Wang, Wei
    Wan, Jian
    Zhang, Yanchun
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 164
  • [25] Inheritance Attention Matrix-Based Universal Adversarial Perturbations on Vision Transformers
    Hu, Haoqi
    Lu, Xiaofeng
    Zhang, Xinpeng
    Zhang, Tianxing
    Sun, Guangling
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 1923 - 1927
  • [26] Decision-based Black-box Attack Against Vision Transformers via Patch-wise Adversarial Removal
    Shi, Yucheng
    Han, Yahong
    Tan, Yu-an
    Kuang, Xiaohui
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [27] Attentional Feature Erase: Towards task-wise transferable adversarial attack on cloud vision APIs
    Cheng, Bo
    Lu, Yantao
    Li, Yilan
    You, Tao
    Zhang, Peng
    DISPLAYS, 2024, 82
  • [28] Improving Transferable Targeted Adversarial Attack for Object Detection Using RCEN Framework and Logit Loss Optimization
    Ding, Zhiyi
    Sun, Lei
    Mao, Xiuqing
    Dai, Leyu
    Ding, Ruiyang
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 80 (03): : 4387 - 4412
  • [29] Unraveling Attention via Convex Duality: Analysis and Interpretations of Vision Transformers
    Sahiner, Arda
    Ergen, Tolga
    Ozturkler, Batu
    Pauly, John
    Mardani, Morteza
    Pilanci, Mert
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022, : 19050 - 19088
  • [30] Omnidirectional image quality assessment with local-global vision transformers
    Tofighi, Nafiseh Jabbari
    Elfkir, Mohamed Hedi
    Imamoglu, Nevrez
    Ozcinar, Cagri
    Erdem, Aykut
    Erdem, Erkut
    IMAGE AND VISION COMPUTING, 2024, 148