Robust shortcut and disordered robustness: Improving adversarial training through adaptive smoothing

被引:0
|
作者
Li, Lin [1 ]
Spratling, Michael [1 ,2 ]
机构
[1] Kings Coll London, Dept Informat, London WC2B 4BG, England
[2] Univ Luxembourg, Dept Behav & Cognit Sci, L-4366 Esch Belval, Luxembourg
关键词
Adversarial robustness; Adversarial training; Loss smoothing; Instance adaptive;
D O I
10.1016/j.patcog.2025.111474
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep neural networks are highly susceptible to adversarial perturbations: artificial noise that corrupts input data in ways imperceptible to humans but causes incorrect predictions. Among the various defenses against these attacks, adversarial training has emerged as the most effective. In this work, we aim to enhance adversarial training to improve robustness against adversarial attacks. We begin by analyzing how adversarial vulnerability evolves during training from an instance-wise perspective. This analysis reveals two previously unrecognized phenomena: robust shortcut and disordered robustness. We then demonstrate that these phenomena are related to robust overfitting, a well-known issue in adversarial training. Building on these insights, we propose a novel adversarial training method: Instance-adaptive Smoothness Enhanced Adversarial Training (ISEAT). This method jointly smooths the input and weight loss landscapes in an instance-adaptive manner, preventing the exploitation of robust shortcut and thereby mitigating robust overfitting. Extensive experiments demonstrate the efficacy of ISEAT and its superiority over existing adversarial training methods. Code is available at https://github.com/TreeLLi/ISEAT.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] GAAT: Group Adaptive Adversarial Training to Improve the Trade-Off Between Robustness and Accuracy
    Qian, Yaguan
    Liang, Xiaoyu
    Kang, Ming
    Wang, Bin
    Gu, Zhaoquan
    Wang, Xing
    Wu, Chunming
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2022, 36 (13)
  • [42] Improving the Shortest Plank: Vulnerability-Aware Adversarial Training for Robust Recommender System
    Zhang, Kaike
    Cao, Qi
    Wu, Yunfan
    Sun, Fei
    Shen, Huawei
    Cheng, Xueqi
    PROCEEDINGS OF THE EIGHTEENTH ACM CONFERENCE ON RECOMMENDER SYSTEMS, RECSYS 2024, 2024, : 680 - 689
  • [43] A Gift from Label Smoothing: Robust Training with Adaptive Label Smoothing via Auxiliary Classifier under Label Noise
    Ko, Jongwoo
    Yi, Bongsoo
    Yun, Se-Young
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 7, 2023, : 8325 - 8333
  • [44] Push Stricter to Decide Better: A Class-Conditional Feature Adaptive Framework for Improving Adversarial Robustness
    Yin, Jia-Li
    Chen, Bin
    Zhu, Wanqing
    Chen, Bo-Hao
    Liu, Ximeng
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2023, 18 : 2119 - 2131
  • [45] Alignment-Based Adversarial Training (ABAT) for Improving the Robustness and Accuracy of EEG-Based BCIs
    Chen, Xiaoqing
    Wang, Ziwei
    Wu, Dongrui
    IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2024, 32 : 1703 - 1714
  • [46] Improving the Robustness and Quality of Biomedical CNN Models through Adaptive Hyperparameter Tuning
    Iqbal, Saeed
    Qureshi, Adnan N.
    Ullah, Amin
    Li, Jianqiang
    Mahmood, Tariq
    APPLIED SCIENCES-BASEL, 2022, 12 (22):
  • [47] Understanding adversarial training: Increasing local stability of supervised models through robust optimization
    Shaham, Uri
    Yamada, Yutaro
    Negahban, Sahand
    NEUROCOMPUTING, 2018, 307 : 195 - 204
  • [48] SNN-RAT: Robustness-enhanced Spiking Neural Network through Regularized Adversarial Training
    Ding, Jianhao
    Bu, Tong
    Yu, Zhaofei
    Huang, Tiejun
    Liu, Jian K.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [49] Improving the performance of adaptive arrays in nonstationary environments through data-adaptive training
    Rabideau, DJ
    Steinhardt, AO
    THIRTIETH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1 AND 2, 1997, : 75 - 79
  • [50] Toward Improving Synthetic Audio Spoofing Detection Robustness via Meta-Learning and Disentangled Training With Adversarial Examples
    Wang, Zhenyu
    Hansen, John H. L.
    IEEE ACCESS, 2024, 12 : 99894 - 99911