Robust shortcut and disordered robustness: Improving adversarial training through adaptive smoothing

被引:0
|
作者
Li, Lin [1 ]
Spratling, Michael [1 ,2 ]
机构
[1] Kings Coll London, Dept Informat, London WC2B 4BG, England
[2] Univ Luxembourg, Dept Behav & Cognit Sci, L-4366 Esch Belval, Luxembourg
关键词
Adversarial robustness; Adversarial training; Loss smoothing; Instance adaptive;
D O I
10.1016/j.patcog.2025.111474
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep neural networks are highly susceptible to adversarial perturbations: artificial noise that corrupts input data in ways imperceptible to humans but causes incorrect predictions. Among the various defenses against these attacks, adversarial training has emerged as the most effective. In this work, we aim to enhance adversarial training to improve robustness against adversarial attacks. We begin by analyzing how adversarial vulnerability evolves during training from an instance-wise perspective. This analysis reveals two previously unrecognized phenomena: robust shortcut and disordered robustness. We then demonstrate that these phenomena are related to robust overfitting, a well-known issue in adversarial training. Building on these insights, we propose a novel adversarial training method: Instance-adaptive Smoothness Enhanced Adversarial Training (ISEAT). This method jointly smooths the input and weight loss landscapes in an instance-adaptive manner, preventing the exploitation of robust shortcut and thereby mitigating robust overfitting. Extensive experiments demonstrate the efficacy of ISEAT and its superiority over existing adversarial training methods. Code is available at https://github.com/TreeLLi/ISEAT.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] ATGAN: Adversarial training-based GAN for improving adversarial robustness generalization on image classification
    Desheng Wang
    Weidong Jin
    Yunpu Wu
    Aamir Khan
    Applied Intelligence, 2023, 53 : 24492 - 24508
  • [22] Improving adversarial robustness of Bayesian neural networks via multi-task adversarial training
    Chen, Xu
    Liu, Chuancai
    Zhao, Yue
    Jia, Zhiyang
    Jin, Ge
    INFORMATION SCIENCES, 2022, 592 : 156 - 173
  • [23] ATGAN: Adversarial training-based GAN for improving adversarial robustness generalization on image classification
    Wang, Desheng
    Jin, Weidong
    Wu, Yunpu
    Khan, Aamir
    APPLIED INTELLIGENCE, 2023, 53 (20) : 24492 - 24508
  • [24] Improving adversarial robustness through a curriculum-guided reliable distillation
    Li, Jiawen
    Fang, Kun
    Huang, Xiaolin
    Yang, Jie
    COMPUTERS & SECURITY, 2023, 133
  • [25] Self-adaptive Adversarial Training for Robust Medical Segmentation
    Wang, Fu
    Fu, Zeyu
    Zhang, Yanghao
    Ruan, Wenjie
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT III, 2023, 14222 : 725 - 735
  • [26] Improving adversarial robustness of deep neural networks via adaptive margin evolution
    Ma, Linhai
    Liang, Liang
    NEUROCOMPUTING, 2023, 551
  • [27] Improving Robustness of DNNs against Common Corruptions via Gaussian Adversarial Training
    Yi, Chenyu
    Li, Haoliang
    Wan, Renjie
    Kot, Alex C.
    2020 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2020, : 17 - 20
  • [28] IMPROVING ROBUSTNESS OF DEEP NETWORKS USING CLUSTER-BASED ADVERSARIAL TRAINING
    Rasheed, Bader
    Khan, Adil
    RUSSIAN LAW JOURNAL, 2023, 11 (09) : 412 - 420
  • [29] Improving the Robustness of Deep Neural Networks via Adversarial Training with Triplet Loss
    Li, Pengcheng
    Yi, Jinfeng
    Zhou, Bowen
    Zhang, Lijun
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 2909 - 2915
  • [30] Increasing the Robustness of Image Quality Assessment Models Through Adversarial Training
    Chistyakova, Anna
    Antsiferova, Anastasia
    Khrebtov, Maksim
    Lavrushkin, Sergey
    Arkhipenko, Konstantin
    Vatolin, Dmitriy
    Turdakov, Denis
    TECHNOLOGIES, 2024, 12 (11)