Robust shortcut and disordered robustness: Improving adversarial training through adaptive smoothing

被引：0

作者：

Li, Lin ^{[1
]}

Spratling, Michael ^{[1
,2
]}

机构：

[1] Kings Coll London, Dept Informat, London WC2B 4BG, England

[2] Univ Luxembourg, Dept Behav & Cognit Sci, L-4366 Esch Belval, Luxembourg

来源：

PATTERN RECOGNITION | 2025年 / 163卷

关键词：

Adversarial robustness; Adversarial training; Loss smoothing; Instance adaptive;

D O I：

10.1016/j.patcog.2025.111474

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep neural networks are highly susceptible to adversarial perturbations: artificial noise that corrupts input data in ways imperceptible to humans but causes incorrect predictions. Among the various defenses against these attacks, adversarial training has emerged as the most effective. In this work, we aim to enhance adversarial training to improve robustness against adversarial attacks. We begin by analyzing how adversarial vulnerability evolves during training from an instance-wise perspective. This analysis reveals two previously unrecognized phenomena: robust shortcut and disordered robustness. We then demonstrate that these phenomena are related to robust overfitting, a well-known issue in adversarial training. Building on these insights, we propose a novel adversarial training method: Instance-adaptive Smoothness Enhanced Adversarial Training (ISEAT). This method jointly smooths the input and weight loss landscapes in an instance-adaptive manner, preventing the exploitation of robust shortcut and thereby mitigating robust overfitting. Extensive experiments demonstrate the efficacy of ISEAT and its superiority over existing adversarial training methods. Code is available at https://github.com/TreeLLi/ISEAT.

引用

页数：11

共 50 条

[41] GAAT: Group Adaptive Adversarial Training to Improve the Trade-Off Between Robustness and Accuracy
Qian, Yaguan
Liang, Xiaoyu
Kang, Ming
Wang, Bin
Gu, Zhaoquan
Wang, Xing
Wu, Chunming
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2022, 36 (13)
[42] Improving the Shortest Plank: Vulnerability-Aware Adversarial Training for Robust Recommender System
Zhang, Kaike
Cao, Qi
Wu, Yunfan
Sun, Fei
Shen, Huawei
Cheng, Xueqi
PROCEEDINGS OF THE EIGHTEENTH ACM CONFERENCE ON RECOMMENDER SYSTEMS, RECSYS 2024, 2024, : 680 - 689
[43] A Gift from Label Smoothing: Robust Training with Adaptive Label Smoothing via Auxiliary Classifier under Label Noise
Ko, Jongwoo
Yi, Bongsoo
Yun, Se-Young
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 7, 2023, : 8325 - 8333
[44] Push Stricter to Decide Better: A Class-Conditional Feature Adaptive Framework for Improving Adversarial Robustness
Yin, Jia-Li
Chen, Bin
Zhu, Wanqing
Chen, Bo-Hao
Liu, Ximeng
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2023, 18 : 2119 - 2131
[45] Alignment-Based Adversarial Training (ABAT) for Improving the Robustness and Accuracy of EEG-Based BCIs
Chen, Xiaoqing
Wang, Ziwei
Wu, Dongrui
IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2024, 32 : 1703 - 1714
[46] Improving the Robustness and Quality of Biomedical CNN Models through Adaptive Hyperparameter Tuning
Iqbal, Saeed
Qureshi, Adnan N.
Ullah, Amin
Li, Jianqiang
Mahmood, Tariq
APPLIED SCIENCES-BASEL, 2022, 12 (22):
[47] Understanding adversarial training: Increasing local stability of supervised models through robust optimization
Shaham, Uri
Yamada, Yutaro
Negahban, Sahand
NEUROCOMPUTING, 2018, 307 : 195 - 204
[48] SNN-RAT: Robustness-enhanced Spiking Neural Network through Regularized Adversarial Training
Ding, Jianhao
Bu, Tong
Yu, Zhaofei
Huang, Tiejun
Liu, Jian K.
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
[49] Improving the performance of adaptive arrays in nonstationary environments through data-adaptive training
Rabideau, DJ
Steinhardt, AO
THIRTIETH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1 AND 2, 1997, : 75 - 79
[50] Toward Improving Synthetic Audio Spoofing Detection Robustness via Meta-Learning and Disentangled Training With Adversarial Examples
Wang, Zhenyu
Hansen, John H. L.
IEEE ACCESS, 2024, 12 : 99894 - 99911

← 1 2 3 4 5 →