Robust shortcut and disordered robustness: Improving adversarial training through adaptive smoothing

被引：0

作者：

Li, Lin ^{[1
]}

Spratling, Michael ^{[1
,2
]}

机构：

[1] Kings Coll London, Dept Informat, London WC2B 4BG, England

[2] Univ Luxembourg, Dept Behav & Cognit Sci, L-4366 Esch Belval, Luxembourg

来源：

PATTERN RECOGNITION | 2025年 / 163卷

关键词：

Adversarial robustness; Adversarial training; Loss smoothing; Instance adaptive;

D O I：

10.1016/j.patcog.2025.111474

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep neural networks are highly susceptible to adversarial perturbations: artificial noise that corrupts input data in ways imperceptible to humans but causes incorrect predictions. Among the various defenses against these attacks, adversarial training has emerged as the most effective. In this work, we aim to enhance adversarial training to improve robustness against adversarial attacks. We begin by analyzing how adversarial vulnerability evolves during training from an instance-wise perspective. This analysis reveals two previously unrecognized phenomena: robust shortcut and disordered robustness. We then demonstrate that these phenomena are related to robust overfitting, a well-known issue in adversarial training. Building on these insights, we propose a novel adversarial training method: Instance-adaptive Smoothness Enhanced Adversarial Training (ISEAT). This method jointly smooths the input and weight loss landscapes in an instance-adaptive manner, preventing the exploitation of robust shortcut and thereby mitigating robust overfitting. Extensive experiments demonstrate the efficacy of ISEAT and its superiority over existing adversarial training methods. Code is available at https://github.com/TreeLLi/ISEAT.

引用

页数：11

共 50 条

[21] ATGAN: Adversarial training-based GAN for improving adversarial robustness generalization on image classification
Desheng Wang
Weidong Jin
Yunpu Wu
Aamir Khan
Applied Intelligence, 2023, 53 : 24492 - 24508
[22] Improving adversarial robustness of Bayesian neural networks via multi-task adversarial training
Chen, Xu
Liu, Chuancai
Zhao, Yue
Jia, Zhiyang
Jin, Ge
INFORMATION SCIENCES, 2022, 592 : 156 - 173
[23] ATGAN: Adversarial training-based GAN for improving adversarial robustness generalization on image classification
Wang, Desheng
Jin, Weidong
Wu, Yunpu
Khan, Aamir
APPLIED INTELLIGENCE, 2023, 53 (20) : 24492 - 24508
[24] Improving adversarial robustness through a curriculum-guided reliable distillation
Li, Jiawen
Fang, Kun
Huang, Xiaolin
Yang, Jie
COMPUTERS & SECURITY, 2023, 133
[25] Self-adaptive Adversarial Training for Robust Medical Segmentation
Wang, Fu
Fu, Zeyu
Zhang, Yanghao
Ruan, Wenjie
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT III, 2023, 14222 : 725 - 735
[26] Improving adversarial robustness of deep neural networks via adaptive margin evolution
Ma, Linhai
Liang, Liang
NEUROCOMPUTING, 2023, 551
[27] Improving Robustness of DNNs against Common Corruptions via Gaussian Adversarial Training
Yi, Chenyu
Li, Haoliang
Wan, Renjie
Kot, Alex C.
2020 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2020, : 17 - 20
[28] IMPROVING ROBUSTNESS OF DEEP NETWORKS USING CLUSTER-BASED ADVERSARIAL TRAINING
Rasheed, Bader
Khan, Adil
RUSSIAN LAW JOURNAL, 2023, 11 (09) : 412 - 420
[29] Improving the Robustness of Deep Neural Networks via Adversarial Training with Triplet Loss
Li, Pengcheng
Yi, Jinfeng
Zhou, Bowen
Zhang, Lijun
PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 2909 - 2915
[30] Increasing the Robustness of Image Quality Assessment Models Through Adversarial Training
Chistyakova, Anna
Antsiferova, Anastasia
Khrebtov, Maksim
Lavrushkin, Sergey
Arkhipenko, Konstantin
Vatolin, Dmitriy
Turdakov, Denis
TECHNOLOGIES, 2024, 12 (11)

← 1 2 3 4 5 →