PRECISE STATISTICAL ANALYSIS OF CLASSIFICATION ACCURACIES FOR ADVERSARIAL TRAINING

被引：11

作者：

Javanmard, Adel ^{[1
]}

Soltanolkotabi, Mahdi ^{[2
]}

机构：

[1] Univ Southern Calif, Dept Data Sci & Operat, Los Angeles, CA 90089 USA

[2] Univ Southern Calif, Dept Elect & Comp Engn, Los Angeles, CA 90089 USA

来源：

ANNALS OF STATISTICS | 2022年 / 50卷 / 04期

关键词：

Precise high-dimensional asymptotics; adversarial training; binary classification; PHASE-TRANSITIONS; SLOPE;

D O I：

10.1214/22-AOS2180

中图分类号：

O21 [概率论与数理统计]; C8 [统计学];

学科分类号：

020208 ; 070103 ; 0714 ;

摘要：

Despite the wide empirical success of modern machine learning algorithms and models in a multitude of applications, they are known to be highly susceptible to seemingly small indiscernible perturbations to the input data known as adversarial attacks. A variety of recent adversarial training procedures have been proposed to remedy this issue. Despite the success of such procedures at increasing accuracy on adversarially perturbed inputs or robust accuracy, these techniques often reduce accuracy on natural unperturbed inputs or standard accuracy. Complicating matters further, the effect and trend of adversarial training procedures on standard and robust accuracy is rather counter intuitive and radically dependent on a variety of factors including the perceived form of the perturbation during training, size/quality of data, model overparameterization, etc. In this paper, we focus on binary classification problems where the data is generated according to the mixture of two Gaussians with general anisotropic covariance matrices and derive a precise characterization of the standard and robust accuracy for a class of minimax adversarially trained models. We consider a general norm-based adversarial model, where the adversary can add perturbations of bounded l(p) norm to each input data, for an arbitrary p >= 1. Our comprehensive analysis allows us to theoretically explain several intriguing empirical phenomena and provide a precise understanding of the role of different problem parameters on standard and robust accuracies.

引用

页码：2127 / 2156

页数：30

共 50 条

[31] ATGAN: Adversarial training-based GAN for improving adversarial robustness generalization on image classification
Desheng Wang
Weidong Jin
Yunpu Wu
Aamir Khan
Applied Intelligence, 2023, 53 : 24492 - 24508
[32] ATGAN: Adversarial training-based GAN for improving adversarial robustness generalization on image classification
Wang, Desheng
Jin, Weidong
Wu, Yunpu
Khan, Aamir
APPLIED INTELLIGENCE, 2023, 53 (20) : 24492 - 24508
[33] A STATISTICAL TECHNIQUE FOR COMPARING THE ACCURACIES OF SEVERAL CLASSIFIERS
LOONEY, SW
PATTERN RECOGNITION LETTERS, 1988, 8 (01) : 5 - 9
[34] A Game-Theoretic Analysis of Adversarial Classification
Dritsoula, Lemonia
Loiseau, Patrick
Musacchio, John
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2017, 12 (12) : 3094 - 3109
[35] Stability Analysis and Generalization Bounds of Adversarial Training
Xiao, Jiancong
Fan, Yanbo
Sun, Ruoyu
Wang, Jue
Luo, Zhi-Quan
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
[36] Enhanced DNNs for malware classification with GAN-based adversarial training
Yunchun Zhang
Haorui Li
Yang Zheng
Shaowen Yao
Jiaqi Jiang
Journal of Computer Virology and Hacking Techniques, 2021, 17 : 153 - 163
[37] Enhanced DNNs for malware classification with GAN-based adversarial training
Zhang, Yunchun
Li, Haorui
Zheng, Yang
Yao, Shaowen
Jiang, Jiaqi
JOURNAL OF COMPUTER VIROLOGY AND HACKING TECHNIQUES, 2021, 17 (02) : 153 - 163
[38] Improved Dynamic Memory Network for Dialogue Act Classification with Adversarial Training
Wan, Yao
Yan, Wenqiang
Gao, Jianwei
Zhao, Zhou
Wu, Jian
Yu, Philip S.
2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 841 - 850
[39] Accelerate adversarial training with loss guided propagation for robust image classification
Xu, Changkai
Zhang, Chunjie
Yang, Yanwu
Yang, Huaizhi
Bo, Yijun
Li, Danyong
Zhang, Riquan
INFORMATION PROCESSING & MANAGEMENT, 2023, 60 (01)
[40] Directional adversarial training for cost sensitive deep learning classification applications
Terzi, Matteo
Susto, Gian Antonio
Chaudhari, Pratik
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2020, 91

← 1 2 3 4 5 →