PRECISE STATISTICAL ANALYSIS OF CLASSIFICATION ACCURACIES FOR ADVERSARIAL TRAINING

被引：11

作者：

Javanmard, Adel ^{[1
]}

Soltanolkotabi, Mahdi ^{[2
]}

机构：

[1] Univ Southern Calif, Dept Data Sci & Operat, Los Angeles, CA 90089 USA

[2] Univ Southern Calif, Dept Elect & Comp Engn, Los Angeles, CA 90089 USA

来源：

ANNALS OF STATISTICS | 2022年 / 50卷 / 04期

关键词：

Precise high-dimensional asymptotics; adversarial training; binary classification; PHASE-TRANSITIONS; SLOPE;

D O I：

10.1214/22-AOS2180

中图分类号：

O21 [概率论与数理统计]; C8 [统计学];

学科分类号：

020208 ; 070103 ; 0714 ;

摘要：

Despite the wide empirical success of modern machine learning algorithms and models in a multitude of applications, they are known to be highly susceptible to seemingly small indiscernible perturbations to the input data known as adversarial attacks. A variety of recent adversarial training procedures have been proposed to remedy this issue. Despite the success of such procedures at increasing accuracy on adversarially perturbed inputs or robust accuracy, these techniques often reduce accuracy on natural unperturbed inputs or standard accuracy. Complicating matters further, the effect and trend of adversarial training procedures on standard and robust accuracy is rather counter intuitive and radically dependent on a variety of factors including the perceived form of the perturbation during training, size/quality of data, model overparameterization, etc. In this paper, we focus on binary classification problems where the data is generated according to the mixture of two Gaussians with general anisotropic covariance matrices and derive a precise characterization of the standard and robust accuracy for a class of minimax adversarially trained models. We consider a general norm-based adversarial model, where the adversary can add perturbations of bounded l(p) norm to each input data, for an arbitrary p >= 1. Our comprehensive analysis allows us to theoretically explain several intriguing empirical phenomena and provide a precise understanding of the role of different problem parameters on standard and robust accuracies.

引用

页码：2127 / 2156

页数：30

共 50 条

[21] Parameter Interpolation Adversarial Training for Robust Image Classification
Liu, Xin
Yang, Yichen
He, Kun
Hopcroft, John E.
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2025, 20 : 1613 - 1623
[22] Improved Text Classification via Contrastive Adversarial Training
Pan, Lin
Hang, Chung-Wei
Sil, Avi
Potdar, Saloni
THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 11130 - 11138
[23] Hierarchical gated recurrent neural network with adversarial and virtual adversarial training on text classification
Poon, Hoon-Keng
Yap, Wun-She
Tee, Yee-Kai
Lee, Wai-Kong
Goi, Bok-Min
NEURAL NETWORKS, 2019, 119 : 299 - 312
[24] Effective Classification Using a Small Training Set Based on Discretization and Statistical Analysis
Bruni, Renato
Bianchi, Gianpiero
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (09) : 2349 - 2361
[25] Domain adversarial training for classification of cracking in images of concrete surfaces
Bruno Oliveira Santos
Jónatas Valença
João P. Costeira
Eduardo Julio
AI in Civil Engineering, 1 (1):
[26] AdvAndMal: Adversarial Training for Android Malware Detection and Family Classification
Wang, Chenyue
Zhang, Linlin
Zhao, Kai
Ding, Xuhui
Wang, Xusheng
SYMMETRY-BASEL, 2021, 13 (06):
[27] Meta-Adversarial Training of Neural Networks for Binary Classification
Saadallah, Amal
Morik, Katharina
2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
[28] Adversarial Divergence Training for Universal Cross-Scene Classification
Zhu, Sihan
Wu, Chen
Du, Bo
Zhang, Liangpei
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
[29] Adversarial Training for Relation Classification with Attention Based Gate Mechanism
Cao, Pengfei
Chen, Yubo
Liu, Kang
Zhao, Jun
KNOWLEDGE GRAPH AND SEMANTIC COMPUTING: KNOWLEDGE COMPUTING AND LANGUAGE UNDERSTANDING (CCKS 2018), 2019, 957 : 91 - 102
[30] Precise Tweet Classification and Sentiment Analysis
Batool, Rabia
Khattak, Asad Masood
Maqbool, Jahanzeb
Lee, Sungyoung
2013 IEEE/ACIS 12TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS), 2013, : 461 - 466

← 1 2 3 4 5 →