PRECISE STATISTICAL ANALYSIS OF CLASSIFICATION ACCURACIES FOR ADVERSARIAL TRAINING

被引:11
|
作者
Javanmard, Adel [1 ]
Soltanolkotabi, Mahdi [2 ]
机构
[1] Univ Southern Calif, Dept Data Sci & Operat, Los Angeles, CA 90089 USA
[2] Univ Southern Calif, Dept Elect & Comp Engn, Los Angeles, CA 90089 USA
来源
ANNALS OF STATISTICS | 2022年 / 50卷 / 04期
关键词
Precise high-dimensional asymptotics; adversarial training; binary classification; PHASE-TRANSITIONS; SLOPE;
D O I
10.1214/22-AOS2180
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Despite the wide empirical success of modern machine learning algorithms and models in a multitude of applications, they are known to be highly susceptible to seemingly small indiscernible perturbations to the input data known as adversarial attacks. A variety of recent adversarial training procedures have been proposed to remedy this issue. Despite the success of such procedures at increasing accuracy on adversarially perturbed inputs or robust accuracy, these techniques often reduce accuracy on natural unperturbed inputs or standard accuracy. Complicating matters further, the effect and trend of adversarial training procedures on standard and robust accuracy is rather counter intuitive and radically dependent on a variety of factors including the perceived form of the perturbation during training, size/quality of data, model overparameterization, etc. In this paper, we focus on binary classification problems where the data is generated according to the mixture of two Gaussians with general anisotropic covariance matrices and derive a precise characterization of the standard and robust accuracy for a class of minimax adversarially trained models. We consider a general norm-based adversarial model, where the adversary can add perturbations of bounded l(p) norm to each input data, for an arbitrary p >= 1. Our comprehensive analysis allows us to theoretically explain several intriguing empirical phenomena and provide a precise understanding of the role of different problem parameters on standard and robust accuracies.
引用
收藏
页码:2127 / 2156
页数:30
相关论文
共 50 条
  • [21] Parameter Interpolation Adversarial Training for Robust Image Classification
    Liu, Xin
    Yang, Yichen
    He, Kun
    Hopcroft, John E.
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2025, 20 : 1613 - 1623
  • [22] Improved Text Classification via Contrastive Adversarial Training
    Pan, Lin
    Hang, Chung-Wei
    Sil, Avi
    Potdar, Saloni
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 11130 - 11138
  • [23] Hierarchical gated recurrent neural network with adversarial and virtual adversarial training on text classification
    Poon, Hoon-Keng
    Yap, Wun-She
    Tee, Yee-Kai
    Lee, Wai-Kong
    Goi, Bok-Min
    NEURAL NETWORKS, 2019, 119 : 299 - 312
  • [24] Effective Classification Using a Small Training Set Based on Discretization and Statistical Analysis
    Bruni, Renato
    Bianchi, Gianpiero
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (09) : 2349 - 2361
  • [25] Domain adversarial training for classification of cracking in images of concrete surfaces
    Bruno Oliveira Santos
    Jónatas Valença
    João P. Costeira
    Eduardo Julio
    AI in Civil Engineering, 1 (1):
  • [26] AdvAndMal: Adversarial Training for Android Malware Detection and Family Classification
    Wang, Chenyue
    Zhang, Linlin
    Zhao, Kai
    Ding, Xuhui
    Wang, Xusheng
    SYMMETRY-BASEL, 2021, 13 (06):
  • [27] Meta-Adversarial Training of Neural Networks for Binary Classification
    Saadallah, Amal
    Morik, Katharina
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [28] Adversarial Divergence Training for Universal Cross-Scene Classification
    Zhu, Sihan
    Wu, Chen
    Du, Bo
    Zhang, Liangpei
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [29] Adversarial Training for Relation Classification with Attention Based Gate Mechanism
    Cao, Pengfei
    Chen, Yubo
    Liu, Kang
    Zhao, Jun
    KNOWLEDGE GRAPH AND SEMANTIC COMPUTING: KNOWLEDGE COMPUTING AND LANGUAGE UNDERSTANDING (CCKS 2018), 2019, 957 : 91 - 102
  • [30] Precise Tweet Classification and Sentiment Analysis
    Batool, Rabia
    Khattak, Asad Masood
    Maqbool, Jahanzeb
    Lee, Sungyoung
    2013 IEEE/ACIS 12TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS), 2013, : 461 - 466