PRECISE STATISTICAL ANALYSIS OF CLASSIFICATION ACCURACIES FOR ADVERSARIAL TRAINING

被引:11
|
作者
Javanmard, Adel [1 ]
Soltanolkotabi, Mahdi [2 ]
机构
[1] Univ Southern Calif, Dept Data Sci & Operat, Los Angeles, CA 90089 USA
[2] Univ Southern Calif, Dept Elect & Comp Engn, Los Angeles, CA 90089 USA
来源
ANNALS OF STATISTICS | 2022年 / 50卷 / 04期
关键词
Precise high-dimensional asymptotics; adversarial training; binary classification; PHASE-TRANSITIONS; SLOPE;
D O I
10.1214/22-AOS2180
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Despite the wide empirical success of modern machine learning algorithms and models in a multitude of applications, they are known to be highly susceptible to seemingly small indiscernible perturbations to the input data known as adversarial attacks. A variety of recent adversarial training procedures have been proposed to remedy this issue. Despite the success of such procedures at increasing accuracy on adversarially perturbed inputs or robust accuracy, these techniques often reduce accuracy on natural unperturbed inputs or standard accuracy. Complicating matters further, the effect and trend of adversarial training procedures on standard and robust accuracy is rather counter intuitive and radically dependent on a variety of factors including the perceived form of the perturbation during training, size/quality of data, model overparameterization, etc. In this paper, we focus on binary classification problems where the data is generated according to the mixture of two Gaussians with general anisotropic covariance matrices and derive a precise characterization of the standard and robust accuracy for a class of minimax adversarially trained models. We consider a general norm-based adversarial model, where the adversary can add perturbations of bounded l(p) norm to each input data, for an arbitrary p >= 1. Our comprehensive analysis allows us to theoretically explain several intriguing empirical phenomena and provide a precise understanding of the role of different problem parameters on standard and robust accuracies.
引用
收藏
页码:2127 / 2156
页数:30
相关论文
共 50 条
  • [1] Analysis and Extensions of Adversarial Training for Video Classification
    Kinfu, Kaleab A.
    Vidal, Rene
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 3415 - 3424
  • [2] Precise Tradeoffs in Adversarial Training for Linear Regression
    Javanmard, Adel
    Soltanolkotabi, Mahdi
    Hassani, Hamed
    CONFERENCE ON LEARNING THEORY, VOL 125, 2020, 125
  • [3] An Adversarial Training Framework for Relation Classification
    Liu, Wenpeng
    Cao, Yanan
    Cao, Cong
    Liu, Yanbing
    Hu, Yue
    Guo, Li
    COMPUTATIONAL SCIENCE - ICCS 2018, PT II, 2018, 10861 : 194 - 205
  • [4] An adversarial training method for text classification
    Liu, Xiaoyang
    Dai, Shanghong
    Fiumara, Giacomo
    De Meo, Pasquale
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2023, 35 (08)
  • [5] The geometry of adversarial training in binary classification
    Bungert, Leon
    Trillos, Nicolas Garcia
    Murray, Ryan
    INFORMATION AND INFERENCE-A JOURNAL OF THE IMA, 2023, 12 (02) : 921 - 968
  • [6] Adversarial Training for Fake News Classification
    Tariq, Abdullah
    Mehmood, Abid
    Elhadef, Mourad
    Khan, Muhammad Usman Ghani
    IEEE ACCESS, 2022, 10 : 82706 - 82715
  • [7] Improvements to adversarial training for text classification
    He, Jia-Long
    Zhang, Xiao-Lin
    Wang, Yong-Ping
    Gu, Rui-Chun
    Liu, Li-Xin
    Xu, En-Hui
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2024, 46 (02) : 5191 - 5202
  • [8] THE CURSE OF OVERPARAMETRIZATION IN ADVERSARIAL TRAINING: PRECISE ANALYSIS OF ROBUST GENERALIZATION FOR RANDOM FEATURES REGRESSION
    Hassani, Hamed
    Javanmard, Adel
    ANNALS OF STATISTICS, 2024, 52 (02): : 441 - 465
  • [9] A Statistical Threshold for Adversarial Classification in Laplace Mechanisms
    Unsal, Ayse
    Onen, Melek
    2021 IEEE INFORMATION THEORY WORKSHOP (ITW), 2021,
  • [10] Adversarial classification: An adversarial risk analysis approach
    Naveiro, Roi
    Redondo, Alberto
    Insua, David Rios
    Ruggeri, Fabrizio
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2019, 113 : 133 - 148