PRECISE STATISTICAL ANALYSIS OF CLASSIFICATION ACCURACIES FOR ADVERSARIAL TRAINING

被引:11
|
作者
Javanmard, Adel [1 ]
Soltanolkotabi, Mahdi [2 ]
机构
[1] Univ Southern Calif, Dept Data Sci & Operat, Los Angeles, CA 90089 USA
[2] Univ Southern Calif, Dept Elect & Comp Engn, Los Angeles, CA 90089 USA
来源
ANNALS OF STATISTICS | 2022年 / 50卷 / 04期
关键词
Precise high-dimensional asymptotics; adversarial training; binary classification; PHASE-TRANSITIONS; SLOPE;
D O I
10.1214/22-AOS2180
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Despite the wide empirical success of modern machine learning algorithms and models in a multitude of applications, they are known to be highly susceptible to seemingly small indiscernible perturbations to the input data known as adversarial attacks. A variety of recent adversarial training procedures have been proposed to remedy this issue. Despite the success of such procedures at increasing accuracy on adversarially perturbed inputs or robust accuracy, these techniques often reduce accuracy on natural unperturbed inputs or standard accuracy. Complicating matters further, the effect and trend of adversarial training procedures on standard and robust accuracy is rather counter intuitive and radically dependent on a variety of factors including the perceived form of the perturbation during training, size/quality of data, model overparameterization, etc. In this paper, we focus on binary classification problems where the data is generated according to the mixture of two Gaussians with general anisotropic covariance matrices and derive a precise characterization of the standard and robust accuracy for a class of minimax adversarially trained models. We consider a general norm-based adversarial model, where the adversary can add perturbations of bounded l(p) norm to each input data, for an arbitrary p >= 1. Our comprehensive analysis allows us to theoretically explain several intriguing empirical phenomena and provide a precise understanding of the role of different problem parameters on standard and robust accuracies.
引用
收藏
页码:2127 / 2156
页数:30
相关论文
共 50 条
  • [31] ATGAN: Adversarial training-based GAN for improving adversarial robustness generalization on image classification
    Desheng Wang
    Weidong Jin
    Yunpu Wu
    Aamir Khan
    Applied Intelligence, 2023, 53 : 24492 - 24508
  • [32] ATGAN: Adversarial training-based GAN for improving adversarial robustness generalization on image classification
    Wang, Desheng
    Jin, Weidong
    Wu, Yunpu
    Khan, Aamir
    APPLIED INTELLIGENCE, 2023, 53 (20) : 24492 - 24508
  • [34] A Game-Theoretic Analysis of Adversarial Classification
    Dritsoula, Lemonia
    Loiseau, Patrick
    Musacchio, John
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2017, 12 (12) : 3094 - 3109
  • [35] Stability Analysis and Generalization Bounds of Adversarial Training
    Xiao, Jiancong
    Fan, Yanbo
    Sun, Ruoyu
    Wang, Jue
    Luo, Zhi-Quan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [36] Enhanced DNNs for malware classification with GAN-based adversarial training
    Yunchun Zhang
    Haorui Li
    Yang Zheng
    Shaowen Yao
    Jiaqi Jiang
    Journal of Computer Virology and Hacking Techniques, 2021, 17 : 153 - 163
  • [37] Enhanced DNNs for malware classification with GAN-based adversarial training
    Zhang, Yunchun
    Li, Haorui
    Zheng, Yang
    Yao, Shaowen
    Jiang, Jiaqi
    JOURNAL OF COMPUTER VIROLOGY AND HACKING TECHNIQUES, 2021, 17 (02) : 153 - 163
  • [38] Improved Dynamic Memory Network for Dialogue Act Classification with Adversarial Training
    Wan, Yao
    Yan, Wenqiang
    Gao, Jianwei
    Zhao, Zhou
    Wu, Jian
    Yu, Philip S.
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 841 - 850
  • [39] Accelerate adversarial training with loss guided propagation for robust image classification
    Xu, Changkai
    Zhang, Chunjie
    Yang, Yanwu
    Yang, Huaizhi
    Bo, Yijun
    Li, Danyong
    Zhang, Riquan
    INFORMATION PROCESSING & MANAGEMENT, 2023, 60 (01)
  • [40] Directional adversarial training for cost sensitive deep learning classification applications
    Terzi, Matteo
    Susto, Gian Antonio
    Chaudhari, Pratik
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2020, 91