Understanding the Detrimental Class-level Effects of Data Augmentation

被引:0
|
作者
Kirichenko, Polina [1 ,2 ]
Ibrahim, Mark [2 ]
Balestriero, Randall [2 ]
Bouchacourt, Diane [2 ]
Vedantam, Ramakrishna [2 ]
Firooz, Hamed [2 ]
Wilson, Andrew Gordon [1 ]
机构
[1] New York Univ, New York, NY 10012 USA
[2] Meta AI, Boston, MA 02199 USA
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023) | 2023年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data augmentation (DA) encodes invariance and provides implicit regularization critical to a model's performance in image classification tasks. However, while DA improves average accuracy, recent studies have shown that its impact can be highly class dependent: achieving optimal average accuracy comes at the cost of significantly hurting individual class accuracy by as much as 20% on ImageNet. There has been little progress in resolving class-level accuracy drops due to a limited understanding of these effects. In this work, we present a framework for understanding how DA interacts with class-level learning dynamics. Using higher-quality multi-label annotations on ImageNet, we systematically categorize the affected classes and find that the majority are inherently ambiguous, co-occur, or involve fine-grained distinctions, while DA controls the model's bias towards one of the closely related classes. While many of the previously reported performance drops are explained by multi-label annotations, our analysis of class confusions reveals other sources of accuracy degradation. We show that simple class-conditional augmentation strategies informed by our framework improve performance on the negatively affected classes.
引用
收藏
页数:29
相关论文
共 50 条
  • [1] Understanding Class-level Testability Through Dynamic Analysis
    Tahir, Amjed
    MacDonell, Stephen G.
    Buchan, Jim
    PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON EVALUATION OF NOVEL APPROACHES TO SOFTWARE ENGINEERING (ENASE 2014), 2014, : 38 - 47
  • [2] Class-Level Logit Perturbation
    Li, Mengyang
    Su, Fengguang
    Wu, Ou
    Zhang, Ji
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (10) : 13926 - 13940
  • [3] Generalized Data Weighting via Class-level Gradient Manipulation
    Chen, Can
    Zheng, Shuhao
    Chen, Xi
    Dong, Erqun
    Liu, Xue
    Liu, Hao
    Dou, Dejing
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [4] Class-level effects on cybervictimization in secondary students:A multilevel analysis
    Alvarez-Garcia, David
    Nunez, Andrea
    del Carmen Perez-Fuentes, Maria
    Vallejo, Guillermo
    REVISTA DE EDUCACION, 2022, (397): : 153 - 178
  • [5] Class-level Non-Interference
    Zanardini, Damiano
    NEW GENERATION COMPUTING, 2012, 30 (2-3) : 241 - 270
  • [6] Class-level Non-Interference
    Damiano Zanardini
    New Generation Computing, 2012, 30 : 241 - 270
  • [7] Class-level spectral features for emotion recognition
    Bitouk, Dmitri
    Verma, Ragini
    Nenkova, Ani
    SPEECH COMMUNICATION, 2010, 52 (7-8) : 613 - 625
  • [8] Effects of Class-level Vegetation Characteristics on Nesting Success of Bewick's Wrens
    Harrod, Sara E.
    Green, M. Clay
    SOUTHEASTERN NATURALIST, 2018, 17 (03) : 381 - 395
  • [9] Web of Students: Class-Level Friendship Network Discovery from Educational Big Data
    Guo, Teng
    Tang, Tao
    Zhang, Dongyu
    Li, Jianxin
    Xia, Feng
    WEB INFORMATION SYSTEMS ENGINEERING - WISE 2021, PT I, 2021, 13080 : 497 - 511
  • [10] Class-level modular analysis for object oriented languages
    Logozzo, F
    STATIC ANALYSIS, PROCEEDINGS, 2003, 2694 : 37 - 54