Understanding the Detrimental Class-level Effects of Data Augmentation

被引:0
|
作者
Kirichenko, Polina [1 ,2 ]
Ibrahim, Mark [2 ]
Balestriero, Randall [2 ]
Bouchacourt, Diane [2 ]
Vedantam, Ramakrishna [2 ]
Firooz, Hamed [2 ]
Wilson, Andrew Gordon [1 ]
机构
[1] New York Univ, New York, NY 10012 USA
[2] Meta AI, Boston, MA 02199 USA
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023) | 2023年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data augmentation (DA) encodes invariance and provides implicit regularization critical to a model's performance in image classification tasks. However, while DA improves average accuracy, recent studies have shown that its impact can be highly class dependent: achieving optimal average accuracy comes at the cost of significantly hurting individual class accuracy by as much as 20% on ImageNet. There has been little progress in resolving class-level accuracy drops due to a limited understanding of these effects. In this work, we present a framework for understanding how DA interacts with class-level learning dynamics. Using higher-quality multi-label annotations on ImageNet, we systematically categorize the affected classes and find that the majority are inherently ambiguous, co-occur, or involve fine-grained distinctions, while DA controls the model's bias towards one of the closely related classes. While many of the previously reported performance drops are explained by multi-label annotations, our analysis of class confusions reveals other sources of accuracy degradation. We show that simple class-conditional augmentation strategies informed by our framework improve performance on the negatively affected classes.
引用
收藏
页数:29
相关论文
共 50 条
  • [31] Feature selection using class-level regularized self-representation
    Lu, Zhenghua
    Chu, Qihuan
    APPLIED INTELLIGENCE, 2023, 53 (11) : 13130 - 13144
  • [32] Attaining Class-Level Forgetting in Pretrained Model Using Few Samples
    Singh, Pravendra
    Mazumder, Pratik
    Karim, Mohammed Asad
    COMPUTER VISION, ECCV 2022, PT XIII, 2022, 13673 : 433 - 448
  • [33] Addressing Label Sparsity With Class-Level Common Sense for Google Maps
    Welty, Chris
    Aroyo, Lora
    Korn, Flip
    McCarthy, Sara M.
    Zhao, Shubin
    FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2022, 5
  • [34] Generating Class-Level Integration Tests Using Call Site Information
    Derakhshanfar, Pouria
    Devroey, Xavier
    Panichella, Annibale
    Zaidman, Andy
    van Deursen, Arie
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2023, 49 (04) : 2069 - 2087
  • [35] Feature selection using class-level regularized self-representation
    Zhenghua Lu
    Qihuan Chu
    Applied Intelligence, 2023, 53 : 13130 - 13144
  • [36] Sexual harassment and psychological complaints: student- and class-level associations
    Laftman, S. Brolin
    Bjereld, Y.
    Modin, B.
    Lofstedt, P.
    EUROPEAN JOURNAL OF PUBLIC HEALTH, 2020, 30
  • [37] An Early Ordovician (Floian) asterozoan (Echinodermata) of problematic class-level affinities
    Blake, Daniel B.
    Gahn, Forest J.
    Guensburg, Thomas E.
    JOURNAL OF PALEONTOLOGY, 2020, 94 (02) : 358 - 365
  • [38] A practical approach for technical debt prioritization based on class-level forecasting
    Tsoukalas, Dimitrios
    Siavvas, Miltiadis
    Kehagias, Dionysios
    Ampatzoglou, Apostolos
    Chatzigeorgiou, Alexander
    JOURNAL OF SOFTWARE-EVOLUTION AND PROCESS, 2024, 36 (04)
  • [39] Class-level risk factors for bullying and victimization in Portuguese middle schools
    Coelho, Vitor Alexandre
    Sousa, Vanda
    SCHOOL PSYCHOLOGY INTERNATIONAL, 2018, 39 (02) : 121 - 137
  • [40] An extensive study of class-level and method-level test case selection for continuous integration
    Li, Yingling
    Wang, Junjie
    Yang, Yun
    Wang, Qing
    JOURNAL OF SYSTEMS AND SOFTWARE, 2020, 167