Understanding the Detrimental Class-level Effects of Data Augmentation

被引：0

作者：

Kirichenko, Polina ^{[1
,2
]}

Ibrahim, Mark ^{[2
]}

Balestriero, Randall ^{[2
]}

Bouchacourt, Diane ^{[2
]}

Vedantam, Ramakrishna ^{[2
]}

Firooz, Hamed ^{[2
]}

Wilson, Andrew Gordon ^{[1
]}

机构：

[1] New York Univ, New York, NY 10012 USA

[2] Meta AI, Boston, MA 02199 USA

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023) | 2023年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Data augmentation (DA) encodes invariance and provides implicit regularization critical to a model's performance in image classification tasks. However, while DA improves average accuracy, recent studies have shown that its impact can be highly class dependent: achieving optimal average accuracy comes at the cost of significantly hurting individual class accuracy by as much as 20% on ImageNet. There has been little progress in resolving class-level accuracy drops due to a limited understanding of these effects. In this work, we present a framework for understanding how DA interacts with class-level learning dynamics. Using higher-quality multi-label annotations on ImageNet, we systematically categorize the affected classes and find that the majority are inherently ambiguous, co-occur, or involve fine-grained distinctions, while DA controls the model's bias towards one of the closely related classes. While many of the previously reported performance drops are explained by multi-label annotations, our analysis of class confusions reveals other sources of accuracy degradation. We show that simple class-conditional augmentation strategies informed by our framework improve performance on the negatively affected classes.

引用

页数：29

共 50 条

[31] Feature selection using class-level regularized self-representation
Lu, Zhenghua
Chu, Qihuan
APPLIED INTELLIGENCE, 2023, 53 (11) : 13130 - 13144
[32] Attaining Class-Level Forgetting in Pretrained Model Using Few Samples
Singh, Pravendra
Mazumder, Pratik
Karim, Mohammed Asad
COMPUTER VISION, ECCV 2022, PT XIII, 2022, 13673 : 433 - 448
[33] Addressing Label Sparsity With Class-Level Common Sense for Google Maps
Welty, Chris
Aroyo, Lora
Korn, Flip
McCarthy, Sara M.
Zhao, Shubin
FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2022, 5
[34] Generating Class-Level Integration Tests Using Call Site Information
Derakhshanfar, Pouria
Devroey, Xavier
Panichella, Annibale
Zaidman, Andy
van Deursen, Arie
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2023, 49 (04) : 2069 - 2087
[35] Feature selection using class-level regularized self-representation
Zhenghua Lu
Qihuan Chu
Applied Intelligence, 2023, 53 : 13130 - 13144
[36] Sexual harassment and psychological complaints: student- and class-level associations
Laftman, S. Brolin
Bjereld, Y.
Modin, B.
Lofstedt, P.
EUROPEAN JOURNAL OF PUBLIC HEALTH, 2020, 30
[37] An Early Ordovician (Floian) asterozoan (Echinodermata) of problematic class-level affinities
Blake, Daniel B.
Gahn, Forest J.
Guensburg, Thomas E.
JOURNAL OF PALEONTOLOGY, 2020, 94 (02) : 358 - 365
[38] A practical approach for technical debt prioritization based on class-level forecasting
Tsoukalas, Dimitrios
Siavvas, Miltiadis
Kehagias, Dionysios
Ampatzoglou, Apostolos
Chatzigeorgiou, Alexander
JOURNAL OF SOFTWARE-EVOLUTION AND PROCESS, 2024, 36 (04)
[39] Class-level risk factors for bullying and victimization in Portuguese middle schools
Coelho, Vitor Alexandre
Sousa, Vanda
SCHOOL PSYCHOLOGY INTERNATIONAL, 2018, 39 (02) : 121 - 137
[40] An extensive study of class-level and method-level test case selection for continuous integration
Li, Yingling
Wang, Junjie
Yang, Yun
Wang, Qing
JOURNAL OF SYSTEMS AND SOFTWARE, 2020, 167

← 1 2 3 4 5 →