Understanding the Detrimental Class-level Effects of Data Augmentation

被引:0
|
作者
Kirichenko, Polina [1 ,2 ]
Ibrahim, Mark [2 ]
Balestriero, Randall [2 ]
Bouchacourt, Diane [2 ]
Vedantam, Ramakrishna [2 ]
Firooz, Hamed [2 ]
Wilson, Andrew Gordon [1 ]
机构
[1] New York Univ, New York, NY 10012 USA
[2] Meta AI, Boston, MA 02199 USA
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023) | 2023年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data augmentation (DA) encodes invariance and provides implicit regularization critical to a model's performance in image classification tasks. However, while DA improves average accuracy, recent studies have shown that its impact can be highly class dependent: achieving optimal average accuracy comes at the cost of significantly hurting individual class accuracy by as much as 20% on ImageNet. There has been little progress in resolving class-level accuracy drops due to a limited understanding of these effects. In this work, we present a framework for understanding how DA interacts with class-level learning dynamics. Using higher-quality multi-label annotations on ImageNet, we systematically categorize the affected classes and find that the majority are inherently ambiguous, co-occur, or involve fine-grained distinctions, while DA controls the model's bias towards one of the closely related classes. While many of the previously reported performance drops are explained by multi-label annotations, our analysis of class confusions reveals other sources of accuracy degradation. We show that simple class-conditional augmentation strategies informed by our framework improve performance on the negatively affected classes.
引用
收藏
页数:29
相关论文
共 50 条
  • [21] Class-Level Multiple Distributions Representation are Necessary for Semantic Segmentation
    Yin, Jianjian
    Peng, Ningkang
    Chen, Yi
    Zheng, Zhichao
    Gu, Yanhui
    Zhou, Junsheng
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PT VII, DASFAA 2024, 2024, 14856 : 340 - 351
  • [22] ClassSum: a deep learning model for class-level code summarization
    Mingchen Li
    Huiqun Yu
    Guisheng Fan
    Ziyi Zhou
    Jiawen Huang
    Neural Computing and Applications, 2023, 35 : 3373 - 3393
  • [23] Behavior of class-level landscape metrics across gradients of class aggregation and area
    Neel, MC
    McGarigal, K
    Cushman, SA
    LANDSCAPE ECOLOGY, 2004, 19 (04) : 435 - 455
  • [24] Instance-level and Class-level Contrastive Incremental Learning for Image Classification
    Han, Jia-yi
    Liu, Jian-wei
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [25] Improving Emotion Recognition using Class-Level Spectral Features
    Bitouk, Dmitri
    Nenkova, Ani
    Verma, Ragini
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1991 - +
  • [26] Visualization of aggregated information to support class-level software evolution?
    Rahimi, Mona
    Vierhauser, Michael
    JOURNAL OF SYSTEMS AND SOFTWARE, 2022, 192
  • [27] Class-level Structural Relation Modelling and Smoothing for Visual Representation Learning
    Chen, Zitan
    Qi, Zhuang
    Cao, Xiao
    Li, Xiangxian
    Meng, Xiangxu
    Meng, Lei
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 2964 - 2972
  • [28] Admission control schemes to provide class-level QoS in multiservice networks
    Kalyanasundaram, S
    Chong, EKP
    Shroff, NB
    COMPUTER NETWORKS, 2001, 35 (2-3) : 307 - 326
  • [29] Assessing software product maintainability based on class-level structural measures
    Benestad, Hans Christian
    Anda, Bente
    Arisholm, Erik
    PRODUCT-FOCUSED SOFTWARE PROCESS IMPROVEMENT, PROCEEDINGS, 2006, 4034 : 94 - 111
  • [30] Class-Level Adaptation Network with Self Training for Unsupervised Domain Adaptation
    Jin, Yuncheng
    Chen, Zhihong
    Cheng, Zhaowei
    Chen, Chao
    Jin, Xinyu
    Sun, Bin
    BDCAT'19: PROCEEDINGS OF THE 6TH IEEE/ACM INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING, APPLICATIONS AND TECHNOLOGIES, 2019, : 137 - 143