Centrality and Consistency: Two-Stage Clean Samples Identification for Learning with Instance-Dependent Noisy Labels

被引:11
|
作者
Zhao, Ganlong [1 ,2 ]
Li, Guanbin [1 ]
Qin, Yipeng [3 ]
Liu, Feng [4 ]
Yu, Yizhou [2 ]
机构
[1] Sun Yat Sen Univ, Guangzhou 510006, Peoples R China
[2] Univ Hong Kong, Hong Kong, Peoples R China
[3] Cardiff Univ, Cardiff, Wales
[4] Deepwise AI Lab, Beijing, Peoples R China
来源
COMPUTER VISION, ECCV 2022, PT XXV | 2022年 / 13685卷
基金
中国国家自然科学基金;
关键词
Instance-dependent noise; Noisy label; Image classification;
D O I
10.1007/978-3-031-19806-9_2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep models trained with noisy labels are prone to overfitting and struggle in generalization. Most existing solutions are based on an ideal assumption that the label noise is class-conditional, i.e. instances of the same class share the same noise model, and are independent of features. While in practice, the real-world noise patterns are usually more fine-grained as instance-dependent ones, which poses a big challenge, especially in the presence of inter-class imbalance. In this paper, we propose a two-stage clean samples identification method to address the aforementioned challenge. First, we employ a class-level feature clustering procedure for the early identification of clean samples that are near the class-wise prediction centers. Notably, we address the class imbalance problem by aggregating rare classes according to their prediction entropy. Second, for the remaining clean samples that are close to the ground truth class boundary (usually mixed with the samples with instance-dependent noises), we propose a novel consistency-based classification method that identifies them using the consistency of two classifier heads: the higher the consistency, the larger the probability that a sample is clean. Extensive experiments on several challenging benchmarks demonstrate the superior performance of our method against the state-of-the-art. Code is available at https://github.com/uitrbn/TSCSI_IDN.
引用
收藏
页码:21 / 37
页数:17
相关论文
共 50 条
  • [11] Instance-Dependent Noisy-Label Learning with Graphical Model Based Noise-Rate Estimation
    Garg, Arpit
    Cuong Nguyen
    Felix, Rafael
    Thanh-Toan Do
    Carneiro, Gustavo
    COMPUTER VISION-ECCV 2024, PT IV, 2025, 15062 : 372 - 389
  • [12] CT-based COPD identification using multiple instance learning with two-stage attention
    Xue, Mengfan
    Jia, Shishen
    Chen, Ling
    Huang, Hailiang
    Yu, Lijuan
    Zhu, Wentao
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2023, 230
  • [13] Weakly Supervised Instance Segmentation Based on Two-Stage Transfer Learning
    Sun, Yongqing
    Liao, Shisha
    Gao, Chenqiang
    Xie, Chengjuan
    Yang, Feng
    Zhao, Yue
    Sagata, Atsushi
    IEEE ACCESS, 2020, 8 : 24135 - 24144
  • [14] Typicality- and instance-dependent label noise-combating: a novel framework for simulating and combating real-world noisy labels for endoscopic polyp classification
    Gao, Yun
    Fu, Junhu
    Wang, Yuanyuan
    Guo, Yi
    VISUAL COMPUTING FOR INDUSTRY BIOMEDICINE AND ART, 2024, 7 (01)
  • [15] Two-Stage Deep Learning for Noisy-Reverberant Speech Enhancement
    Zhao, Yan
    Wang, Zhong-Qiu
    Wang, DeLiang
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (01) : 53 - 62
  • [16] A Two-Stage Multiple Instance Learning Framework for the Detection of Breast Cancer in Mammograms
    Chandra, Sarath K.
    Chakravarty, Arunava
    Ghosh, Nirmalya
    Sarkar, Tandra
    Sethuraman, Ramanathan
    Sheet, Debdoot
    42ND ANNUAL INTERNATIONAL CONFERENCES OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY: ENABLING INNOVATIVE TECHNOLOGIES FOR GLOBAL HEALTHCARE EMBC'20, 2020, : 1128 - 1131
  • [17] TCC-net: A two-stage training method with contradictory loss and co-teaching based on meta-learning for learning with noisy labels
    Xia, Qiangqiang
    Lee, Feifei
    Chen, Qiu
    INFORMATION SCIENCES, 2023, 639
  • [18] Enhancing Retrieval-Augmented LMs with a Two-Stage Consistency Learning Compressor
    Xu, Chuankai
    Zhao, Dongming
    Wang, Bo
    Xing, Hanwen
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT III, ICIC 2024, 2024, 14877 : 511 - 522
  • [19] A two-stage deep learning strategy for weed identification in grassfields
    Calderara-Cea, Felipe
    Torres-Torriti, Miguel
    Cheein, Fernando Auat
    Delpiano, Jose
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2024, 225
  • [20] Multiple instance learning-based two-stage metric learning network for whole slide image classification
    Li, Xiaoyu
    Yang, Bei
    Chen, Tiandong
    Gao, Zheng
    Li, Huijie
    VISUAL COMPUTER, 2024, 40 (08): : 5717 - 5732