Class-Wise Denoising for Robust Learning Under Label Noise

被引:11
|
作者
Gong, Chen [1 ]
Ding, Yongliang [1 ]
Han, Bo [2 ]
Niu, Gang [3 ]
Yang, Jian [4 ]
You, Jane [5 ]
Tao, Dacheng [6 ,7 ]
Sugiyama, Masashi [3 ,8 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Key Lab Intelligent Percept & Syst High Dimens Inf, Minist Educ,Jiangsu Key Lab Image & Video Understa, Nanjing 210094, Jiangsu, Peoples R China
[2] Hong Kong Baptist Univ, Dept Comp Sci, Hong Kong, Peoples R China
[3] RIKEN Ctr Adv Intelligence Project, Tokyo 1030027, Japan
[4] Nankai Univ, Coll Comp Sci, Tianjin 300071, Peoples R China
[5] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Peoples R China
[6] JD Explore Acad, Beijing 101100, Peoples R China
[7] Univ Sydney, Sydney, NSW 2006, Australia
[8] Univ Tokyo, Grad Sch Frontier Sci, Chiba 1138654, Japan
关键词
Noise measurement; Training; Entropy; Estimation; Neural networks; Matrix decomposition; Fasteners; Label noise; centroid estimation; unbiasedness; variance reduction;
D O I
10.1109/TPAMI.2022.3178690
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Label noise is ubiquitous in many real-world scenarios which often misleads training algorithm and brings about the degraded classification performance. Therefore, many approaches have been proposed to correct the loss function given corrupted labels to combat such label noise. Among them, a trend of works achieve this goal by unbiasedly estimating the data centroid, which plays an important role in constructing an unbiased risk estimator for minimization. However, they usually handle the noisy labels in different classes all at once, so the local information inherited by each class is ignored which often leads to unsatisfactory performance. To address this defect, this paper presents a novel robust learning algorithm dubbed "Class-Wise Denoising" (CWD), which tackles the noisy labels in a class-wise way to ease the entire noise correction task. Specifically, two virtual auxiliary sets are respectively constructed by presuming that the positive and negative labels in the training set are clean, so the original false-negative labels and false-positive ones are tackled separately. As a result, an improved centroid estimator can be designed which helps to yield more accurate risk estimator. Theoretically, we prove that: 1) the variance in centroid estimation can often be reduced by our CWD when compared with existing methods with unbiased centroid estimator; and 2) the performance of CWD trained on the noisy set will converge to that of the optimal classifier trained on the clean set with a convergence rate O(1/vn) )where n is the number of the training examples. These sound theoretical properties critically enable our CWD to produce the improved classification performance under label noise, which is also demonstrated by the comparisons with ten representative state-of-the-art methods on a variety of benchmark datasets.
引用
收藏
页码:2835 / 2848
页数:14
相关论文
共 50 条
  • [31] Class-Wise Adaptive Self Distillation for Federated Learning on Non-IID Data
    He, Yuting
    Chen, Yiqiang
    Yang, Xiaodong
    Zhang, Yingwei
    Zeng, Bixiao
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 12967 - 12968
  • [32] Analysis and Applications of Class-wise Robustness in Adversarial Training
    Tian, Qi
    Kuang, Kun
    Jiang, Kelu
    Wu, Fei
    Wang, Yisen
    KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 1561 - 1570
  • [33] Improved Open World Object Detection Using Class-Wise Feature Space Learning
    Iqbal, Muhammad Ali
    Yoon, Yeo Chan
    Khan, Muhammad U. S.
    Kim, Soo Kyun
    IEEE ACCESS, 2023, 11 : 131221 - 131236
  • [34] Class-wise feature extraction technique for multimodal data
    Silva, Elias R., Jr.
    Cavalcanti, George D. C.
    Ren, Tsang Ing
    NEUROCOMPUTING, 2016, 214 : 1001 - 1010
  • [35] Extended Class-wise Sparse Representation for Face Recognition
    Wu, Minghua
    Li, Shiren
    Hu, Jianguo
    PROCEEDINGS OF 2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2017, : 1611 - 1615
  • [36] Class-wise Graph Embedding-Based Active Learning for Hyperspectral Image Classification
    Liao, Xiaolong
    Tu, Bing
    Li, Jun
    Plaza, Antonio
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [37] Prototypical class-wise test-time adaptation
    Lee, Hojoon
    Lee, Seunghwan
    Jung, Inyoung
    Korea, Sungeun Hong
    PATTERN RECOGNITION LETTERS, 2025, 187 : 49 - 55
  • [38] CFA: Class-wise Calibrated Fair Adversarial Training
    Wei, Zeming
    Wang, Yifei
    Guo, Yiwen
    Wang, Yisen
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 8193 - 8201
  • [39] A Game Theoretic Approach to Class-wise Selective Rationalization
    Chang, Shiyu
    Zhang, Yang
    Yu, Mo
    Jaakkola, Tommi S.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [40] SAR Image Quality Assessment: From Sample-Wise to Class-Wise
    Yu, Ziyi
    Dong, Ganggang
    Liu, Hongwei
    REMOTE SENSING, 2023, 15 (08)