Scalable Penalized Regression for Noise Detection in Learning with Noisy Labels

被引:25
|
作者
Wang, Yikai [1 ]
Sun, Xinwei [1 ]
Fu, Yanwei [1 ]
机构
[1] Fudan Univ, Sch Data Sci, Shanghai, Peoples R China
关键词
CONSISTENCY;
D O I
10.1109/CVPR52688.2022.00044
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Noisy training set usually leads to the degradation of generalization and robustness of neural networks. In this paper, we propose using a theoretically guaranteed noisy label detection framework to detect and remove noisy data for Learning with Noisy Labels (LNL). Specifically, we design a penalized regression to model the linear relation between network features and one-hot labels, where the noisy data are identified by the non-zero mean shift parameters solved in the regression model. To make the framework scalable to datasets that contain a large number of categories and training data, we propose a split algorithm to divide the whole training set into small pieces that can be solved by the penalized regression in parallel, leading to the Scalable Penalized Regression (SPR) framework. We provide the non-asymptotic probabilistic condition for SPR to correctly identify the noisy data. While SPR can be regarded as a sample selection module for standard supervised training pipeline, we further combine it with semi-supervised algorithm to further exploit the support of noisy data as unlabeled data. Experimental results on several benchmark datasets and real-world noisy datasets show the effectiveness of our framework.
引用
收藏
页码:346 / 355
页数:10
相关论文
共 50 条
  • [21] Learning with Neighbor Consistency for Noisy Labels
    Iscen, Ahmet
    Valmadre, Jack
    Arnab, Anurag
    Schmid, Cordelia
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 4662 - 4671
  • [22] To Aggregate or Not? Learning with Separate Noisy Labels
    Wei, Jiaheng
    Zhu, Zhaowei
    Luo, Tianyi
    Amid, Ehsan
    Kumar, Abhishek
    Liu, Yang
    PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 2523 - 2535
  • [23] DEEP LEARNING CLASSIFICATION WITH NOISY LABELS
    Sanchez, Guillaume
    Guis, Vincente
    Marxer, Ricard
    Bouchara, Frederic
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (ICMEW), 2020,
  • [24] Twin Contrastive Learning with Noisy Labels
    Huang, Zhizhong
    Zhang, Junping
    Shan, Hongming
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 11661 - 11670
  • [25] Iterative Cross Learning on Noisy Labels
    Yuan, Bodi
    Chen, Jianyu
    Zhang, Weidong
    Tai, Hung-Shuo
    McMains, Sara
    2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, : 757 - 765
  • [26] Robust Federated Learning With Noisy Labels
    Yang, Seunghan
    Park, Hyoungseob
    Byun, Junyoung
    Kim, Changick
    IEEE INTELLIGENT SYSTEMS, 2022, 37 (02) : 35 - 43
  • [27] Robust Collaborative Learning with Noisy Labels
    Sun, Mengying
    Xing, Jing
    Chen, Bin
    Zhou, Jiayu
    20TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2020), 2020, : 1274 - 1279
  • [28] NLNL: Negative Learning for Noisy Labels
    Kim, Youngdong
    Yim, Junho
    Yun, Juseung
    Kim, Junmo
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 101 - 110
  • [29] Compressing Features for Learning With Noisy Labels
    Chen, Yingyi
    Hu, Shell Xu
    Shen, Xi
    Ai, Chunrong
    Suykens, Johan A. K.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (02) : 2124 - 2138
  • [30] Learning from Noisy Labels with Distillation
    Li, Yuncheng
    Yang, Jianchao
    Song, Yale
    Cao, Liangliang
    Luo, Jiebo
    Li, Li-Jia
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 1928 - 1936