Scalable Penalized Regression for Noise Detection in Learning with Noisy Labels

Cited by: 25
Authors
Wang, Yikai [1]
Sun, Xinwei [1]
Fu, Yanwei [1]
Affiliations
[1] Fudan Univ, Sch Data Sci, Shanghai, Peoples R China
Keywords
CONSISTENCY;
DOI
10.1109/CVPR52688.2022.00044
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
A noisy training set usually degrades the generalization and robustness of neural networks. In this paper, we propose a theoretically guaranteed noisy-label detection framework that detects and removes noisy data for Learning with Noisy Labels (LNL). Specifically, we design a penalized regression that models the linear relation between network features and one-hot labels, where the noisy data are identified by the non-zero mean-shift parameters solved in the regression model. To make the framework scalable to datasets that contain a large number of categories and training samples, we propose a split algorithm that divides the whole training set into small pieces that can be solved by the penalized regression in parallel, leading to the Scalable Penalized Regression (SPR) framework. We provide a non-asymptotic probabilistic condition under which SPR correctly identifies the noisy data. While SPR can be regarded as a sample selection module for a standard supervised training pipeline, we also combine it with a semi-supervised algorithm to further exploit the support of the noisy data as unlabeled data. Experimental results on several benchmark datasets and real-world noisy datasets show the effectiveness of our framework.
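The mean-shift idea in the abstract can be illustrated with a small sketch. It assumes a linear model Y = XB + G + noise, where each row of G is a per-sample mean-shift vector, and it minimizes 0.5 * ||Y - XB - G||_F^2 + lam * sum_i ||G_i||_2 by simple alternating updates. The function names (`detect_noisy`, `detect_noisy_split`), the row-wise penalty, the solver, the random split, and the synthetic data are all assumptions made for illustration; the abstract does not specify the authors' penalty, solver, or split rule, so this is not their implementation.

```python
import numpy as np

def detect_noisy(X, Y, lam, n_iter=200):
    """Flag samples whose mean-shift row G_i is non-zero (hypothetical helper)."""
    G = np.zeros_like(Y, dtype=float)
    X_pinv = np.linalg.pinv(X)              # reused for every least-squares step
    for _ in range(n_iter):
        B = X_pinv @ (Y - G)                # least-squares fit on shift-corrected labels
        R = Y - X @ B                       # residuals before the shift is applied
        norms = np.linalg.norm(R, axis=1, keepdims=True)
        # Row-wise soft-thresholding: rows with residual norm <= lam become exactly zero.
        G = np.maximum(0.0, 1.0 - lam / np.maximum(norms, 1e-12)) * R
    return np.flatnonzero(np.linalg.norm(G, axis=1) > 0)

def detect_noisy_split(X, Y, lam, n_pieces, seed=0):
    """Run the detector independently on random pieces (a simplified stand-in
    for a split algorithm; each piece could be solved in parallel)."""
    idx = np.random.default_rng(seed).permutation(len(X))
    found = [piece[detect_noisy(X[piece], Y[piece], lam)]
             for piece in np.array_split(idx, n_pieces)]
    return np.sort(np.concatenate(found))

# Synthetic example: 4 balanced classes, 10-dimensional features, 10 flipped labels.
rng = np.random.default_rng(0)
n_classes, n_per_class, dim = 4, 50, 10
true_labels = np.repeat(np.arange(n_classes), n_per_class)
class_means = rng.normal(size=(n_classes, dim))
X = class_means[true_labels] + 0.01 * rng.normal(size=(true_labels.size, dim))

flipped = np.sort(rng.choice(true_labels.size, size=10, replace=False))
noisy_labels = true_labels.copy()
noisy_labels[flipped] = (noisy_labels[flipped] + 1) % n_classes
Y = np.eye(n_classes)[noisy_labels]         # one-hot encoding of the observed labels

detected = detect_noisy(X, Y, lam=0.5)
detected_split = detect_noisy_split(X, Y, lam=0.5, n_pieces=4)
print("flipped: ", flipped)
print("detected:", detected)
```

The threshold `lam` plays the role of the penalty weight: a larger value flags fewer samples. In this toy setup a flipped one-hot label leaves a residual of norm close to sqrt(2), while clean samples leave a much smaller one, which is why a mid-range threshold separates them.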
Pages: 346 - 355
Number of pages: 10
Related papers
50 in total
  • [1] Learning with noisy labels for robust fatigue detection
    Wang, Mei
    Hu, Ruimin
    Zhu, Xiaojie
    Zhu, Dongliang
    Wang, Xiaochen
    KNOWLEDGE-BASED SYSTEMS, 2024, 300
  • [2] FedDiv: Collaborative Noise Filtering for Federated Learning with Noisy Labels
    Li, Jichang
    Li, Guanbin
    Cheng, Hui
    Liao, Zicheng
    Yu, Yizhou
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 4, 2024, : 3118 - 3126
  • [3] Beyond Synthetic Noise: Deep Learning on Controlled Noisy Labels
    Jiang, Lu
    Huang, Di
    Liu, Mason
    Yang, Weilong
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
  • [4] Gradient optimization for object detection in learning with noisy labels
    Qiangqiang Xia
    Chunyan Hu
    Feifei Lee
    Qiu Chen
    Applied Intelligence, 2024, 54 : 4248 - 4259
  • [5] Deep Learning With Noisy Labels for Spatiotemporal Drought Detection
    Cortes-Andres, Jordi
    Fernandez-Torres, Miguel-Angel
    Camps-Valls, Gustau
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [6] Gradient optimization for object detection in learning with noisy labels
    Xia, Qiangqiang
    Hu, Chunyan
    Lee, Feifei
    Chen, Qiu
    APPLIED INTELLIGENCE, 2024, 54 (05) : 4248 - 4259
  • [7] Probabilistic End-to-end Noise Correction for Learning with Noisy Labels
    Yi, Kun
    Wu, Jianxin
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 7010 - 7018
  • [8] Hierarchical Noise-Tolerant Meta-Learning With Noisy Labels
    Liu, Yahui
    Wang, Jian
    Yang, Yuntai
    Wang, Renlong
    Wang, Simiao
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 3020 - 3024
  • [9] PNP: Robust Learning from Noisy Labels by Probabilistic Noise Prediction
    Sun, Zeren
    Shen, Fumin
    Huang, Dan
    Wang, Qiong
    Shu, Xiangbo
    Yao, Yazhou
    Tang, Jinhui
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 5301 - 5310
  • [10] An improved noise loss correction algorithm for learning from noisy labels
    Zhang, Qian
    Lee, Feifei
    Wang, Ya-gang
    Miao, Ran
    Chen, Lei
    Chen, Qiu
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2020, 72