Scalable Penalized Regression for Noise Detection in Learning with Noisy Labels

Cited by: 25

Authors:

Wang, Yikai [1]

Sun, Xinwei [1]

Fu, Yanwei [1]

Affiliation:

[1] Fudan Univ, Sch Data Sci, Shanghai, Peoples R China

Source:

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022) | 2022

Keywords:

CONSISTENCY;

DOI:

10.1109/CVPR52688.2022.00044

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

A noisy training set usually degrades the generalization and robustness of neural networks. In this paper, we propose a theoretically guaranteed noisy label detection framework to detect and remove noisy data for Learning with Noisy Labels (LNL). Specifically, we design a penalized regression to model the linear relation between network features and one-hot labels, where noisy data are identified by the non-zero mean-shift parameters solved in the regression model. To make the framework scalable to datasets with a large number of categories and training samples, we propose a split algorithm that divides the whole training set into small pieces that can be solved by the penalized regression in parallel, leading to the Scalable Penalized Regression (SPR) framework. We provide a non-asymptotic probabilistic condition under which SPR correctly identifies the noisy data. While SPR can be regarded as a sample selection module for a standard supervised training pipeline, we also combine it with a semi-supervised algorithm to further exploit the support of noisy data as unlabeled data. Experimental results on several benchmark datasets and real-world noisy datasets show the effectiveness of our framework.
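The mean-shift idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes the objective 0.5 * ||Y - X @ beta - gamma||_F^2 + lam * sum_i ||gamma_i||_2, minimizes it with a simple alternating scheme (least squares for beta, row-wise group soft-thresholding for gamma), and uses contiguous chunks as a hypothetical stand-in for the split algorithm. The function names, the penalty weight `lam`, and the toy data are all assumptions made for illustration.

```python
import numpy as np

def detect_noisy_rows(X, Y, lam=0.5, n_iter=50):
    """Alternately minimize
        0.5 * ||Y - X @ beta - gamma||_F^2 + lam * sum_i ||gamma_i||_2
    and return a boolean mask that is True where the mean-shift row
    gamma_i is non-zero (the sample is flagged as noisy)."""
    gamma = np.zeros_like(Y)
    for _ in range(n_iter):
        # beta step: ordinary least squares on the shift-corrected labels
        beta, *_ = np.linalg.lstsq(X, Y - gamma, rcond=None)
        resid = Y - X @ beta
        # gamma step: group soft-thresholding of each residual row
        norms = np.linalg.norm(resid, axis=1, keepdims=True)
        scale = np.maximum(0.0, 1.0 - lam / np.maximum(norms, 1e-12))
        gamma = scale * resid
    return np.linalg.norm(gamma, axis=1) > 0

def split_and_detect(X, Y, n_pieces=3, lam=0.5):
    """Hypothetical split: contiguous chunks solved independently
    (sequentially here, but each chunk could run in parallel)."""
    mask = np.zeros(len(X), dtype=bool)
    for idx in np.array_split(np.arange(len(X)), n_pieces):
        mask[idx] = detect_noisy_rows(X[idx], Y[idx], lam=lam)
    return mask

# Toy data (assumed): features are noisy copies of the true one-hot class,
# and every 10th label is flipped to the next class.
rng = np.random.default_rng(0)
n, c = 300, 3
true = np.arange(n) % c
X = np.eye(c)[true] + 0.02 * rng.standard_normal((n, c))
observed = true.copy()
is_noisy = np.arange(n) % 10 == 0
observed[is_noisy] = (true[is_noisy] + 1) % c
Y = np.eye(c)[observed]

flagged = split_and_detect(X, Y)
```

In this toy setting the flipped rows leave a large residual that survives the threshold, while clean rows are shrunk to exactly zero, so `flagged` can be compared against `is_noisy`. The choice of `lam` is arbitrary here; in practice it would need to be tuned or selected along a regularization path.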


Pages: 346-355

Page count: 10

50 entries in total

[21] Learning with Neighbor Consistency for Noisy Labels
Iscen, Ahmet
Valmadre, Jack
Arnab, Anurag
Schmid, Cordelia
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 4662 - 4671
[22] To Aggregate or Not? Learning with Separate Noisy Labels
Wei, Jiaheng
Zhu, Zhaowei
Luo, Tianyi
Amid, Ehsan
Kumar, Abhishek
Liu, Yang
PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 2523 - 2535
[23] DEEP LEARNING CLASSIFICATION WITH NOISY LABELS
Sanchez, Guillaume
Guis, Vincente
Marxer, Ricard
Bouchara, Frederic
2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (ICMEW), 2020,
[24] Twin Contrastive Learning with Noisy Labels
Huang, Zhizhong
Zhang, Junping
Shan, Hongming
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 11661 - 11670
[25] Iterative Cross Learning on Noisy Labels
Yuan, Bodi
Chen, Jianyu
Zhang, Weidong
Tai, Hung-Shuo
McMains, Sara
2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, : 757 - 765
[26] Robust Federated Learning With Noisy Labels
Yang, Seunghan
Park, Hyoungseob
Byun, Junyoung
Kim, Changick
IEEE INTELLIGENT SYSTEMS, 2022, 37 (02) : 35 - 43
[27] Robust Collaborative Learning with Noisy Labels
Sun, Mengying
Xing, Jing
Chen, Bin
Zhou, Jiayu
20TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2020), 2020, : 1274 - 1279
[28] NLNL: Negative Learning for Noisy Labels
Kim, Youngdong
Yim, Junho
Yun, Juseung
Kim, Junmo
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 101 - 110
[29] Compressing Features for Learning With Noisy Labels
Chen, Yingyi
Hu, Shell Xu
Shen, Xi
Ai, Chunrong
Suykens, Johan A. K.
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (02) : 2124 - 2138
[30] Learning from Noisy Labels with Distillation
Li, Yuncheng
Yang, Jianchao
Song, Yale
Cao, Liangliang
Luo, Jiebo
Li, Li-Jia
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 1928 - 1936
