An improved sample selection framework for learning with noisy labels

被引:0
|
作者
Zhang, Qian [1 ]
Zhu, Yi [1 ]
Yang, Ming [2 ]
Jin, Ge [1 ]
Zhu, Yingwen [1 ]
Lu, Yanjun [1 ]
Zou, Yu [1 ,3 ]
Chen, Qiu [4 ]
机构
[1] Jiangsu Open Univ, Sch Informat Technol, Nanjing, Jiangsu, Peoples R China
[2] Nanjing Normal Univ, Sch Comp & Elect Informat, Nanjing, Jiangsu, Peoples R China
[3] Nanjing Univ Informat Sci & Technol, Sch Artificial Intelligence, Sch Future Technol, Nanjing, Jiangsu, Peoples R China
[4] Kogakuin Univ, Grad Sch Engn, Dept Elect Engn & Elect, Tokyo, Japan
来源
PLOS ONE | 2024年 / 19卷 / 12期
基金
中国国家自然科学基金;
关键词
D O I
10.1371/journal.pone.0309841
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Deep neural networks have powerful memory capabilities, yet they frequently suffer from overfitting to noisy labels, leading to a decline in classification and generalization performance. To address this issue, sample selection methods that filter out potentially clean labels have been proposed. However, there is a significant gap in size between the filtered, possibly clean subset and the unlabeled subset, which becomes particularly pronounced at high-noise rates. Consequently, this results in underutilizing label-free samples in sample selection methods, leaving room for performance improvement. This study introduces an enhanced sample selection framework with an oversampling strategy (SOS) to overcome this limitation. This framework leverages the valuable information contained in label-free instances to enhance model performance by combining an SOS with state-of-the-art sample selection methods. We validate the effectiveness of SOS through extensive experiments conducted on both synthetic noisy datasets and real-world datasets such as CIFAR, WebVision, and Clothing1M. The source code for SOS will be made available at https://github.com/LanXiaoPang613/SOS.
引用
收藏
页数:37
相关论文
共 50 条
  • [11] ANNE: Adaptive Nearest Neighbours and Eigenvector-based sample selection for robust learning with noisy labels
    Cordeiro, Filipe R.
    Carneiro, Gustavo
    PATTERN RECOGNITION, 2025, 159
  • [12] NCMatch: Semi-supervised Learning with Noisy Labels via Noisy Sample Filter and Contrastive Learning
    Sun, Yuanbo
    Gao, Can
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT VIII, 2024, 14432 : 15 - 27
  • [13] An joint end-to-end framework for learning with noisy labels
    Zhang, Qian
    Lee, Feifei
    Wang, Ya-gang
    Ding, Damin
    Yao, Wei
    Chen, Lu
    Chen, Qiu
    APPLIED SOFT COMPUTING, 2021, 108
  • [14] Learning with noisy labels via Mamba and entropy KNN framework
    Wang, Ningwei
    Jin, Weiqiang
    Jing, Shirou
    Bi, Haixia
    Yang, Guang
    APPLIED SOFT COMPUTING, 2025, 169
  • [15] RT2S: A Framework for Learning with Noisy Labels
    Bhattacharya, Indranil
    Ye, Ze
    Pavani, Kaushik
    Dasgupta, Sunny
    PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 5234 - 5235
  • [16] Sample Prior Guided Robust Model Learning to Suppress Noisy Labels
    Chen, Wenkai
    Zhu, Chuang
    Li, Mengting
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT II, 2023, 14170 : 3 - 19
  • [17] Sample-wise Label Confidence Incorporation for Learning with Noisy Labels
    Ahn, Chanho
    Kim, Kikyung
    Baek, Ji-won
    Lim, Jongin
    Han, Seungju
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 1823 - 1832
  • [18] "Learning with Noisy Labels Using Collaborative Sample Selection and Contrastive Semi-Supervised Learning" (vol 296 ,111860, 2024)
    Miao, Qing
    Wu, Xiaohe
    Xu, Chao
    Ji, Yanli
    Zuo, Wangmeng
    Guo, Yiwen
    Meng, Zhaopeng
    KNOWLEDGE-BASED SYSTEMS, 2024, 299
  • [19] Combating Noisy Labels with Sample Selection by Mining High-Discrepancy Examples
    Xia, Xiaobo
    Han, Bo
    Zhan, Yibing
    Yu, Jun
    Gong, Mingming
    Gong, Chen
    Liu, Tongliang
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 1833 - 1843
  • [20] A deep learning framework to classify breast density with noisy labels regularization
    Lopez-Almazan, Hector
    Perez-Benito, Francisco Javier
    Larroza, Andres
    Perez-Cortes, Juan-Carlos
    Pollan, Marina
    Perez-Gomez, Beatriz
    Trejo, Dolores Salas
    Casals, Maria
    Llobet, Rafael
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2022, 221