An improved sample selection framework for learning with noisy labels

Cited by: 0
Authors
Zhang, Qian [1]
Zhu, Yi [1]
Yang, Ming [2]
Jin, Ge [1]
Zhu, Yingwen [1]
Lu, Yanjun [1]
Zou, Yu [1, 3]
Chen, Qiu [4]
Affiliations
[1] Jiangsu Open Univ, Sch Informat Technol, Nanjing, Jiangsu, Peoples R China
[2] Nanjing Normal Univ, Sch Comp & Elect Informat, Nanjing, Jiangsu, Peoples R China
[3] Nanjing Univ Informat Sci & Technol, Sch Artificial Intelligence, Sch Future Technol, Nanjing, Jiangsu, Peoples R China
[4] Kogakuin Univ, Grad Sch Engn, Dept Elect Engn & Elect, Tokyo, Japan
Source
PLOS ONE | 2024 / Volume 19 / Issue 12
Funding
National Natural Science Foundation of China
Keywords
DOI
10.1371/journal.pone.0309841
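Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification code
07; 0710; 09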
Abstract
Deep neural networks have a strong capacity for memorization, so they readily overfit to noisy labels, which degrades classification and generalization performance. To address this issue, sample selection methods that filter the training set for samples with probably clean labels have been proposed. However, the filtered, probably clean subset and the remaining unlabeled subset differ greatly in size, and the gap becomes particularly pronounced at high noise rates. As a result, sample selection methods underutilize label-free samples, leaving room for performance improvement. This study introduces an enhanced sample selection framework with an oversampling strategy (SOS) to overcome this limitation. By combining SOS with state-of-the-art sample selection methods, the framework leverages the information contained in label-free instances to improve model performance. We validate the effectiveness of SOS through extensive experiments on both synthetic noisy datasets and real-world datasets such as CIFAR, WebVision, and Clothing1M. The source code for SOS will be made available at https://github.com/LanXiaoPang613/SOS.
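The size gap described in the abstract can be illustrated with a toy sketch. The abstract does not say how SOS oversamples or which subset it oversamples, so everything below is an assumption made for illustration, not the authors' method: a small-loss threshold stands in for the sample selection step, and the smaller, probably clean subset is resampled with replacement until it matches the unlabeled subset in size. The function names `split_by_loss` and `oversample`, the threshold, and the loss values are all hypothetical.

```python
import random


def split_by_loss(losses, threshold):
    """Small-loss criterion (an assumed stand-in for sample selection):
    indices with loss below the threshold are treated as probably clean,
    the rest are treated as unlabeled (their labels would be discarded)."""
    clean = [i for i, loss in enumerate(losses) if loss < threshold]
    unlabeled = [i for i, loss in enumerate(losses) if loss >= threshold]
    return clean, unlabeled


def oversample(indices, target_size, rng):
    """Keep every original index and draw extras with replacement
    until the list holds target_size entries."""
    if not indices or len(indices) >= target_size:
        return list(indices)
    extra = rng.choices(indices, k=target_size - len(indices))
    return list(indices) + extra


# Hypothetical per-sample losses at a high noise rate: few samples look clean.
rng = random.Random(0)
losses = [0.1, 2.3, 1.9, 0.2, 2.8, 3.1, 2.2, 2.6]

clean, unlabeled = split_by_loss(losses, threshold=1.0)
clean_balanced = oversample(clean, len(unlabeled), rng)

print(len(clean), len(unlabeled), len(clean_balanced))
```

With the two index lists balanced, a semi-supervised training loop could pair one labeled and one unlabeled sample per step without exhausting the clean subset early. Whether SOS balances the subsets in this way is not stated in the abstract.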
Pages: 37
Related papers
50 in total
  • [41] FINE Samples for Learning with Noisy Labels
    Kim, Taehyeon
    Ko, Jongwoo
    Cho, Sangwook
    Choi, Jinhwan
    Yun, Se-Young
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [42] Learning from Noisy Labels with Distillation
    Li, Yuncheng
    Yang, Jianchao
    Song, Yale
    Cao, Liangliang
    Luo, Jiebo
    Li, Li-Jia
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 1928 - 1936
  • [43] Progressive Stochastic Learning for Noisy Labels
    Han, Bo
    Tsang, Ivor W.
    Chen, Ling
    Yu, Celina P.
    Fung, Sai-Fu
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (10) : 5136 - 5148
  • [44] Label Distribution for Learning with Noisy Labels
    Liu, Yun-Peng
    Xu, Ning
    Zhang, Yu
    Geng, Xin
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 2568 - 2574
  • [45] A Serial Sample Selection Framework for Active Learning
    Li, Chengchao
    Zhao, Pengpeng
    Wu, Jian
    Xu, Haihui
    Cui, Zhiming
    ADVANCED DATA MINING AND APPLICATIONS, ADMA 2014, 2014, 8933 : 435 - 446
  • [46] A serial sample selection framework for active learning
    Li, Chengchao
    Zhao, Pengpeng
    Wu, Jian
    Xu, Haihui
    Cui, Zhiming
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, 8933 : 435 - 446
  • [47] Intelligent agent for hyperspectral image classification with noisy labels: a deep reinforcement learning framework
    Fang, Chunhua
    Zhang, Guifeng
    Li, Jia
    Li, Xinping
    Chen, Tengfei
    Zhao, Lin
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2024, 45 (09) : 2939 - 2964
  • [48] A two-stage denoising framework for zero-shot learning with noisy labels
    Tang, Long
    Zhao, Pan
    Pan, Zhigeng
    Duan, Xingxing
    Pardalos, Panos M.
    INFORMATION SCIENCES, 2024, 654
  • [49] Learning from Multiple Annotator Noisy Labels via Sample-Wise Label Fusion
    Gao, Zhengqi
    Sun, Fan-Keng
    Yang, Mingran
    Ren, Sucheng
    Xiong, Zikai
    Engeler, Marc
    Burazer, Antonio
    Wildling, Linda
    Daniel, Luca
    Boning, Duane S.
    COMPUTER VISION, ECCV 2022, PT XXIV, 2022, 13684 : 407 - 422
  • [50] Limited Gradient Descent: Learning With Noisy Labels
    Sun, Yi
    Tian, Yan
    Xu, Yiping
    Li, Jianxiang
    IEEE ACCESS, 2019, 7 : 168296 - 168306