An instance-dependent simulation framework for learning with label noise

被引:0
|
作者
Keren Gu
Xander Masotto
Vandana Bachani
Balaji Lakshminarayanan
Jack Nikodem
Dong Yin
机构
[1] DeepMind,
[2] Google Research,undefined
[3] Brain Team,undefined
来源
Machine Learning | 2023年 / 112卷
关键词
Noisy labels; Simulation; Datasets; Rater features;
D O I
暂无
中图分类号
学科分类号
摘要
We propose a simulation framework for generating instance-dependent noisy labels via a pseudo-labeling paradigm. We show that the distribution of the synthetic noisy labels generated with our framework is closer to human labels compared to independent and class-conditional random flipping. Equipped with controllable label noise, we study the negative impact of noisy labels across a few practical settings to understand when label noise is more problematic. We also benchmark several existing algorithms for learning with noisy labels and compare their behavior on our synthetic datasets and on the datasets with independent random label noise. Additionally, with the availability of annotator information from our simulation framework, we propose a new technique, Label Quality Model (LQM), that leverages annotator features to predict and correct against noisy labels. We show that by adding LQM as a label correction step before applying existing noisy label techniques, we can further improve the models’ performance. The synthetic datasets that we generated in this work are released at https://github.com/deepmind/deepmind-research/tree/master/noisy_label.
引用
收藏
页码:1871 / 1896
页数:25
相关论文
共 50 条
  • [31] Optimistic PAC Reinforcement Learning: the Instance-Dependent View
    Tirinzoni, Andrea
    Al-Marjani, Aymen
    Kaufmann, Emilie
    Proceedings of Machine Learning Research, 2023, 201 : 1460 - 1480
  • [32] Beyond No Regret: Instance-Dependent PAC Reinforcement Learning
    Wagenmaker, Andrew
    Simchowitz, Max
    Jamieson, Kevin
    CONFERENCE ON LEARNING THEORY, VOL 178, 2022, 178 : 358 - 418
  • [33] Instance-Dependent Positive and Unlabeled Learning With Labeling Bias Estimation
    Gong, Chen
    Wang, Qizhou
    Liu, Tongliang
    Han, Bo
    You, Jane
    Yang, Jian
    Tao, Dacheng
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (08) : 4163 - 4177
  • [34] InstanT: Semi-supervised Learning with Instance-dependent Thresholds
    Li, Muyang
    Wu, Runze
    Liu, Haoyu
    Yu, Jun
    Yang, Xun
    Han, Bo
    Liu, Tongliang
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [35] A framework for non-interactive instance-dependent commitment schemes (NIC)
    Kapron, Bruce
    Malka, Lior
    Srinivasan, Venkatesh
    Theoretical Computer Science, 2015, 593 : 1 - 15
  • [36] A framework for non-interactive instance-dependent commitment schemes (NIC)
    Kapron, Bruce
    Malka, Lior
    Srinivasan, Venkatesh
    THEORETICAL COMPUTER SCIENCE, 2015, 593 : 1 - 15
  • [37] A framework for non-interactive instance-dependent commitment schemes (NIC)
    Kapron, Bruce
    Malka, Lior
    Srinivasan, Venkatesh
    Theoretical Computer Science, 2015, 593 : 1 - 15
  • [38] Candidate-aware Selective Disambiguation Based On Normalized Entropy for Instance-dependent Partial-label Learning
    He, Shuo
    Yang, Guowu
    Feng, Lei
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 1792 - 1801
  • [39] Typicality- and instance-dependent label noise-combating: a novel framework for simulating and combating real-world noisy labels for endoscopic polyp classification
    Gao, Yun
    Fu, Junhu
    Wang, Yuanyuan
    Guo, Yi
    VISUAL COMPUTING FOR INDUSTRY BIOMEDICINE AND ART, 2024, 7 (01)
  • [40] Instance-Dependent Multilabel Noise Generation for Multilabel Remote Sensing Image Classification
    Kim, Youngwook
    Kim, Sehwan
    Ro, Youngmin
    Lee, Jungwoo
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 17087 - 17098