An instance-dependent simulation framework for learning with label noise

被引:0
|
作者
Keren Gu
Xander Masotto
Vandana Bachani
Balaji Lakshminarayanan
Jack Nikodem
Dong Yin
机构
[1] DeepMind,
[2] Google Research,undefined
[3] Brain Team,undefined
来源
Machine Learning | 2023年 / 112卷
关键词
Noisy labels; Simulation; Datasets; Rater features;
D O I
暂无
中图分类号
学科分类号
摘要
We propose a simulation framework for generating instance-dependent noisy labels via a pseudo-labeling paradigm. We show that the distribution of the synthetic noisy labels generated with our framework is closer to human labels compared to independent and class-conditional random flipping. Equipped with controllable label noise, we study the negative impact of noisy labels across a few practical settings to understand when label noise is more problematic. We also benchmark several existing algorithms for learning with noisy labels and compare their behavior on our synthetic datasets and on the datasets with independent random label noise. Additionally, with the availability of annotator information from our simulation framework, we propose a new technique, Label Quality Model (LQM), that leverages annotator features to predict and correct against noisy labels. We show that by adding LQM as a label correction step before applying existing noisy label techniques, we can further improve the models’ performance. The synthetic datasets that we generated in this work are released at https://github.com/deepmind/deepmind-research/tree/master/noisy_label.
引用
收藏
页码:1871 / 1896
页数:25
相关论文
共 50 条
  • [21] Label Correction of Crowdsourced Noisy Annotations with an Instance-Dependent Noise Transition Model
    Guo, Hui
    Wang, Boyu
    Yi, Grace Y.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [22] Instance-Dependent Noisy-Label Learning with Graphical Model Based Noise-Rate Estimation
    Garg, Arpit
    Cuong Nguyen
    Felix, Rafael
    Thanh-Toan Do
    Carneiro, Gustavo
    COMPUTER VISION-ECCV 2024, PT IV, 2025, 15062 : 372 - 389
  • [23] Towards instance-dependent label noise-tolerant classification: a probabilistic approach
    Jakramate Bootkrajang
    Jeerayut Chaijaruwanich
    Pattern Analysis and Applications, 2020, 23 : 95 - 111
  • [24] Robust Product Classification with Instance-Dependent Noise
    Huy Nguyen
    Khatwani, Devashish
    PROCEEDINGS OF THE 5TH WORKSHOP ON E-COMMERCE AND NLP (ECNLP 5), 2022, : 171 - 180
  • [25] Beyond Class-Conditional Assumption: A Primary Attempt to Combat Instance-Dependent Label Noise
    Chen, Pengfei
    Ye, Junjie
    Chen, Guangyong
    Zhao, Jingwei
    Heng, Pheng-Ann
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 11442 - 11450
  • [26] Learning with Bounded Instance- and Label-dependent Label Noise
    Cheng, Jiacheng
    Liu, Tongliang
    Ramamohanarao, Kotagiri
    Tao, Dacheng
    25TH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 2019), 2019,
  • [27] Learning with Bounded Instance- and Label-dependent Label Noise
    Cheng, Jiacheng
    Liu, Tongliang
    Ramamohanarao, Kotagiri
    Tao, Dacheng
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
  • [28] Is Temporal Difference Learning Optimal? An Instance-Dependent Analysis
    Khamaru, Koulik
    Pananjady, Ashwin
    Ruan, Feng
    Wainwright, Martin J.
    Jordan, Michael, I
    SIAM JOURNAL ON MATHEMATICS OF DATA SCIENCE, 2021, 3 (04): : 1013 - 1040
  • [29] Optimistic PAC Reinforcement Learning: the Instance-Dependent View
    Tirinzoni, Andrea
    Al-Marjani, Aymen
    Kaufmann, Emilie
    INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 201, 2023, 201 : 1460 - 1480
  • [30] Instance-dependent cost-sensitive parametric learning
    C-Rella, Jorge
    Claeskens, Gerda
    Cao, Ricardo
    Vilar, Juan M.
    NEUROCOMPUTING, 2025, 615