An instance-dependent simulation framework for learning with label noise

Authors
Keren Gu
Xander Masotto
Vandana Bachani
Balaji Lakshminarayanan
Jack Nikodem
Dong Yin
Affiliations
[1] DeepMind
[2] Google Research
[3] Brain Team
Source
Machine Learning | 2023 / Volume 112
Keywords
Noisy labels; Simulation; Datasets; Rater features
Abstract
We propose a simulation framework for generating instance-dependent noisy labels via a pseudo-labeling paradigm. We show that the distribution of the synthetic noisy labels generated with our framework is closer to human labels compared to independent and class-conditional random flipping. Equipped with controllable label noise, we study the negative impact of noisy labels across a few practical settings to understand when label noise is more problematic. We also benchmark several existing algorithms for learning with noisy labels and compare their behavior on our synthetic datasets and on the datasets with independent random label noise. Additionally, with the availability of annotator information from our simulation framework, we propose a new technique, Label Quality Model (LQM), that leverages annotator features to predict and correct against noisy labels. We show that by adding LQM as a label correction step before applying existing noisy label techniques, we can further improve the models’ performance. The synthetic datasets that we generated in this work are released at https://github.com/deepmind/deepmind-research/tree/master/noisy_label.
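The three kinds of label noise contrasted in the abstract can be sketched roughly as follows. This is only an illustration, not the authors' framework: the function names, the transition matrix, and the per-example probability array `probs` are all hypothetical, and the instance-dependent case simply assumes that some pseudo-labeling model has already produced a class distribution for each example.

```python
import numpy as np


def independent_flip(labels, num_classes, noise_rate, rng):
    """Independent noise: flip each label with probability noise_rate
    to a uniformly chosen different class."""
    labels = np.asarray(labels)
    flip = rng.random(labels.shape[0]) < noise_rate
    # Offsets in 1..num_classes-1 guarantee the flipped class differs.
    offsets = rng.integers(1, num_classes, size=labels.shape[0])
    return np.where(flip, (labels + offsets) % num_classes, labels)


def class_conditional_flip(labels, transition, rng):
    """Class-conditional noise: sample the noisy label from row
    transition[y] of a row-stochastic matrix (hypothetical input)."""
    labels = np.asarray(labels)
    transition = np.asarray(transition, dtype=float)
    num_classes = transition.shape[1]
    return np.array([rng.choice(num_classes, p=transition[y]) for y in labels])


def instance_dependent_sample(probs, rng):
    """Instance-dependent noise (assumed form): sample one label per
    example from that example's own class distribution, e.g. the
    softmax output of a pseudo-labeling model."""
    probs = np.asarray(probs, dtype=float)
    num_classes = probs.shape[1]
    return np.array([rng.choice(num_classes, p=p) for p in probs])
```

In the first two functions the chance of a wrong label depends at most on the true class, whereas in the third it depends on the example itself through `probs`, which is the property the abstract refers to as instance-dependent.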
Pages: 1871 - 1896
Page count: 25
Related papers
50 in total
  • [1] An instance-dependent simulation framework for learning with label noise
    Gu, Keren
    Masotto, Xander
    Bachani, Vandana
    Lakshminarayanan, Balaji
    Nikodem, Jack
    Yin, Dong
    MACHINE LEARNING, 2023, 112 (06) : 1871 - 1896
  • [2] Instance-dependent Label Distribution Estimation for Learning with Label Noise
    Liao, Zehui
    Hu, Shishuai
    Xie, Yutong
    Xia, Yong
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, : 2568 - 2580
  • [3] Instance-Dependent Partial Label Learning
    Xu, Ning
    Qiao, Congyu
    Geng, Xin
    Zhang, Min-Ling
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [4] Fair Classification with Instance-dependent Label Noise
    Wu, Songhua
    Gong, Mingming
    Han, Bo
    Liu, Yang
    Liu, Tongliang
    CONFERENCE ON CAUSAL LEARNING AND REASONING, VOL 177, 2022, 177
  • [5] A Parametrical Model for Instance-Dependent Label Noise
    Yang, Shuo
    Wu, Songhua
    Yang, Erkun
    Han, Bo
    Liu, Yang
    Xu, Min
    Niu, Gang
    Liu, Tongliang
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (12) : 14055 - 14068
  • [6] Part-dependent Label Noise: Towards Instance-dependent Label Noise
    Xia, Xiaobo
    Liu, Tongliang
    Han, Bo
    Wang, Nannan
    Gong, Mingming
    Liu, Haifeng
    Niu, Gang
    Tao, Dacheng
    Sugiyama, Masashi
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [7] A Second-Order Approach to Learning with Instance-Dependent Label Noise
    Zhu, Zhaowei
    Liu, Tongliang
    Liu, Yang
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 10108 - 10118
  • [8] Instance-Dependent Inaccurate Label Distribution Learning
    Kou, Zhiqiang
    Wang, Jing
    Jia, Yuheng
    Liu, Biao
    Geng, Xin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (01) : 1425 - 1437
  • [9] Confidence Scores Make Instance-dependent Label-noise Learning Possible
    Berthon, Antonin
    Han, Bo
    Niu, Gang
    Liu, Tongliang
    Sugiyama, Masashi
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [10] Instance-Dependent Label-Noise Learning under Structural Causal Models
    Yao, Yu
    Liu, Tongliang
    Gong, Mingming
    Han, Bo
    Niu, Gang
    Zhang, Kun
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34