Is BERT Robust to Label Noise? A Study on Learning with Noisy Labels in Text Classification

被引:0
|
作者
Zhu, Dawei [1 ]
Hedderich, Michael A. [1 ]
Zhai, Fangzhou [1 ]
Adelani, David Ifeoluwa [1 ]
Klakow, Dietrich [1 ]
机构
[1] Saarland Univ, Saarland Informat Campus, Homburg, Germany
关键词
D O I
暂无
中图分类号
F [经济];
学科分类号
02 ;
摘要
Incorrect labels in training data occur when human annotators make mistakes or when the data is generated via weak or distant supervision. It has been shown that complex noise-handling techniques - by modeling, cleaning or filtering the noisy instances - are required to prevent models from fitting this label noise. However, we show in this work that, for text classification tasks with modern NLP models like BERT, over a variety of noise types, existing noise-handling methods do not always improve its performance, and may even deteriorate it, suggesting the need for further investigation. We also back our observations with a comprehensive analysis.
引用
收藏
页码:62 / 67
页数:6
相关论文
共 50 条
  • [1] Towards Robust Learning with Noisy and Pseudo Labels for Text Classification
    Wen, Murtadha Ahmeda Bo
    Ao, Luo
    Pan, Shengfeng
    Su, Jianlin
    Cao, Xinxin
    Liu, Yunfeng
    INFORMATION SCIENCES, 2024, 661
  • [2] Classification on soft labels is robust against label noise
    Thiel, Christian
    KNOWLEDGE - BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 1, PROCEEDINGS, 2008, 5177 : 65 - 73
  • [3] Partial Label Learning with Noisy Labels
    Zhao, Pan
    Tang, Long
    Pan, Zhigeng
    Annals of Data Science, 2025, 12 (01) : 199 - 212
  • [4] Label Distribution for Learning with Noisy Labels
    Liu, Yun-Peng
    Xu, Ning
    Zhang, Yu
    Geng, Xin
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 2568 - 2574
  • [5] PNP: Robust Learning from Noisy Labels by Probabilistic Noise Prediction
    Sun, Zeren
    Shen, Fumin
    Huang, Dan
    Wang, Qiong
    Shu, Xiangbo
    Yao, Yazhou
    Tang, Jinhui
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 5301 - 5310
  • [6] Learning with Confident Examples: Rank Pruning for Robust Classification with Noisy Labels
    Northcutt, Curtis G.
    Wu, Tailin
    Chuang, Isaac L.
    CONFERENCE ON UNCERTAINTY IN ARTIFICIAL INTELLIGENCE (UAI2017), 2017,
  • [7] Spectral Label Refinement for Noisy and Missing Text Labels
    Song, Yangqiu
    Wang, Chenguang
    Zhang, Ming
    Sun, Hailong
    Yang, Qiang
    PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2015, : 2972 - 2978
  • [8] Distributionally Robust Federated Learning for Network Traffic Classification With Noisy Labels
    Shi, Siping
    Guo, Yingya
    Wang, Dan
    Zhu, Yifei
    Han, Zhu
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (05) : 6212 - 6226
  • [9] Robust Federated Learning With Noisy Labels
    Yang, Seunghan
    Park, Hyoungseob
    Byun, Junyoung
    Kim, Changick
    IEEE INTELLIGENT SYSTEMS, 2022, 37 (02) : 35 - 43
  • [10] Robust Collaborative Learning with Noisy Labels
    Sun, Mengying
    Xing, Jing
    Chen, Bin
    Zhou, Jiayu
    20TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2020), 2020, : 1274 - 1279