Distantly-Supervised Named Entity Recognition with Noise-Robust Learning and Language Model Augmented Self-Training

被引:0
|
作者
Meng, Yu [1 ]
Zhang, Yunyi [1 ]
Huang, Jiaxin [1 ]
Wang, Xuan [1 ]
Zhang, Yu [1 ]
Ji, Heng [1 ]
Han, Jiawei [1 ]
机构
[1] Univ Illinois, Champaign, IL 61820 USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We study the problem of training named entity recognition (NER) models using only distantly-labeled data, which can be automatically obtained by matching entity mentions in the raw text with entity types in a knowledge base. The biggest challenge of distantlysupervised NER is that the distant supervision may induce incomplete and noisy labels, rendering the straightforward application of supervised learning ineffective. In this paper, we propose (1) a noise-robust learning scheme comprised of a new loss function and a noisy label removal step, for training NER models on distantly-labeled data, and (2) a self-training method that uses contextualized augmentations created by pre-trained language models to improve the generalization ability of the NER model. On three benchmark datasets, our method achieves superior performance, outperforming existing distantlysupervised NER models by significant margins(1).
引用
收藏
页码:10367 / 10378
页数:12
相关论文
共 50 条
  • [1] Noise-Robust Training with Dynamic Loss and Contrastive Learning for Distantly-Supervised Named Entity Recognition
    Ma, Zhiyuan
    Du, Jintao
    Zhou, Shuheng
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 10119 - 10128
  • [2] A Class-Rebalancing Self-Training Framework for Distantly-Supervised Named Entity Recognition
    Li, Qi
    Xie, Tingyu
    Peng, Peng
    Wang, Hongwei
    Wang, Gaoang
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 11054 - 11068
  • [3] Improving Distantly-Supervised Named Entity Recognition with Self-Collaborative Denoising Learning
    Zhang, Xinghua
    Yu, Bowen
    Liu, Tingwen
    Zhang, Zhenyu
    Sheng, Jiawei
    Xue Mengge
    Xu, Hongbo
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 10746 - 10757
  • [4] Improving Distantly-Supervised Named Entity Recognition with Self-Collaborative Denoising Learning
    Zhang, Xinghua
    Yu, Bowen
    Liu, Tingwen
    Zhang, Zhenyu
    Sheng, Jiawei
    Xue, Mengge
    Xu, Hongbo
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 1518 - 1529
  • [5] SANTA: Separate Strategies for Inaccurate and Incomplete Annotation Noise in Distantly-Supervised Named Entity Recognition
    Si, Shuzheng
    Cai, Zefan
    Zeng, Shuang
    Feng, Guoqiang
    Lin, Jiaxing
    Chang, Baobao
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, 2023, : 3883 - 3896
  • [6] Distantly-Supervised Named Entity Recognition with Adaptive Teacher Learning and Fine-Grained Student Ensemble
    Qu, Xiaoye
    Zeng, Jun
    Liu, Daizong
    Wang, Zhefeng
    Huai, Baoxing
    Zhou, Pan
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 11, 2023, : 13501 - 13509
  • [7] Chinese named entity recognition combined active learning with self-training
    Zhong, Zhinong, 1600, National University of Defense Technology (36):
  • [8] DISTALANER: Distantly Supervised Active Learning Augmented Named Entity Recognition in the Open Source Software Ecosystem
    Banerjee, Somnath
    Dutta, Avik
    Agrawal, Aaditya
    Hazra, Rima
    Mukherjee, Animesh
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES-APPLIED DATA SCIENCE TRACK, PT X, ECML PKDD 2024, 2024, 14950 : 313 - 331
  • [9] Noise-Robust Semi-Supervised Learning for Distantly Supervised Relation Extraction
    Sun, Xin
    Liu, Qiang
    Wu, Shu
    Wang, Zilei
    Wang, Liang
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 13145 - 13157
  • [10] Reinforcement learning based distantly supervised biomedical named entity recognition
    Bali, Manish
    Anandaraj, S. P.
    INTELLIGENT DECISION TECHNOLOGIES-NETHERLANDS, 2023, 17 (02): : 317 - 330