A Class-Rebalancing Self-Training Framework for Distantly-Supervised Named Entity Recognition

被引:0
|
作者
Li, Qi [1 ,2 ]
Xie, Tingyu [1 ,2 ]
Peng, Peng [2 ]
Wang, Hongwei [1 ,2 ]
Wang, Gaoang [1 ,2 ]
机构
[1] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou, Zhejiang, Peoples R China
[2] Zhejiang Univ, ZJU UIUC Inst, Hangzhou, Zhejiang, Peoples R China
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Distant supervision reduces the reliance on human annotation in the named entity recognition tasks. The class-level imbalanced distant annotation is a realistic and unexplored problem, and the popular method of self-training can not handle class-level imbalanced learning. More importantly, self-training is dominated by the high-performance class in selecting candidates, and deteriorates the low-performance class with the bias of generated pseudo label. To address the class-level imbalance performance, we propose a class-rebalancing self-training framework for improving the distantly-supervised named entity recognition. In candidate selection, a class-wise flexible threshold is designed to fully explore other classes besides the high-performance class. In label generation, injecting the distant label, a hybrid pseudo label is adopted to provide straight semantic information for the low-performance class. Experiments on five flat and two nested datasets show that our model achieves state-of-the-art results. We also conduct extensive research to analyze the effectiveness of the flexible threshold and the hybrid pseudo label.
引用
收藏
页码:11054 / 11068
页数:15
相关论文
共 50 条
  • [21] Improving Distantly-Supervised Named Entity Recognition for Traditional Chinese Medicine Text via a Novel Back-Labeling Approach
    Zhang, Dezheng
    Xia, Chao
    Xu, Cong
    Jia, Qi
    Yang, Shibing
    Luo, Xiong
    Xie, Yonghong
    IEEE ACCESS, 2020, 8 : 145413 - 145421
  • [22] Improving distantly supervised named entity recognition by emphasizing uncertain examples
    Nie, Binling
    Shao, Yiming
    Wang, Yigang
    PATTERN ANALYSIS AND APPLICATIONS, 2025, 28 (01)
  • [23] Reinforcement learning based distantly supervised biomedical named entity recognition
    Bali, Manish
    Anandaraj, S. P.
    INTELLIGENT DECISION TECHNOLOGIES-NETHERLANDS, 2023, 17 (02): : 317 - 330
  • [24] Self-Training With Double Selectors for Low-Resource Named Entity Recognition
    Fu, Yingwen
    Lin, Nankai
    Yu, Xiaohui
    Jiang, Shengyi
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 1265 - 1275
  • [25] Distantly Supervised Named Entity Recognition with Category-Oriented Confidence Calibration
    Ding, Liangping
    Huang, Tian-Yuan
    Liu, Huan
    Wang, Yufei
    Zhang, Zhixiong
    FROM BORN-PHYSICAL TO BORN-VIRTUAL: AUGMENTING INTELLIGENCE IN DIGITAL LIBRARIES, ICADL 2022, 2022, 13636 : 46 - 55
  • [26] Distantly Supervised Named Entity Recognition using Positive-Unlabeled Learning
    Peng, Minlong
    Xing, Xiaoyu
    Zhang, Qi
    Fu, Jinlan
    Huang, Xuanjing
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 2409 - 2419
  • [27] Mitigating Effect of Dictionary Matching Errors in Distantly Supervised Named Entity Recognition
    Kobayashi, Koga
    Wakabayashi, Kei
    22ND INTERNATIONAL CONFERENCE ON INFORMATION INTEGRATION AND WEB-BASED APPLICATIONS & SERVICES (IIWAS2020), 2020, : 111 - 114
  • [28] Denoising Distantly Supervised Named Entity Recognition via a Hypergeometric Probabilistic Model
    Zhang, Wenkai
    Lin, Hongyu
    Han, Xianpei
    Sun, Le
    Liu, Huidan
    Wei, Zhicheng
    Yuan, Nicholas Jing
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 14481 - 14488
  • [29] De-biasing Distantly Supervised Named Entity Recognition via Causal Intervention
    Zhang, Wenkai
    Lin, Hongyu
    Han, Xianpei
    Sun, Le
    59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021, : 4803 - 4813
  • [30] Improving Self-training for Cross-lingual Named Entity Recognition with Contrastive and Prototype Learning
    Zhou, Ran
    Li, Xin
    Bing, Lidong
    Cambria, Erik
    Miao, Chunyan
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 4018 - 4031