Distantly Supervised Named Entity Recognition Combined with Prototypical Networks

被引:0
|
作者
Luo S. [1 ]
Lin Z. [1 ]
Pan L. [1 ]
Wu Z. [1 ]
机构
[1] School of Information and Electronics, Beijing Institute of Technology, Beijing
关键词
automatic corpus annotation; distant supervision; named entity recognition; positive-unlabeled learning{PUL); prototypical network;
D O I
10.15918/j.tbit1001-0645.2022.098
中图分类号
学科分类号
摘要
Aiming at the problem of entity category labeling errors in the process of using distant supervision to label text entities, it is difficult for the model to effectively distinguish the category characteristics of each entity and affect the accuracy of the model. A named entity recognition (NER)method was proposed in this paper. It was designed to use pre-trained prototypical network coding to correctly label entities to generate category prototype representations, and to filter those far away samples from category prototypes in the corpus. Experiments show that the use of the prototype network can effectively improve the annotation quality of the corpus and improve the performance of the model. © 2023 Beijing Institute of Technology. All rights reserved.
引用
收藏
页码:410 / 416
页数:6
相关论文
共 20 条
  • [11] HOU Y, CHE W, LAI Y, Et al., Few-shot slot tagging with collapsed dependency transfer and label-enhanced task-adaptive projection network, 58th Annual Meeting of the Association for Computational Linguistics, pp. 1381-1393, (2020)
  • [12] YANG Y, KATIYAR A., Simple and effective few-shot named entity recognition with structured nearest neighbor learning, Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, pp. 6365-6375, (2020)
  • [13] DING N, XU G, CHEN Y, Et al., Few-NERD: a few-shot named entity recognition dataset, Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, pp. 3198-3213, (2021)
  • [14] COLLOBERT R, WESTON J, BOTTOU L, Et al., Natural language processing (almost) from scratch, Journal of Machine Learning Research, 12, 2, pp. 2493-2537, (2011)
  • [15] GRAVES A, SCHMIDHUBER J., Framewise phoneme classification with bidirectional LSTM and other neural network architectures, Neural Networks, 18, 5-6, pp. 602-610, (2005)
  • [16] TJONG KIM, SANG E F., Introduction to the CoNLL-2003 shared task: language-independent named entity recognition [C], Proceedings of CoNLL-2003, pp. 142-147, (2003)
  • [17] CHINCHOR N A., Overview of muc-7, 7th Message Understanding Conference, MUC 1998, pp. 1-2, (1998)
  • [18] ZHANG Q, FU J, LIU X, Et al., Adaptive co-attention network for named entity recognition in tweets, 32nd AAAI Conference on Artificial Intelligence, pp. 5674-5681, (2018)
  • [19] FINKEL J R, GRENAGER T, MANNING C D., Incorporating non-local information into information extraction systems by gibbs sampling, Proceedings of the 43rd Annual Meeting of The Association for Computational Linguistics (ACL’05), pp. 363-370, (2005)
  • [20] CHEN X, QIU X, ZHU C, Et al., Long short-term memory neural networks for chinese word segmentation, Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp. 1197-1206, (2015)