Span-based Named Entity Recognition by Generating and Compressing Information

被引:0
|
作者
Nguyen, Nhung T. H. [1 ]
Miwa, Makoto [2 ,3 ]
Ananiadou, Sophia [1 ,3 ,4 ]
机构
[1] Univ Manchester, Natl Ctr Text Min, Dept Comp Sci, Manchester, Lancs, England
[2] Toyota Technol Inst, Toyota, Japan
[3] Natl Inst Adv Ind Sci & Technol, Artificial Intelligence Res Ctr AIRC, Tokyo, Japan
[4] Alan Turing Inst, London, England
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The information bottleneck (IB) principle has been proven effective in various NLP applications. The existing work, however, only used either generative or information compression models to improve the performance of the target task. In this paper, we propose to combine the two types of IB models into one system to enhance Named Entity Recognition (NER). For one type of IB model, we incorporate two unsupervised generative components, span reconstruction and synonym generation, into a span-based NER system. The span reconstruction ensures that the contextualised span representation keeps the span information, while the synonym generation makes synonyms have similar representations even in different contexts. For the other type of IB model, we add a supervised IB layer that performs information compression into the system to preserve useful features for NER in the resulting span representations. Experiments on five different corpora indicate that jointly training both generative and information compression models can enhance the performance of the baseline span-based NER system. Our source code is publicly available at https://github.com/nguyennth/joint-ib-models.
引用
收藏
页码:1984 / 1996
页数:13
相关论文
共 50 条
  • [1] A Neural Span-Based Continual Named Entity Recognition Model
    Zhang, Yunan
    Chen, Qingcai
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 11, 2023, : 13993 - 14001
  • [2] Span-Based Nested Named Entity Recognition with Pretrained Language Model
    Liu, Chenxu
    Fan, Hongjie
    Liu, Junfei
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2021), PT II, 2021, 12682 : 620 - 628
  • [3] A Span-Based Model for Joint Overlapped and Discontinuous Named Entity Recognition
    Li, Fei
    Lin, Zhichao
    Zhang, Meishan
    Ji, Donghong
    59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021, : 4814 - 4828
  • [4] A segment enhanced span-based model for nested named entity recognition
    Li, Fei
    Wang, Zheng
    Hui, Siu Cheung
    Liao, Lejian
    Zhu, Xinhua
    Huang, Heyan
    NEUROCOMPUTING, 2021, 465 : 26 - 37
  • [5] A Boundary Feature Enhanced Span-Based Nested Named Entity Recognition Method
    Song, Jiaqi
    Wang, Xingxing
    Zhang, Huihui
    Li, Bohan
    Wang, Tiexin
    WEB AND BIG DATA, APWEB-WAIM 2024, PT I, 2024, 14961 : 3 - 17
  • [6] Handling negative samples problems in span-based nested named entity recognition
    Liu, Chenxu
    Fan, Hongjie
    Liu, Junfei
    NEUROCOMPUTING, 2022, 505 : 353 - 361
  • [7] S-NER: A Concise and Efficient Span-Based Model for Named Entity Recognition
    Yu, Jie
    Ji, Bin
    Li, Shasha
    Ma, Jun
    Liu, Huijun
    Xu, Hao
    SENSORS, 2022, 22 (08)
  • [8] ScdNER: Span-Based Consistency-Aware Document-Level Named Entity Recognition
    Wei, Ying
    Li, Qi
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 15677 - 15685
  • [9] A dynamic programming algorithm for span-based nested named-entity recognition in O(n2)
    Corro, Caio
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 10712 - 10724
  • [10] SKD-NER: Continual Named Entity Recognition via Span-based Knowledge Distillation with Reinforcement Learning
    Chen, Yi
    He, Liang
    Wang, Lei
    Han, Zhenxiang
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 6689 - 6700