A Flat-Span Contrastive Learning Method for Nested Named Entity Recognition

被引:0
|
作者
Liu, Yaodi [1 ]
Zhang, Kun [1 ]
Tong, Rong [2 ]
Cai, Chenxi [1 ]
Chen, Dianying [1 ]
Wu, Xiaohe [1 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp & Engn, Nanjing, Peoples R China
[2] Singapore Inst Technol, Infocomm Technol Cluster, Singapore, Singapore
关键词
nested named entity recognition; sequence labeling; span classification; contrastive learning; multi-task learning;
D O I
10.1109/IALP63756.2024.10661137
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most Named entity recognition (NER) methods can only handle flat entities and ignore nested entities. In Natural language processing (NLP), it is common to contain other entities within entities. Therefore, we propose a Flat-Span contrastive learning (Fla-SpaCL) method for nested NER. This method includes two sub-modules: a flat NER module for outer entities and a candidate span classification module based on contrastive learning. In the flat NER module, we use Star-Transformer and Conditional random field (CRF) to identify the outer entities. In the candidate span classification module, we first generate inner candidate spans based on the identified outer entities. Secondly, to better distinguish entity spans and non-entity spans, we introduce contrastive learning to maximize the similarity between entity spans and use the InfoNEC loss function to handle hard negative samples. Finally, multi-task learning is used to jointly optimize the flat NER module and the candidate span classification module to reduce error propagation and improve model performance. In the experimental analysis, we compared the proposed model with baseline models to verify its effectiveness.
引用
收藏
页码:37 / 42
页数:6
相关论文
共 50 条
  • [21] Global Span Semantic Dependency Awareness and Filtering Network for nested named entity recognition
    Sun, Yunlei
    Wang, Xiaoyang
    Wu, Haosheng
    Hu, Miao
    NEUROCOMPUTING, 2025, 617
  • [22] Handling negative samples problems in span-based nested named entity recognition
    Liu, Chenxu
    Fan, Hongjie
    Liu, Junfei
    NEUROCOMPUTING, 2022, 505 : 353 - 361
  • [23] Mitigating prototype shift: Few-shot nested named entity recognition with prototype-attention contrastive learning
    Ming, Hong
    Yang, Jiaoyun
    Liu, Shuo
    Jiang, Lili
    An, Ning
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 268
  • [24] A Controlled Attention for Nested Named Entity Recognition
    Chen, Yanping
    Huang, Rong
    Pan, Lijun
    Huang, Ruizhang
    Zheng, Qinghua
    Chen, Ping
    COGNITIVE COMPUTATION, 2023, 15 (01) : 132 - 145
  • [25] A Controlled Attention for Nested Named Entity Recognition
    Yanping Chen
    Rong Huang
    Lijun Pan
    Ruizhang Huang
    Qinghua Zheng
    Ping Chen
    Cognitive Computation, 2023, 15 : 132 - 145
  • [26] Research on End-To-End Nested Named Entity Recognition Method
    Deng, Liyuan
    Chen, Yanpin
    Wu, Yuefei
    Qin, Yongbin
    Huang, Ruizhang
    Zheng, Qinghua
    Tan, Xi
    Computer Engineering and Applications, 2023, 59 (07) : 278 - 284
  • [27] Thai Nested Named Entity Recognition Corpus
    Buaphet, Weerayut
    Udomcharoenchaikit, Can
    Limkonchotiwat, Peerat
    Rutherford, Attapol T.
    Nutanong, Sarana
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), 2022, : 1473 - 1486
  • [28] Deep Span Representations for Named Entity Recognition
    Zhu, Enwei
    Liu, Yiyang
    Li, Jinpeng
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 10565 - 10582
  • [29] CONTAINER: Few-Shot Named Entity Recognition via Contrastive Learning
    Das, Sarkar Snigdha Sarathi
    Katiyar, Arzoo
    Passonneau, Rebecca J.
    Zhang, Rui
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 6338 - 6353
  • [30] Joint contrastive learning and belief rule base for named entity recognition in cybersecurity
    Chenxi Hu
    Tao Wu
    Chunsheng Liu
    Chao Chang
    Cybersecurity, 7