A multi-head adjacent attention-based pyramid layered model for nested named entity recognition

被引:4
|
作者
Cui, Shengmin [1 ]
Joe, Inwhee [1 ]
机构
[1] Hanyang Univ, Dept Comp Sci, 222 Wangsimni Ro, Seoul 04763, South Korea
来源
NEURAL COMPUTING & APPLICATIONS | 2023年 / 35卷 / 03期
关键词
Nested named entity recognition; Named entity recognition; Attention; Pyramid; Natural language processing; EXTRACTION;
D O I
10.1007/s00521-022-07747-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Named entity recognition (NER) is one of the widely studied natural language processing tasks in recent years. Conventional solutions treat the NER as a sequence labeling problem, but these approaches cannot handle nested NER. This is due to the fact that nested NER refers to the case where one entity contains another entity and it is not feasible to tag each token with a single tag. The pyramid model stacks L flat NER layers for prediction, which subtly enumerates all spans with length less than or equal to L. However, the original model introduces a block consisting of a convolutional layer and a bidirectional long short-term memory (Bi-LSTM) layer as the decoder, which does not consider the dependency between adjacent inputs and the Bi-LSTM cannot perform parallel computation on sequential inputs. For the purpose of improving performance and reducing the forward computation, we propose a Multi-Head Adjacent Attention-based Pyramid Layered model. In addition, when constructing a pyramid structure for span representation, the information of the intermediate words has more proportion than words on the two sides. To address this imbalance in the span representation, we fuse the output of the attention layer with the features of head and tail words when doing classification. We conducted experiments on nested NER datasets such as GENIA, SciERC, and ADE to validate the effectiveness of our proposed model.
引用
收藏
页码:2561 / 2574
页数:14
相关论文
共 50 条
  • [31] Research on named entity recognition of chinese electronic medical records based on multi-head attention mechanism and character-word information fusion
    Zhang, Qinghui
    Wu, Meng
    Lv, Pengtao
    Zhang, Mengya
    Yang, Hongwei
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 42 (04) : 4105 - 4116
  • [32] An attention-based deep learning model for clinical named entity recognition of Chinese electronic medical records
    Li, Luqi
    Zhao, Jie
    Hou, Li
    Zhai, Yunkai
    Shi, Jinming
    Cui, Fangfang
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2019, 19 (01)
  • [33] An attention-based deep learning model for clinical named entity recognition of Chinese electronic medical records
    Luqi Li
    Jie Zhao
    Li Hou
    Yunkai Zhai
    Jinming Shi
    Fangfang Cui
    BMC Medical Informatics and Decision Making, 19
  • [34] Deep Exhaustive Model for Nested Named Entity Recognition
    Sohrab, Mohammad Golam
    Miwa, Makoto
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 2843 - 2849
  • [35] A Boundary Regression Model for Nested Named Entity Recognition
    Yanping Chen
    Lefei Wu
    Qinghua Zheng
    Ruizhang Huang
    Jun Liu
    Liyuan Deng
    Junhui Yu
    Yongbin Qing
    Bo Dong
    Ping Chen
    Cognitive Computation, 2023, 15 : 534 - 551
  • [36] A Boundary Regression Model for Nested Named Entity Recognition
    Chen, Yanping
    Wu, Lefei
    Zheng, Qinghua
    Huang, Ruizhang
    Liu, Jun
    Deng, Liyuan
    Yu, Junhui
    Qing, Yongbin
    Dong, Bo
    Chen, Ping
    COGNITIVE COMPUTATION, 2023, 15 (02) : 534 - 551
  • [37] A fiber recognition framework based on multi-head attention mechanism
    Xu, Luoli
    Li, Fenying
    Chang, Shan
    TEXTILE RESEARCH JOURNAL, 2024, 94 (23-24) : 2629 - 2640
  • [38] Multi-head CRF classifier for biomedical multi-class named entity recognition on Spanish clinical notes
    Jonker, Richard A. A.
    Almeida, Tiago
    Antunes, Rui
    Almeida, Joao R.
    Matos, Sergio
    DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2024, 2024
  • [39] Multi-head attention graph convolutional network model: End-to-end entity and relation joint extraction based on multi-head attention graph convolutional network
    Tao, Zhihua
    Ouyang, Chunping
    Liu, Yongbin
    Chung, Tonglee
    Cao, Yixin
    CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2023, 8 (02) : 468 - 477
  • [40] Self Multi-Head Attention for Speaker Recognition
    India, Miquel
    Safari, Pooyan
    Hernando, Javier
    INTERSPEECH 2019, 2019, : 4305 - 4309