Biomedical Flat and Nested Named Entity Recognition: Methods, Challenges, and Advances

被引:1
|
作者
Park, Yesol [1 ]
Son, Gyujin [2 ]
Rho, Mina [1 ,2 ,3 ]
机构
[1] Hanyang Univ, Dept Comp Sci, Seoul 04763, South Korea
[2] Hanyang Univ, Dept Artificial Intelligence, Seoul 04763, South Korea
[3] Hanyang Univ, Dept Biomed Informat, Seoul 04763, South Korea
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 20期
关键词
named entity recognition; biomedical named entity recognition; flat named entity recognition; nested named entity recognition; flat and nested named entity recognition; natural language processing; CORPUS;
D O I
10.3390/app14209302
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Biomedical named entity recognition (BioNER) aims to identify and classify biomedical entities (i.e., diseases, chemicals, and genes) from text into predefined classes. This process serves as an important initial step in extracting biomedical information from textual sources. Considering the structure of the entities it addresses, BioNER tasks are divided into two categories: flat NER, where entities are non-overlapping, and nested NER, which identifies entities embedded within another. While early studies primarily addressed flat NER, recent advances in neural models have enabled more sophisticated approaches to nested NER, gaining increasing relevance in the biomedical field, where entity relationships are often complex and hierarchically structured. This review, thus, focuses on the latest progress in large-scale pre-trained language model-based approaches, which have shown the significantly improved performance of NER. The state-of-the-art flat NER models have achieved average F1-scores of 84% on BC2GM, 89% on NCBI Disease, and 92% on BC4CHEM, while nested NER models have reached 80% on the GENIA dataset, indicating room for enhancement. In addition, we discuss persistent challenges, including inconsistencies of named entities annotated across different corpora and the limited availability of named entities of various entity types, particularly for multi-type or nested NER. To the best of our knowledge, this paper is the first comprehensive review of pre-trained language model-based flat and nested BioNER models, providing a categorical analysis among the methods and related challenges for future research and development in the field.
引用
收藏
页数:23
相关论文
共 50 条
  • [31] Nested named entity recognition in historical archive text
    Byrne, Kate
    ICSC 2007: INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING, PROCEEDINGS, 2007, : 589 - 596
  • [32] A Boundary Regression Model for Nested Named Entity Recognition
    Yanping Chen
    Lefei Wu
    Qinghua Zheng
    Ruizhang Huang
    Jun Liu
    Liyuan Deng
    Junhui Yu
    Yongbin Qing
    Bo Dong
    Ping Chen
    Cognitive Computation, 2023, 15 : 534 - 551
  • [33] Deep Exhaustive Model for Nested Named Entity Recognition
    Sohrab, Mohammad Golam
    Miwa, Makoto
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 2843 - 2849
  • [34] Nested Named Entity Recognition as Building Local Hypergraphs
    Yan, Yukun
    Cai, Bingling
    Song, Sen
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 11, 2023, : 13878 - 13886
  • [35] Candidate region aware nested named entity recognition
    Jiang, Deng
    Ren, Haopeng
    Cai, Yi
    Xu, Jingyun
    Liu, Yanxia
    Leung, Ho-fung
    NEURAL NETWORKS, 2021, 142 : 340 - 350
  • [36] A Boundary Regression Model for Nested Named Entity Recognition
    Chen, Yanping
    Wu, Lefei
    Zheng, Qinghua
    Huang, Ruizhang
    Liu, Jun
    Deng, Liyuan
    Yu, Junhui
    Qing, Yongbin
    Dong, Bo
    Chen, Ping
    COGNITIVE COMPUTATION, 2023, 15 (02) : 534 - 551
  • [37] Planarized sentence representation for nested named entity recognition
    Geng, Rushan
    Chen, Yanping
    Huang, Ruizhang
    Qin, Yongbin
    Zheng, Qinghua
    INFORMATION PROCESSING & MANAGEMENT, 2023, 60 (04)
  • [38] MMBERT: a unified framework for biomedical named entity recognition
    Lei Fu
    Zuquan Weng
    Jiheng Zhang
    Haihe Xie
    Yiqing Cao
    Medical & Biological Engineering & Computing, 2024, 62 : 327 - 341
  • [39] Comparison of named entity recognition methodologies in biomedical documents
    Song, Hye-Jeong
    Jo, Byeong-Cheol
    Park, Chan-Young
    Kim, Jong-Dae
    Kim, Yu-Seop
    BIOMEDICAL ENGINEERING ONLINE, 2018, 17
  • [40] Various criteria in the evaluation of biomedical named entity recognition
    Tsai, RTH
    Wu, SH
    Chou, WC
    Lin, YC
    He, D
    Hsiang, J
    Sung, TY
    Hsu, WL
    BMC BIOINFORMATICS, 2006, 7 (1) : 1 - 8