ADPG: Biomedical entity recognition based on Automatic Dependency Parsing Graph

被引:0
|
作者
Yang, Yumeng [1 ]
Lin, Hongfei [1 ]
Yang, Zhihao [1 ]
Zhang, Yijia [2 ]
Zhao, Di [3 ]
Huai, Shuaiheng [2 ]
机构
[1] Dalian Univ Technol, Sch Comp Sci & Technol, Dalian, Peoples R China
[2] Dalian Maritime Univ, Sch Informat Sci & Technol, Dalian, Peoples R China
[3] Dalian Minzu Univ, Sch Comp Sci & Engn, Dalian, Peoples R China
基金
中国博士后科学基金;
关键词
NER; Tree-transformer; Dependency parsing; Biomedical;
D O I
10.1016/j.jbi.2023.104317
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Named entity recognition is a key task in text mining. In the biomedical field, entity recognition focuses on extracting key information from large-scale biomedical texts for the downstream information extraction task. Biomedical literature contains a large amount of long-dependent text, and previous studies use external syntactic parsing tools to capture word dependencies in sentences to achieve nested biomedical entity recognition. However, the addition of external parsing tools often introduces unnecessary noise to the current auxiliary task and cannot improve the performance of entity recognition in an end-to-end way. Therefore, we propose a novel automatic dependency parsing approach, namely the ADPG model, to fuse syntactic structure information in an end-to-end way to recognize biomedical entities. Specifically, the method is based on a multilayer Tree-Transformer structure to automatically extract the semantic representation and syntactic structure in long-dependent sentences, and then combines a multilayer graph attention neural network (GAT) to extract the dependency paths between words in the syntactic structure to improve the performance of biomedical entity recognition. We evaluated our ADPG model on three biomedical domain and one news domain datasets, and the experimental results demonstrate that our model achieves state-of-the-art results on these four datasets with certain generalization performance. Our model is released on GitHub: https://github.com/Yumeng-Y/ADPG.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] From POS tagging to dependency parsing for biomedical event extraction
    Dat Quoc Nguyen
    Verspoor, Karin
    BMC BIOINFORMATICS, 2019, 20 (1)
  • [42] From POS tagging to dependency parsing for biomedical event extraction
    Dat Quoc Nguyen
    Karin Verspoor
    BMC Bioinformatics, 20
  • [43] Improvements to Dependency Parsing Using Automatic Simplification of Data
    Jelinek, Tomas
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014,
  • [44] Automatic Construction of Entity Semantic Representation Model Based on Dependency Analysis
    Liu Xiaoming
    Liu Jie
    Li Fangfang
    FRONTIERS OF MANUFACTURING AND DESIGN SCIENCE II, PTS 1-6, 2012, 121-126 : 1947 - +
  • [45] Faster biomedical named entity recognition based on knowledge distillation
    Hu B.
    Geng T.
    Deng G.
    Duan L.
    Qinghua Daxue Xuebao/Journal of Tsinghua University, 2021, 61 (09): : 936 - 942
  • [46] A Kernel-Based Approach for Biomedical Named Entity Recognition
    Patra, Rakesh
    Saha, Sujan Kumar
    SCIENTIFIC WORLD JOURNAL, 2013,
  • [47] Clustering Based Active Learning for Biomedical Named Entity Recognition
    Han, Xu
    Kwoh, Chee Keong
    Kim, Jung-jae
    2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 1253 - 1260
  • [48] Ensemble based Active Annotation for Biomedical Named Entity Recognition
    Verma, Mridula
    Sikdar, Utpal
    Saha, Sriparna
    Ekbal, Asif
    2013 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2013, : 973 - 978
  • [49] An Entity-Relation Extraction Method Based on the Mixture-of-Experts Model and Dependency Parsing
    Li, Yuanxi
    Wang, Haiyan
    Zhang, Dong
    APPLIED SCIENCES-BASEL, 2025, 15 (04):
  • [50] An Empirical Investigation of Structured Output Modeling for Graph-based Neural Dependency Parsing
    Zhang, Zhisong
    Ma, Xuezhe
    Hovy, Eduard
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 5592 - 5598