Integrating deep learning architectures for enhanced biomedical relation extraction: a pipeline approach

被引:0
|
作者
Sarol, M. Janina [1 ]
Hong, Gibong [2 ]
Guerra, Evan [2 ]
Kilicoglu, Halil [2 ]
机构
[1] Univ Illinois, Informat Programs, 614 E Daniel St, Champaign, IL 61820 USA
[2] Univ Illinois, Sch Informat Sci, 501 Daniel St, Champaign, IL 61820 USA
基金
美国国家卫生研究院;
关键词
NORMALIZATION; RECOGNITION; RESOURCE; CORPUS; ENTITY;
D O I
10.1093/database/baae079
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Biomedical relation extraction from scientific publications is a key task in biomedical natural language processing (NLP) and can facilitate the creation of large knowledge bases, enable more efficient knowledge discovery, and accelerate evidence synthesis. In this paper, building upon our previous effort in the BioCreative VIII BioRED Track, we propose an enhanced end-to-end pipeline approach for biomedical relation extraction (RE) and novelty detection (ND) that effectively leverages existing datasets and integrates state-of-the-art deep learning methods. Our pipeline consists of four tasks performed sequentially: named entity recognition (NER), entity linking (EL), RE, and ND. We trained models using the BioRED benchmark corpus that was the basis of the shared task. We explored several methods for each task and combinations thereof: for NER, we compared a BERT-based sequence labeling model that uses the BIO scheme with a span classification model. For EL, we trained a convolutional neural network model for diseases and chemicals and used an existing tool, PubTator 3.0, for mapping other entity types. For RE and ND, we adapted the BERT-based, sentence-bound PURE model to bidirectional and document-level extraction. We also performed extensive hyperparameter tuning to improve model performance. We obtained our best performance using BERT-based models for NER, RE, and ND, and the hybrid approach for EL. Our enhanced and optimized pipeline showed substantial improvement compared to our shared task submission, NER: 93.53 (+3.09), EL: 83.87 (+9.73), RE: 46.18 (+15.67), and ND: 38.86 (+14.9). While the performances of the NER and EL models are reasonably high, RE and ND tasks remain challenging at the document level. Further enhancements to the dataset could enable more accurate and useful models for practical use. We provide our models and code at https://github.com/janinaj/e2eBioMedRE/.Database URL: https://github.com/janinaj/e2eBioMedRE/
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Integrating graph convolutional networks to enhance prompt learning for biomedical relation extraction
    Guo, Bocheng
    Meng, Jiana
    Zhao, Di
    Jia, Xiangxing
    Chu, Yonghe
    Lin, Hongfei
    JOURNAL OF BIOMEDICAL INFORMATICS, 2024, 157
  • [2] A multi-task learning based approach to biomedical entity relation extraction
    Li, Qingqing
    Yang, Zhihao
    Luo, Ling
    Wang, Lei
    Zhang, Yin
    Lin, Hongfei
    Wang, Jian
    Yang, Liang
    Xu, Kan
    Zhang, Yijia
    PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2018, : 680 - 682
  • [3] A deep learning relation extraction approach to support a biomedical semi-automatic curation task: The case of the gluten bibliome
    Perez-Perez, Martin
    Ferreira, Tania
    Igrejas, Gilberto
    Fdez-Riverola, Florentino
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 195
  • [4] A study of deep active learning methods to reduce labelling efforts in biomedical relation extraction
    Nachtegael, Charlotte
    De Stefani, Jacopo
    Lenaerts, Tom
    PLOS ONE, 2023, 18 (12):
  • [5] Relation Extraction Between Medical Entities Using Deep Learning Approach
    Patel, Ruchi
    Tanwani, Sanjay
    Patidar, Chhaya
    INFORMATICA-AN INTERNATIONAL JOURNAL OF COMPUTING AND INFORMATICS, 2021, 45 (03): : 359 - 366
  • [6] A Military Named Entity Relation Extraction Approach Based on Deep Learning
    Wang, Xuefeng
    Yang, Ruopeng
    Feng, Yulong
    Li, Dongsheng
    Hou, Jianfeng
    2018 INTERNATIONAL CONFERENCE ON ALGORITHMS, COMPUTING AND ARTIFICIAL INTELLIGENCE (ACAI 2018), 2018,
  • [7] Relation extraction between medical entities using deep learning approach
    Patel R.
    Tanwani S.
    Patidar C.
    Informatica (Slovenia), 2021, 45 (03): : 359 - 366
  • [8] Relation Extraction with Deep Reinforcement Learning
    Zhang, Hongjun
    Feng, Yuntian
    Hao, Wenning
    Chen, Gang
    Jin, Dawei
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2017, E100D (08) : 1893 - 1902
  • [9] A neural network-based joint learning approach for biomedical entity and relation extraction from biomedical literature
    Luo, Ling
    Yang, Zhihao
    Cao, Mingyu
    Wang, Lei
    Zhang, Yin
    Lin, Hongfei
    JOURNAL OF BIOMEDICAL INFORMATICS, 2020, 103
  • [10] A Deep Learning Approach With Deep Contextualized Word Representations for ChemicalProtein Interaction Extraction From Biomedical Literature
    Sun, Cong
    Yang, Zhihao
    Luo, Ling
    Wang, Lei
    Zhang, Yin
    Lin, Hongfei
    Wang, Jian
    IEEE ACCESS, 2019, 7 : 151034 - 151046