BioKGrapher: Initial evaluation of automated knowledge graph construction from biomedical literature

被引:0
|
作者
Schaefer, Henning [1 ,2 ]
Idrissi-Yaghir, Ahmad [2 ,3 ]
Arzideh, Kamyar [4 ]
Damm, Hendrik [2 ,3 ]
Pakull, Tabea M. G. [1 ,2 ]
Schmidt, Cynthia S. [1 ,4 ]
Bahn, Mikel [4 ]
Lodde, Georg [6 ]
Livingstone, Elisabeth [6 ]
Schadendorf, Dirk [6 ]
Nensa, Felix [4 ,5 ]
Horn, Peter A. [1 ]
Friedrich, Christoph M. [2 ,3 ]
机构
[1] Univ Hosp Essen, Inst Transfus Med, Hufelandstr 55, D-45147 Essen, Germany
[2] Univ Appl Sci & Arts Dortmund FHDO, Dept Comp Sci, Emil Figge Str 42, D-44227 Dortmund, Germany
[3] Univ Hosp Essen, Inst Med Informat Biometry & Epidemiol IMIBE, Hufelandstr 55, D-45147 Essen, Germany
[4] Univ Hosp Essen, Inst Med IKIM, Girardetstr 2, D-45131 Essen, Germany
[5] Univ Hosp Essen, Inst Intervent & Diagnost Radiol & Neuroradiol, Hufelandstr 55, D-45147 Essen, Germany
[6] Univ Hosp Essen, Dept Dermatol, Hufelandstr 55, D-45147 Essen, Germany
来源
COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL | 2024年 / 24卷
关键词
Knowledge graph; Named entity recognition; Entity linking; Clinical guidelines; Software; B-CELL LYMPHOMA; RITUXIMAB THERAPY; GASTROESOPHAGEAL JUNCTION; SCIENTIFIC LITERATURE; COMBINED NIVOLUMAB; CHEMOTHERAPY; HALLMARKS; CANCER; SYSTEM; TRIAL;
D O I
10.1016/j.csbj.2024.10.017
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Background The growth of biomedical literature presents challenges in extracting and structuring knowledge. Knowledge Graphs (KGs) offer a solution by representing relationships between biomedical entities. However, manual construction of KGs is labor-intensive and time-consuming, highlighting the need for automated methods. This work introduces BioKGrapher, a tool for automatic KG construction using large-scale publication data, with a focus on biomedical concepts related to specific medical conditions. BioKGrapher allows researchers to construct KGs from PubMed IDs. Methods The BioKGrapher pipeline begins with Named Entity Recognition and Linking (NER+NEL) to extract and normalize biomedical concepts from PubMed, mapping them to the Unified Medical Language System (UMLS). Extracted concepts are weighted and re-ranked using Kullback-Leibler divergence and local frequency balancing. These concepts are then integrated into hierarchical KGs, with relationships formed using terminologies like SNOMED CT and NCIt. Downstream applications include multi-label document classification using Adapter- infused Transformer models. Results BioKGrapher effectively aligns generated concepts with clinical practice guidelines from the German Guideline Program in Oncology (GGPO), achieving F1-Scores of up to 0.6. In multi-label classification, Adapter- infused models using a BioKGrapher cancer-specific KG improved micro F1-Scores by up to 0.89 percentage points over a non-specific KG and 2.16 points over base models across three BERT variants. The drug-disease extraction case study identified indications for Nivolumab and Rituximab. Conclusion BioKGrapher is a tool for automatic KG construction, aligning with the GGPO and enhancing downstream task performance. It offers a scalable solution for managing biomedical knowledge, with potential applications in literature recommendation, decision support, and drug repurposing.
引用
收藏
页码:639 / 660
页数:22
相关论文
共 50 条
  • [21] A systematic literature review of knowledge graph construction and application in education
    Abu-Salih, Bilal
    Alotaibi, Salihah
    HELIYON, 2024, 10 (03)
  • [22] Discovering hidden knowledge from biomedical literature
    University of Nova Gorica, Vipavska 13, 5000 Nova Gorica, Slovenia
    不详
    不详
    Informatica, 2007, 1 (15-20):
  • [23] Discovering hidden knowledge from biomedical literature
    Petrič, Ingrid
    Urbančič, Tanja
    Cestnik, Bojan
    Informatica (Ljubljana), 2007, 31 (01) : 15 - 20
  • [24] Discovering Hidden Knowledge from Biomedical Literature
    Petric, Ingrid
    Urbancic, Tanja
    Cestnik, Bojan
    INFORMATICA-JOURNAL OF COMPUTING AND INFORMATICS, 2007, 31 (01): : 15 - 20
  • [25] A Framework for Automated Knowledge Graph Construction Towards Traditional Chinese Medicine
    Weng, Heng
    Liu, Ziqing
    Yan, Shixing
    Fan, Meiyu
    Ou, Aihua
    Chen, Dacan
    Hao, Tianyong
    HEALTH INFORMATION SCIENCE (HIS 2017), 2017, 10594 : 170 - 181
  • [26] Automated Equipment Defect Knowledge Graph Construction for Power Grid Regulation
    Liu, Wei
    Gu, Yanghao
    Zeng, Zhiqiang
    Qi, Donglian
    Li, Dezhi
    Luo, Yuanyuan
    Li, Qi
    Wei, Su
    ELECTRONICS, 2024, 13 (22)
  • [27] Construction and evaluation of a domain-specific knowledge graph for knowledge discovery
    Nguyen, Huyen
    Chen, Haihua
    Chen, Jiangping
    Kargozari, Kate
    Ding, Junhua
    INFORMATION DISCOVERY AND DELIVERY, 2023, 51 (04) : 358 - 370
  • [28] A Knowledge Graph of Combined Drug Therapies Using Semantic Predications From Biomedical Literature: Algorithm Development
    Du, Jian
    Li, Xiaoying
    JMIR MEDICAL INFORMATICS, 2020, 8 (04)
  • [29] A Graph-Based Keyword Extraction Method for Academic Literature Knowledge Graph Construction
    Zhang, Lin
    Li, Yanan
    Li, Qinru
    MATHEMATICS, 2024, 12 (09)
  • [30] Literature Characteristics and Knowledge Graph Analysis of Credit Construction Research in Domestic Construction Industry
    Wang, Ziyi
    Wang, Yaowu
    ICCREM 2021: CHALLENGES OF THE CONSTRUCTION INDUSTRY UNDER THE PANDEMIC, 2021, : 297 - 308