Storage and Query of Drug Knowledge Graphs Using Distributed Graph Databases: A Case Study

被引:0
|
作者
Han, Xingjian [1 ]
Tian, Yu [2 ]
机构
[1] China Elect Prod Reliabil & Environm Testing Res I, Elect Res Inst MIIT 5, Guangzhou 510610, Peoples R China
[2] Zhejiang Univ, Coll Biomed Engn & Instrument Sci, Engn Res Ctr EMR & Intelligent Expert Syst, Key Lab Biomed Engn,Minist Educ, Hangzhou 310027, Peoples R China
来源
BIOENGINEERING-BASEL | 2025年 / 12卷 / 02期
基金
中国国家自然科学基金;
关键词
distributed graph databases; drug knowledge graph; storage and query; real-time data retrieval;
D O I
10.3390/bioengineering12020115
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: Distributed graph databases are a promising method for storing and conducting complex pathway queries on large-scale drug knowledge graphs to support drug research. However, there is a research gap in evaluating drug knowledge graphs' storage and query performance based on distributed graph databases. This study evaluates the feasibility and performance of distributed graph databases in managing large-scale drug knowledge graphs. Methods: First, a drug knowledge graph storage and query system is designed based on the Nebula Graph database. Second, the system's writing and query performance is evaluated. Finally, two drug repurposing benchmarks are used to provide a more extensive and reliable assessment. Results: The performance of distributed graph databases surpasses that of single-machine databases, including data writing, regular queries, constrained queries, and concurrent queries. Additionally, the advantages of distributed graph databases in writing performance become more pronounced as the data volume increases. The query performance benefits of distributed graph databases also improve with the complexity of query tasks. The drug repurposing evaluation results show that 78.54% of the pathways are consistent with currently approved drug treatments according to repoDB. Additionally, 12 potential pathways for new drug indications are found to have literature support according to DrugRepoBank. Conclusions: The proposed system is able to construct, store, and query a large graph of multisource drug knowledge and provides reliable and explainable drug-disease paths for drug repurposing.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] The Graph Signature: A Scalable Query Optimization Index for RDF Graph Databases Using Bisimulation and Trace Equivalence Summarization
    Jarrar, Mustafa
    Deik, Anton
    INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2015, 11 (02) : 36 - 65
  • [22] Using domain knowledge to learn from heterogeneous distributed databases
    McClean, S
    Scotney, B
    Shapcott, M
    KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 1, PROCEEDINGS, 2004, 3213 : 171 - 177
  • [23] Knowledge-driven drug repurposing using a comprehensive drug knowledge graph
    Zhu, Yongjun
    Che, Chao
    Jin, Bo
    Zhang, Ningrui
    Su, Chang
    Wang, Fei
    HEALTH INFORMATICS JOURNAL, 2020, 26 (04) : 2737 - 2750
  • [24] Join Query Optimization Using Genetic Ant Colony Optimization Algorithm for Distributed Databases
    Tiwari, Preeti
    Chande, Swati V.
    EMERGING TECHNOLOGIES IN COMPUTER ENGINEERING: MICROSERVICES IN BIG DATA ANALYTICS, 2019, 985 : 224 - 239
  • [25] Distributed Query Processing on Compressed Graphs Using K2-Trees
    Alvarez-Garcia, Sandra
    Brisaboa, Nieves R.
    Gomez-Pantoja, Carlos
    Marin, Mauricio
    STRING PROCESSING AND INFORMATION RETRIEVAL (SPIRE 2013), 2013, 8214 : 298 - 310
  • [26] Discovering Causal Rules in Knowledge Graphs using Graph Embeddings
    Simonne, Lucas
    Pemelle, Nathalic
    Sais, Fatiha
    Thomopoulos, Rallou
    2022 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY, WI-IAT, 2022, : 95 - 102
  • [27] Large-Scale Ontology Storage and Query Using Graph Database-Oriented Approach: The Case of Freebase
    Elbattah, Mahmoud
    Roushdy, Mohamed
    Aref, Mostafa
    Salem, Abdel-Badeeh M.
    2015 IEEE SEVENTH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND INFORMATION SYSTEMS (ICICIS), 2015, : 39 - 43
  • [28] Integrating query of relational and textual data in clinical databases: A case study
    Fisk, JM
    Mutalik, P
    Levin, FW
    Erdos, J
    Taylor, C
    Nadkarni, P
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2003, 10 (01) : 21 - 38
  • [29] A tale of two approaches: Query performance study of XML storage strategies in relational databases
    Prakash, Sandeep
    Bhowmick, Sourav S.
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2006, 4080 : 149 - 160
  • [30] Using background knowledge for graph based learning:: a case study in chemoinformatics
    Karunaratne, Thashmee
    Bostrom, Henrik
    IMECS 2007: INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2007, : 153 - +