Storage and Query of Drug Knowledge Graphs Using Distributed Graph Databases: A Case Study

Cited by: 0
Authors
Han, Xingjian [1]
Tian, Yu [2]
Affiliations
[1] China Elect Prod Reliabil & Environm Testing Res I, Elect Res Inst MIIT 5, Guangzhou 510610, Peoples R China
[2] Zhejiang Univ, Coll Biomed Engn & Instrument Sci, Engn Res Ctr EMR & Intelligent Expert Syst, Key Lab Biomed Engn,Minist Educ, Hangzhou 310027, Peoples R China
Source
BIOENGINEERING-BASEL | 2025 / Vol. 12 / Issue 02
Funding
National Natural Science Foundation of China
Keywords
distributed graph databases; drug knowledge graph; storage and query; real-time data retrieval
DOI
10.3390/bioengineering12020115
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology]
Discipline classification codes
071005; 0836; 090102; 100705
Abstract
Background: Distributed graph databases are a promising method for storing and conducting complex pathway queries on large-scale drug knowledge graphs to support drug research. However, there is a research gap in evaluating drug knowledge graphs' storage and query performance based on distributed graph databases. This study evaluates the feasibility and performance of distributed graph databases in managing large-scale drug knowledge graphs. Methods: First, a drug knowledge graph storage and query system is designed based on the Nebula Graph database. Second, the system's writing and query performance is evaluated. Finally, two drug repurposing benchmarks are used to provide a more extensive and reliable assessment. Results: The performance of distributed graph databases surpasses that of single-machine databases, including data writing, regular queries, constrained queries, and concurrent queries. Additionally, the advantages of distributed graph databases in writing performance become more pronounced as the data volume increases. The query performance benefits of distributed graph databases also improve with the complexity of query tasks. The drug repurposing evaluation results show that 78.54% of the pathways are consistent with currently approved drug treatments according to repoDB. Additionally, 12 potential pathways for new drug indications are found to have literature support according to DrugRepoBank. Conclusions: The proposed system is able to construct, store, and query a large graph of multisource drug knowledge and provides reliable and explainable drug-disease paths for drug repurposing.
Pages: 12
Related papers
50 in total
  • [1] Distributed Storage and Query for Domain Knowledge Graphs
    Shan, Xiaohuan
    Shi, Xiyi
    Ma, Wenyuan
    Wang, Junlu
    WEB AND BIG DATA, APWEB-WAIM 2020 INTERNATIONAL WORKSHOPS, KGMA 2020, SEMIBDMA 2020, DEEPLUDA 2020, 2021, 1373 : 116 - 128
  • [2] Querying in the Age of Graph Databases and Knowledge Graphs
    Arenas, Marcelo
    Gutierrez, Claudio
    Sequeda, Juan F.
    SIGMOD '21: PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2021, : 2821 - 2828
  • [3] Distributed Knowledge Graph Query Acceleration Algorithm
    Shi, Peifan
    Li, Youhuan
    Li, Wenjie
    Chen, Xinhuan
    WEB AND BIG DATA, PT III, APWEB-WAIM 2023, 2024, 14333 : 32 - 47
  • [4] Automated Query Graph Generation for Querying Knowledge Graphs
    Zheng, Weiguo
    Zhang, Mei
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 2698 - 2707
  • [5] Graph Embedding based Query Construction over Knowledge Graphs
    Wang, Ruijie
    Wang, Meng
    Liu, Jun
    Yao, Siyu
    Zheng, Qinghua
    2018 9TH IEEE INTERNATIONAL CONFERENCE ON BIG KNOWLEDGE (ICBK), 2018, : 1 - 8
  • [6] Semantic Query Transformations for Increased Parallelization in Distributed Knowledge Graph Query Processing
    Kim, Hyeongsik
    Bhattacharyya, Abhisha
    Anyanwu, Kemafor
    PROCEEDINGS OF SC19: THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2019,
  • [7] Knowledge Graph OLAP: A multidimensional model and query operations for contextualized knowledge graphs
    Schuetz, Christoph G.
    Bozzato, Loris
    Neumayr, Bernd
    Schrefl, Michael
    Serafini, Luciano
    SEMANTIC WEB, 2021, 12 (04) : 649 - 683
  • [8] Knowledge Graphs for drug repurposing: a review of databases and methods
    Perdomo-Quinteiro, Pablo
    Belmonte-Hernandez, Alberto
    BRIEFINGS IN BIOINFORMATICS, 2024, 25 (06)
  • [9] On Smart Query Routing: For Distributed Graph Querying with Decoupled Storage
    Khan, Arijit
    Segovia, Gustavo
    Kossmann, Donald
    PROCEEDINGS OF THE 2018 USENIX ANNUAL TECHNICAL CONFERENCE, 2018, : 401 - 412
  • [10] Query Optimization using Clustering and Genetic Algorithm for Distributed Databases
    Lakshmi, S. Venkata
    Vatsavayi, Valli Kumari
    2016 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI), 2016,