The biomedical knowledge graph of symptom phenotype in coronary artery plaque: machine learning-based analysis of real-world clinical data

被引:0
|
作者
Huan, Jia-Ming [1 ]
Wang, Xiao-Jie [1 ]
Li, Yuan [1 ]
Zhang, Shi-Jun [1 ]
Hu, Yuan-Long [1 ]
Li, Yun-Lun [1 ,2 ,3 ]
机构
[1] Shandong Univ Tradit Chinese Med, Sch Clin Med 1, Jinan 250355, Peoples R China
[2] Shandong Univ Tradit Chinese Med, Dept Cardiovasc, Affiliated Hosp, Jinan 250014, Peoples R China
[3] Shandong Engn Res Ctr, Precis Diag & Treatment Cardiovasc Dis Tradit Chin, Jinan 250355, Peoples R China
来源
BIODATA MINING | 2024年 / 17卷 / 01期
关键词
Coronary artery plate; Biomedical knowledge graph; Symptom phenotypes; Machine learning; Network analysis; Clinical decision support; SEMANTIC SIMILARITY; ATHEROSCLEROSIS; STENOSIS; BURDEN;
D O I
10.1186/s13040-024-00365-1
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
A knowledge graph can effectively showcase the essential characteristics of data and is increasingly emerging as a significant means of integrating information in the field of artificial intelligence. Coronary artery plaque represents a significant etiology of cardiovascular events, posing a diagnostic challenge for clinicians who are confronted with a multitude of nonspecific symptoms. To visualize the hierarchical relationship network graph of the molecular mechanisms underlying plaque properties and symptom phenotypes, patient symptomatology was extracted from electronic health record data from real-world clinical settings. Phenotypic networks were constructed utilizing clinical data and protein-protein interaction networks. Machine learning techniques, including convolutional neural networks, Dijkstra's algorithm, and gene ontology semantic similarity, were employed to quantify clinical and biological features within the network. The resulting features were then utilized to train a K-nearest neighbor model, yielding 23 symptoms, 41 association rules, and 61 hub genes across the three types of plaques studied, achieving an area under the curve of 92.5%. Weighted correlation network analysis and pathway enrichment were subsequently utilized to identify lipid status-related genes and inflammation-associated pathways that could help explain the differences in plaque properties. To confirm the validity of the network graph model, we conducted coexpression analysis of the hub genes to evaluate their potential diagnostic value. Additionally, we investigated immune cell infiltration, examined the correlations between hub genes and immune cells, and validated the reliability of the identified biological pathways. By integrating clinical data and molecular network information, this biomedical knowledge graph model effectively elucidated the potential molecular mechanisms that collude symptoms, diseases, and molecules.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] Predicting Clinical Remission of Chronic Urticaria Using Random Survival Forests: Machine Learning Applied to Real-World Data
    Pivneva, Irina
    Balp, Maria-Magdalena
    Geissbuhler, Yvonne
    Severin, Thomas
    Smeets, Serge
    Signorovitch, James
    Royer, Jimmy
    Liang, Yawen
    Cornwall, Tom
    Pan, Jutong
    Danyliv, Andrii
    McKenna, Sarah Jane
    Marsland, Alexander M.
    Soong, Weily
    DERMATOLOGY AND THERAPY, 2022, 12 (12) : 2747 - 2763
  • [42] Exploring the potential of thyroid hormones to predict clinical improvements in depressive patients: A machine learning analysis of the real-world based study.
    Qiao, Dan
    Liu, Huishan
    Zhang, Xuemin
    Lei, Lei
    Sun, Ning
    Yang, Chunxia
    Li, Gaizhi
    Guo, Meng
    Zhang, Yu
    Zhang, Kerang
    Liu, Zhifen
    JOURNAL OF AFFECTIVE DISORDERS, 2022, 299 : 159 - 165
  • [43] Coupling synthetic and real-world data for a deep learning-based segmentation process of 4D flow MRI
    Garzia, Simone
    Scarpolini, Martino Andrea
    Mazzoli, Marilena
    Capellini, Katia
    Monteleone, Angelo
    Cademartiri, Filippo
    Positano, Vincenzo
    Celi, Simona
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2023, 242
  • [44] Predicting obstructive coronary artery disease using carotid ultrasound parameters: A nomogram from a large real-world clinical data
    Wu, Na
    Chen, Xinghua
    Li, Mingyang
    Qu, Xiaolong
    Li, Yueli
    Xie, Weijia
    Wu, Long
    Xiang, Ying
    Li, Yafei
    Zhong, Li
    EUROPEAN JOURNAL OF CLINICAL INVESTIGATION, 2018, 48 (08)
  • [46] Machine learning-based coronary artery calcium score predicted from clinical variables as a prognostic indicator in patients referred for invasive coronary angiography
    Jian, Wen
    Dong, Zhujun
    Shen, Xueqian
    Zheng, Ze
    Wu, Zheng
    Shi, Yuchen
    Han, Yingchun
    Du, Jie
    Liu, Jinghua
    EUROPEAN RADIOLOGY, 2024, 34 (09) : 5633 - 5643
  • [47] CAUSAL MACHINE LEARNING FOR ASSESSING PNEUMOCOCCAL VACCINE EFFECTIVENESS: INNOVATIONS IN REAL-WORLD DATA ANALYSIS AND CONFOUNDING PATHWAY ADJUSTMENT
    Wilson, A.
    Gregg, M.
    Streja, E.
    Alderden, J.
    Vanderpuye-Orgle, J.
    Roessner, M.
    VALUE IN HEALTH, 2023, 26 (12) : S422 - S423
  • [48] Exome sequence analysis identifies rare coding variants associated with a machine learning-based marker for coronary artery disease
    Petrazzini, Ben Omega
    Forrest, Iain S.
    Rocheleau, Ghislain
    Vy, Ha My T.
    Marquez-Luna, Carla
    Duffy, Aine
    Chen, Robert
    Park, Joshua K.
    Gibson, Kyle
    Goonewardena, Sascha N.
    Malick, Waqas A.
    Rosenson, Robert S.
    Jordan, Daniel M.
    Do, Ron
    NATURE GENETICS, 2024, 56 (07) : 1412 - 1419
  • [49] Machine Learning Based Monitoring of the Pneumatic Actuators' Behavior Through Signal Processing Using Real-World Data Set
    Kovacs, Tibor
    Ko, Andrea
    FUTURE DATA AND SECURITY ENGINEERING (FDSE 2019), 2019, 11814 : 33 - 44
  • [50] Machine- Learning Models Based on Real-World Data to Predict Rehospitalization or Death After Acute Myocardial Infarction
    Seegan, George
    O'Kelly, James
    Kalich, Bethany
    Shahabi, Ahva
    CIRCULATION, 2022, 146