Stratification of diabetes in the context of comorbidities, using representation learning and topological data analysis

Citations: 0
Authors
Malgorzata Wamil
Abdelaali Hassaine
Shishir Rao
Yikuan Li
Mohammad Mamouei
Dexter Canoy
Milad Nazarzadeh
Zeinab Bidel
Emma Copland
Kazem Rahimi
Gholamreza Salimi-Khorshidi
Affiliations
[1] University of Oxford, Deep Medicine, Oxford Martin School
[2] Mayo Clinic Healthcare, Nuffield Department of Women’s and Reproductive Health, Medical Science Division
[3] University of Oxford
DOI: not available
Abstract
Diabetes is a heterogeneous, multimorbid disorder with large variation in manifestations, trajectories, and outcomes. The aim of this study is to validate a novel machine learning method for phenotyping diabetes in the context of comorbidities. Data from 9967 multimorbid patients with a new diagnosis of diabetes were extracted from the Clinical Practice Research Datalink. First, embeddings corresponding to diabetes were learned using BEHRT, a transformer-based deep learning architecture. Next, topological data analysis (TDA) was carried out to test how different areas of the high-dimensional manifold correspond to different risk profiles. The following endpoints were considered when profiling risk trajectories: major adverse cardiovascular events (MACE), coronary artery disease (CAD), stroke (CVA), heart failure (HF), renal failure (RF), diabetic neuropathy, peripheral arterial disease, reduced visual acuity, and all-cause mortality. Kaplan-Meier curves were plotted for each derived phenotype. Finally, we tested whether adding TDA-derived features improved the performance of an established risk prediction model (QRISK3).

We identified four subgroups of patients with diabetes and divergent comorbidity patterns, which differed in their risk of future cardiovascular, renal, and other microvascular outcomes. Phenotype 1 (young with chronic inflammatory conditions) and phenotype 2 (young with CAD) included relatively younger patients than phenotype 3 (older with hypertension and renal disease) and phenotype 4 (older with previous CVA); the latter two subgroups had a higher frequency of pre-existing cardio-renal diseases. Within ten years of follow-up, 2592 patients (26%) experienced MACE, 2515 patients (25%) died, and 2020 patients (20%) suffered RF. Adding the TDA-derived phenotype and the distances to both extremities of the TDA graph raised the AUC of the QRISK3 model for predicting CV outcomes from 67.26% (CI 67.25–67.28%) to 67.67% (CI 67.66–67.69%). We confirmed the importance of accounting for multimorbidity when risk-stratifying a heterogeneous cohort of patients with a new diagnosis of diabetes. Our unsupervised machine learning method improved the prediction of clinical outcomes.
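The per-phenotype Kaplan-Meier step described in the abstract can be illustrated with a minimal sketch. This is not the authors' code: the function names `kaplan_meier` and `curves_by_phenotype` are hypothetical, the records are made-up toy values rather than study data, and the phenotype labels are assumed to have already been assigned by the TDA step.

```python
from collections import defaultdict


def kaplan_meier(durations, events):
    """Product-limit estimate of survival at each distinct event time.

    durations: follow-up time per patient (e.g. years since diagnosis).
    events: 1 if the endpoint occurred at that time, 0 if censored.
    Returns a list of (time, survival_probability) pairs.
    """
    deaths = defaultdict(int)   # endpoint events observed at each time
    removed = defaultdict(int)  # patients leaving the risk set at each time
    for t, e in zip(durations, events):
        removed[t] += 1
        if e:
            deaths[t] += 1

    at_risk = len(durations)
    survival = 1.0
    curve = []
    for t in sorted(removed):
        d = deaths.get(t, 0)
        if d:
            # Multiply by the conditional probability of surviving time t.
            survival *= 1.0 - d / at_risk
            curve.append((t, survival))
        at_risk -= removed[t]
    return curve


def curves_by_phenotype(records):
    """records: iterable of (phenotype_label, duration, event) tuples."""
    groups = defaultdict(lambda: ([], []))
    for label, t, e in records:
        groups[label][0].append(t)
        groups[label][1].append(e)
    return {label: kaplan_meier(ts, es) for label, (ts, es) in groups.items()}


# Toy, made-up records (NOT study data): (phenotype, years of follow-up, event).
toy = [
    (1, 2.0, 1), (1, 5.0, 0), (1, 7.0, 1), (1, 10.0, 0),
    (3, 1.0, 1), (3, 3.0, 1), (3, 4.0, 0), (3, 6.0, 1),
]
curves = curves_by_phenotype(toy)
for label in sorted(curves):
    print(label, curves[label])
```

In practice a survival library would normally be used instead, since it also provides confidence intervals and group comparison tests; the hand-written estimator above only shows how censored and observed events enter the curve.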
Related papers
50 in total
  • [1] Stratification of diabetes in the context of comorbidities, using representation learning and topological data analysis
    Wamil, Malgorzata
    Hassaine, Abdelaali
    Rao, Shishir
    Li, Yikuan
    Mamouei, Mohammad
    Canoy, Dexter
    Nazarzadeh, Milad
    Bidel, Zeinab
    Copland, Emma
    Rahimi, Kazem
    Salimi-Khorshidi, Gholamreza
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [2] Individualized Patient Risk Stratification Using Machine Learning and Topological Data Analysis
    Ng, Arnold C. T.
    Delgado, Victoria
    Bax, Jeroen J.
    JACC-CARDIOVASCULAR IMAGING, 2020, 13 (05) : 1133 - 1134
  • [3] High Dimensional Data Stream Clustering using Topological Representation Learning
    Ben-Fares, Maha
    Grozavu, Nistor
    Rastin, Parisa
    Holat, Pierre
    2022 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2022 : 1415 - 1422
  • [4] Unsupervised learning of topological phase diagram using topological data analysis
    Park, Sungjoon
    Hwang, Yoonseok
    Yang, Bohm-Jung
    PHYSICAL REVIEW B, 2022, 105 (19)
  • [5] Learning Representation for fMRI Data Analysis using Autoencoder
    Kamonsantiroj, Suwatchai
    Charoenvorakiat, Parinya
    Pipanmaekaporn, Luepol
    PROCEEDINGS 2016 5TH IIAI INTERNATIONAL CONGRESS ON ADVANCED APPLIED INFORMATICS IIAI-AAI 2016, 2016 : 560 - 565
  • [6] CONSISTENT MANIFOLD REPRESENTATION FOR TOPOLOGICAL DATA ANALYSIS
    Berry, Tyrus
    Sauer, Timothy
    FOUNDATIONS OF DATA SCIENCE, 2019, 1 (01) : 1 - 38
  • [7] Interpreting Deep Patient Stratification Models with Topological Data Analysis
    Jurek-Loughrey, Anna
    Gault, Richard
    Ahmaderaghi, Baharak
    Fahim, Muhammad
    Bai, Lu
    ADVANCES IN DIGITAL HEALTH AND MEDICAL BIOENGINEERING, VOL 1, EHB-2023, 2024, 109 : 563 - 574
  • [8] Topological data analysis and machine learning
    Leykam, Daniel
    Angelakis, Dimitris G.
    ADVANCES IN PHYSICS-X, 2023, 8 (01)
  • [9] Chatter Classification in Turning using Machine Learning and Topological Data Analysis
    Khasawneh, Firas A.
    Munch, Elizabeth
    Perea, Jose A.
    IFAC PAPERSONLINE, 2018, 51 (14) : 195 - 200
  • [10] Using topological data analysis and machine learning to predict customer churn
    Sagming, Marcel
    Heymann, Reolyn
    Visaya, Maria Vivien
    JOURNAL OF BIG DATA, 2024, 11 (01)