Unsupervised Learning to Subphenotype Heart Failure Patients from Electronic Health Records

Cited by: 0
Authors
Hackl, Melanie [1]
Datta, Suparno [1, 2]
Miotto, Riccardo [2]
Bottinger, Erwin [1, 2]
Affiliations
[1] Univ Potsdam, Hasso Plattner Inst, Digital Hlth Ctr, Potsdam, Germany
[2] Icahn Sch Med Mt Sinai, Hasso Plattner Inst Digital Hlth Mt Sinai, New York, NY USA
Keywords
Unsupervised learning; Electronic health records; Heart failure
DOI
10.1007/978-3-030-77211-6_24
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Heart failure (HF) is a deadly disease whose prevalence is slowly increasing. HF subtypes are currently determined mostly by ejection fraction (EF). In this work, we search for novel subgroups of heart failure with a fully data-driven approach, clustering patients based on their electronic health records (EHRs). Using a validated phenotyping algorithm, we identified 14,334 adult patients with heart failure in our database. We derived patient embeddings using two strategies: one processes aggregated clinical features with principal component analysis (PCA) and uniform manifold approximation and projection (UMAP); the other learns embeddings from the sequence of medical events with a long short-term memory (LSTM) autoencoder. We then evaluated clustering strategies, including k-means and agglomerative hierarchical clustering, to derive the most informative subtypes. Results were compared using metrics such as the silhouette coefficient and by comparing outcomes such as hospitalization and EF between clusters. In the most promising result, we identified 3 subclusters using the aggregated-data approach with UMAP for dimensionality reduction and k-means for clustering. Patients in cluster 1 had the lowest number of hospital days and comorbidities, while patients in cluster 3 had a significantly higher number of hospital days and a higher prevalence of comorbidities such as chronic kidney disease and atrial fibrillation. Patients in cluster 2 had a high prevalence of drug allergies in their medical history.
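The aggregated-feature branch described in the abstract (dimensionality reduction, then clustering, then comparison by silhouette coefficient) can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the data are synthetic blobs generated with scikit-learn rather than EHR features, only PCA is used for the reduction step (the UMAP step would need the separate umap-learn package), and the helper `best_k`, the feature count, and the range of cluster counts are all assumptions made for the example.

```python
from sklearn.cluster import AgglomerativeClustering, KMeans
from sklearn.datasets import make_blobs
from sklearn.decomposition import PCA
from sklearn.metrics import silhouette_score
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for an aggregated patient-by-feature matrix.
# ASSUMPTION: these are random blobs, not real EHR data.
X, _ = make_blobs(n_samples=600, n_features=20, centers=3,
                  cluster_std=2.0, random_state=0)

# Standardize the features, then reduce dimensionality with PCA.
# The abstract also applies UMAP; it is omitted here to keep the
# sketch dependent on scikit-learn only.
X_scaled = StandardScaler().fit_transform(X)
embedding = PCA(n_components=5, random_state=0).fit_transform(X_scaled)


def best_k(points, make_model, k_values):
    """Hypothetical helper: fit one model per k, score each by silhouette."""
    scores = {}
    for k in k_values:
        labels = make_model(k).fit_predict(points)
        scores[k] = silhouette_score(points, labels)
    # Return the k with the highest silhouette coefficient and all scores.
    return max(scores, key=scores.get), scores


k_values = range(2, 7)  # ASSUMPTION: candidate cluster counts for the demo
kmeans_k, kmeans_scores = best_k(
    embedding,
    lambda k: KMeans(n_clusters=k, n_init=10, random_state=0),
    k_values,
)
agglo_k, agglo_scores = best_k(
    embedding,
    lambda k: AgglomerativeClustering(n_clusters=k),
    k_values,
)

print("k-means: best k =", kmeans_k)
print("agglomerative: best k =", agglo_k)
```

A higher silhouette coefficient indicates tighter, better-separated clusters, which is why it can serve as one selection criterion among several; the abstract additionally compares clinical outcomes between clusters, which this sketch does not attempt.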
Pages: 219-228
Page count: 10