Unsupervised Learning to Subphenotype Heart Failure Patients from Electronic Health Records

Authors
Hackl, Melanie [1]
Datta, Suparno [1, 2]
Miotto, Riccardo [2]
Bottinger, Erwin [1, 2]
Affiliations
[1] Univ Potsdam, Hasso Plattner Inst, Digital Hlth Ctr, Potsdam, Germany
[2] Icahn Sch Med Mt Sinai, Hasso Plattner Inst Digital Hlth Mt Sinai, New York, NY USA
Keywords
Unsupervised learning; Electronic health records; Heart failure
DOI
10.1007/978-3-030-77211-6_24
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification code
081104; 0812; 0835; 1405
Abstract
Heart failure (HF) is a deadly disease whose prevalence is slowly increasing. Its subtypes are currently determined mostly by the ejection fraction (EF). In this work, we search for novel subgroups of heart failure using a fully data-driven approach that clusters patients based on their electronic health records (EHRs). Using a validated phenotyping algorithm, we identified 14,334 adult patients with heart failure in our database. We derived patient embeddings using two strategies: one processes aggregated clinical features with principal component analysis (PCA) and uniform manifold approximation and projection (UMAP); the other learns embeddings from the sequence of medical events using a long short-term memory (LSTM) autoencoder. We then evaluated different clustering strategies, such as k-means and agglomerative hierarchical clustering, to derive the most informative subtypes. The results were compared using metrics such as the silhouette coefficient and by comparing outcomes such as hospitalization and EF between the clusters. In the most promising result, we identified three subclusters using the aggregated-data approach with UMAP for dimensionality reduction and k-means for clustering. Patients in cluster 1 had the lowest number of hospital days and comorbidities, while patients in cluster 3 had a significantly higher number of hospital days together with a higher prevalence of comorbidities such as chronic kidney disease and atrial fibrillation. Patients in cluster 2 had a high prevalence of drug allergies in their medical history.
Pages: 219-228
Number of pages: 10
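
The clustering and evaluation step described in the abstract (k-means, with the silhouette coefficient used to compare candidate cluster counts) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration, not the authors' implementation: the data are synthetic 2-D points standing in for patient embeddings, the farthest-first initialisation is a simplification chosen to keep the run deterministic, and the UMAP and LSTM autoencoder embedding steps are omitted.

```python
import math
import random


def dist(a, b):
    """Euclidean distance between two equal-length tuples."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def kmeans(points, k, n_iter=100):
    """Plain Lloyd's k-means; returns one cluster label per point."""
    # Farthest-first initialisation (an assumption of this sketch).
    centers = [points[0]]
    while len(centers) < k:
        centers.append(max(points, key=lambda p: min(dist(p, c) for c in centers)))
    labels = []
    for _ in range(n_iter):
        # Assignment step: each point goes to its nearest center.
        labels = [min(range(k), key=lambda c: dist(p, centers[c])) for p in points]
        # Update step: each center moves to the mean of its members.
        updated = []
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                updated.append(tuple(sum(col) / len(members) for col in zip(*members)))
            else:
                updated.append(centers[c])  # keep an empty cluster's center
        if updated == centers:
            break
        centers = updated
    return labels


def silhouette(points, labels):
    """Mean silhouette coefficient over all points (singletons score 0)."""
    scores = []
    for i, p in enumerate(points):
        by_cluster = {}
        for j, q in enumerate(points):
            if i != j:
                by_cluster.setdefault(labels[j], []).append(dist(p, q))
        own = by_cluster.get(labels[i])
        others = [sum(d) / len(d) for c, d in by_cluster.items() if c != labels[i]]
        if not own or not others:
            scores.append(0.0)
            continue
        a = sum(own) / len(own)  # mean distance within the point's own cluster
        b = min(others)          # mean distance to the nearest other cluster
        scores.append((b - a) / max(a, b))
    return sum(scores) / len(scores)


# Synthetic stand-in for patient embeddings: three well-separated 2-D groups.
rng = random.Random(42)
blob_centers = [(0.0, 0.0), (10.0, 0.0), (0.0, 10.0)]
points = [(rng.gauss(cx, 1.0), rng.gauss(cy, 1.0))
          for cx, cy in blob_centers for _ in range(40)]

# Compare candidate cluster counts by their silhouette coefficient.
scores = {k: silhouette(points, kmeans(points, k)) for k in range(2, 6)}
best_k = max(scores, key=scores.get)
best_labels = kmeans(points, best_k)
print("silhouette by k:", {k: round(s, 3) for k, s in scores.items()})
print("selected k:", best_k)
```

On real EHR-derived embeddings the groups are unlikely to be this cleanly separated, which is presumably why the abstract pairs the silhouette coefficient with a comparison of clinical outcomes between clusters.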