Unsupervised Learning to Subphenotype Heart Failure Patients from Electronic Health Records

Authors
Hackl, Melanie [1]
Datta, Suparno [1, 2]
Miotto, Riccardo [2]
Bottinger, Erwin [1, 2]
Affiliations
[1] Univ Potsdam, Hasso Plattner Inst, Digital Hlth Ctr, Potsdam, Germany
[2] Icahn Sch Med Mt Sinai, Hasso Plattner Inst Digital Hlth Mt Sinai, New York, NY USA
Keywords
Unsupervised learning; Electronic health records; Heart failure
DOI
10.1007/978-3-030-77211-6_24
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Heart failure (HF) is a deadly disease whose prevalence is slowly increasing. Subtypes of HF are currently determined mostly by the ejection fraction (EF). In this work, we search for novel subgroups of heart failure with a completely data-driven approach: clustering patients based on their electronic health records (EHRs). Using a validated phenotyping algorithm, we identified 14,334 adult patients with heart failure in our database. We derived patient embeddings using two strategies: one processes aggregated clinical features with principal component analysis (PCA) and uniform manifold approximation and projection (UMAP), and the other learns embeddings from the sequence of medical events with a long short-term memory (LSTM) autoencoder. We then evaluated different clustering strategies, such as k-means and agglomerative hierarchical clustering, to derive the most informative subtypes. The results were compared using metrics such as the silhouette coefficient and by comparing outcomes such as hospitalization and EF between the clusters. In the most promising result, we identified three subclusters using the aggregated-data approach with UMAP for dimensionality reduction and k-means for clustering. Patients in cluster 1 had the lowest number of hospital days and comorbidities, while patients in cluster 3 had a significantly higher number of hospital days together with a higher prevalence of comorbidities such as chronic kidney disease and atrial fibrillation. Patients in cluster 2 had a high prevalence of drug allergies in their medical history.
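The clustering and evaluation step described in the abstract (k-means on low-dimensional patient embeddings, scored with the silhouette coefficient) can be illustrated with a minimal sketch. This is not the authors' pipeline: the 2-D points are synthetic stand-ins for embeddings (no EHR data, PCA, UMAP, or LSTM autoencoder is involved), and the helper names `make_blobs`, `init_centers`, `kmeans`, and `silhouette`, as well as the farthest-point initialization, are assumptions chosen to keep the example deterministic and free of third-party dependencies.

```python
import math
import random


def make_blobs(seed=0, per_cluster=30):
    """Synthetic 2-D 'embeddings': three well-separated Gaussian blobs (toy data)."""
    rng = random.Random(seed)
    centers = [(0.0, 0.0), (10.0, 0.0), (0.0, 10.0)]
    return [(rng.gauss(cx, 0.5), rng.gauss(cy, 0.5))
            for cx, cy in centers for _ in range(per_cluster)]


def init_centers(points, k):
    """Deterministic farthest-point initialization (an assumption of this sketch)."""
    centers = [points[0]]
    while len(centers) < k:
        centers.append(max(points, key=lambda p: min(math.dist(p, c) for c in centers)))
    return centers


def kmeans(points, k, iters=100):
    """Lloyd's k-means; returns one cluster label per point."""
    centers = init_centers(points, k)
    labels = [0] * len(points)
    for _ in range(iters):
        # Assignment step: each point goes to its nearest center.
        labels = [min(range(k), key=lambda c: math.dist(p, centers[c])) for p in points]
        # Update step: each center moves to the mean of its members.
        new_centers = []
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                new_centers.append(tuple(sum(x) / len(members) for x in zip(*members)))
            else:
                new_centers.append(centers[c])  # keep an empty cluster's center in place
        if new_centers == centers:
            break
        centers = new_centers
    return labels


def silhouette(points, labels):
    """Mean silhouette coefficient, in [-1, 1]; assumes at least two clusters."""
    clusters = {}
    for p, lab in zip(points, labels):
        clusters.setdefault(lab, []).append(p)
    scores = []
    for p, lab in zip(points, labels):
        own = clusters[lab]
        if len(own) == 1:
            scores.append(0.0)  # usual convention for singleton clusters
            continue
        # a: mean distance to the other members of the point's own cluster.
        a = sum(math.dist(p, q) for q in own) / (len(own) - 1)
        # b: smallest mean distance to the members of any other cluster.
        b = min(sum(math.dist(p, q) for q in other) / len(other)
                for other_lab, other in clusters.items() if other_lab != lab)
        scores.append((b - a) / max(a, b))
    return sum(scores) / len(scores)


embeddings = make_blobs()
# Score several candidate cluster counts and keep the one with the best silhouette.
scores = {k: silhouette(embeddings, kmeans(embeddings, k)) for k in (2, 3, 4)}
best_k = max(scores, key=scores.get)
labels = kmeans(embeddings, best_k)
print(best_k, {k: round(s, 3) for k, s in scores.items()})
```

On real data one would more likely rely on library implementations (for example scikit-learn for PCA, k-means, and silhouette scoring, and umap-learn for UMAP); the hand-written versions here only make the logic of each step visible.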
Pages: 219-228
Page count: 10