Unsupervised Learning to Subphenotype Heart Failure Patients from Electronic Health Records

被引:0
|
作者
Hackl, Melanie [1 ]
Datta, Suparno [1 ,2 ]
Miotto, Riccardo [2 ]
Bottinger, Erwin [1 ,2 ]
机构
[1] Univ Potsdam, Hasso Plattner Inst, Digital Hlth Ctr, Potsdam, Germany
[2] Icahn Sch Med Mt Sinai, Hasso Plattner Inst Digital Hlth Mt Sinai, New York, NY USA
关键词
Unsupervised learning; Electronic health records; Heart failure;
D O I
10.1007/978-3-030-77211-6_24
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Heart failure (HF) is a deadly disease and its prevalence is slowly increasing. The sub-types of HF are currently mostly determined by the so-called ejection fraction (EF). In this work, we try to find novel subgroups of heart failure following a complete data-driven approach of clustering patients based on their electronic health records (EHRs). Using a validated phenotyping algorithm we were able to identify 14,334 adult patients with heart failure in our database. We derived embeddings of patients using two different strategies, one processing aggregated clinical features using principal component analysis (PCA) and uniform manifold approximation and projection (UMAP), and one where we learn embeddings from the sequence of medical events using a long short-term memory (LSTM) autoencoder. Then we evaluated different clustering strategies like k-means and agglomerative hierarchical to derive the most informative subtypes. The results were compared based on different metrics such as silhouette coefficient and so on and also based on comparing outcomes such as hospitalization, EF etc. between the clusters. In the most promising result, we were able to identify 3 subclusters using the aggregated data approach in combination with UMAP as dimension reduction method and k-means as cluster method. Patients in cluster 1 had the lowest number of hospital days and comorbidities, while patients in cluster 3 had a significantly higher number of hospital days together with a higher prevalence of comorbidities such as chronic kidney disease and atrial fibrillation. Patients in cluster 2 had a high prevalence of drug allergies in their medical history.
引用
收藏
页码:219 / 228
页数:10
相关论文
共 50 条
  • [41] Learning a Health Knowledge Graph from Electronic Medical Records
    Maya Rotmensch
    Yoni Halpern
    Abdulhakim Tlimat
    Steven Horng
    David Sontag
    Scientific Reports, 7
  • [42] Using Unsupervised Machine Learning to Identify Subgroups Among Home Health Patients With Heart Failure Using Telehealth
    Bose, Eliezer
    Radhakrishnan, Kavita
    CIN-COMPUTERS INFORMATICS NURSING, 2018, 36 (05) : 242 - 248
  • [43] Electronic healthcare records and external outcome data for hospitalized patients with heart failure
    Zhongheng Zhang
    Linghong Cao
    Rangui Chen
    Yan Zhao
    Lukai Lv
    Ziyin Xu
    Ping Xu
    Scientific Data, 8
  • [44] Electronic healthcare records and external outcome data for hospitalized patients with heart failure
    Zhang, Zhongheng
    Cao, Linghong
    Chen, Rangui
    Zhao, Yan
    Lv, Lukai
    Xu, Ziyin
    Xu, Ping
    SCIENTIFIC DATA, 2021, 8 (01)
  • [45] Automated review of electronic health records to assess quality of care for outpatients with heart failure
    Baker, David W.
    Persell, Stephen D.
    Thompson, Jason A.
    Soman, Neilesh S.
    Burgner, Karen M.
    Liss, David
    Kmetik, Karen S.
    ANNALS OF INTERNAL MEDICINE, 2007, 146 (04) : 270 - 277
  • [46] Early detection of heart failure using in-patient longitudinal electronic health records
    Drozdov, Ignat
    Szubert, Benjamin
    Murphy, Clare
    Brooksbank, Katriona
    Lowe, David J.
    PLOS ONE, 2024, 19 (12):
  • [47] Phenotyping of heart failure with preserved ejection faction using health electronic records and echocardiography
    Pierre-Jean, M.
    Bouzille, G.
    L'official, G.
    Cuggia, M.
    Donal, E.
    EUROPEAN HEART JOURNAL, 2022, 43 : 819 - 819
  • [48] Identification of Heart Failure by Diagnoses and Cardiac Tests in a US Electronic Health Records Database
    Le, Hoa V.
    Truong, Chi T. L.
    Priest, Julie
    DiBello, Julia
    Averell, Carlyne
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2014, 23 : 394 - 394
  • [49] Prediction of left ventricular ejection fraction changes in heart failure patients using machine learning and electronic health records: a multi-site study
    Adekkanattu, Prakash
    Rasmussen, Luke V.
    Pacheco, Jennifer A.
    Kabariti, Joseph
    Stone, Daniel J.
    Yu, Yue
    Jiang, Guoqian
    Luo, Yuan
    Brandt, Pascal S.
    Xu, Zhenxing
    Vekaria, Veer
    Xu, Jie
    Wang, Fei
    Benda, Natalie C.
    Peng, Yifan
    Goyal, Parag
    Ahmad, Faraz S.
    Pathak, Jyotishman
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [50] Prediction of left ventricular ejection fraction changes in heart failure patients using machine learning and electronic health records: a multi-site study
    Prakash Adekkanattu
    Luke V. Rasmussen
    Jennifer A. Pacheco
    Joseph Kabariti
    Daniel J. Stone
    Yue Yu
    Guoqian Jiang
    Yuan Luo
    Pascal S. Brandt
    Zhenxing Xu
    Veer Vekaria
    Jie Xu
    Fei Wang
    Natalie C. Benda
    Yifan Peng
    Parag Goyal
    Faraz S. Ahmad
    Jyotishman Pathak
    Scientific Reports, 13