Predicting hypertension onset from longitudinal electronic health records with deep learning

被引:11
|
作者
Datta, Suparno [1 ,2 ]
Morassi Sasso, Ariane [1 ,2 ]
Kiwit, Nina [1 ]
Bose, Subhronil [1 ]
Nadkarni, Girish [1 ,2 ,3 ]
Miotto, Riccardo [2 ,4 ]
Boettinger, Erwin P. [1 ,2 ,3 ,5 ]
机构
[1] Univ Potsdam, Hasso Plattner Inst, Digital Hlth Ctr, Potsdam, Germany
[2] Icahn Sch Med Mt Sinai, Hasso Plattner Inst Digital Hlth Mt Sinai, New York, NY 10029 USA
[3] Icahn Sch Med Mt Sinai, Dept Med, New York, NY 10029 USA
[4] Icahn Sch Med Mt Sinai, Dept Genet & Genom Sci, New York, NY 10029 USA
[5] Icahn Sch Med Mt Sinai, Windreich Dept Artificial Intelligence & Human Hl, New York, NY 10029 USA
基金
美国国家卫生研究院;
关键词
machine learning; electronic health records; deep learning; hypertension; HIGH BLOOD-PRESSURE; INCIDENT HYPERTENSION; AMERICAN-COLLEGE; RISK; PREVENTION; MANAGEMENT; ADULTS;
D O I
10.1093/jamiaopen/ooac097
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Objective: Hypertension has long been recognized as one of the most important predisposing factors for cardiovascular diseases and mortality. In recent years, machine learning methods have shown potential in diagnostic and predictive approaches in chronic diseases. Electronic health records (EHRs) have emerged as a reliable source of longitudinal data. The aim of this study is to predict the onset of hypertension using modern deep learning (DL) architectures, specifically long short-term memory (LSTM) networks, and longitudinal EHRs. Materials and Methods: We compare this approach to the best performing models reported from previous works, particularly XGboost, applied to aggregated features. Our work is based on data from 233 895 adult patients from a large health system in the United States. We divided our population into 2 distinct longitudinal datasets based on the diagnosis date. To ensure generalization to unseen data, we trained our models on the first dataset (dataset A "train and validation") using cross-validation, and then applied the models to a second dataset (dataset B "test") to assess their performance. We also experimented with 2 different time-windows before the onset of hypertension and evaluated the impact on model performance. Results: With the LSTM network, we were able to achieve an area under the receiver operating characteristic curve value of 0.98 in the "train and validation" dataset A and 0.94 in the "test" dataset B for a prediction time window of 1 year. Lipid disorders, type 2 diabetes, and renal disorders are found to be associated with incident hypertension. Conclusion: These findings show that DL models based on temporal EHR data can improve the identification of patients at high risk of hypertension and corresponding driving factors. In the long term, this work may support identifying individuals who are at high risk for developing hypertension and facilitate earlier intervention to prevent the future development of hypertension.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Predicting the Risk of Inpatient Hypoglycemia With Machine Learning Using Electronic Health Records
    Ruan, Yue
    Bellot, Alexis
    Moysova, Zuzana
    Tan, Garry D.
    Lumb, Alistair
    Davies, Jim
    van der Schaar, Mihaela
    Rea, Rustam
    DIABETES CARE, 2020, 43 (07) : 1504 - 1511
  • [42] Risk prediction of heart diseases in patients with breast cancer: A deep learning approach with longitudinal electronic health records data
    Zhou, Sicheng
    Blaes, Anne
    Shenoy, Chetan
    Sun, Ju
    Zhang, Rui
    ISCIENCE, 2024, 27 (07)
  • [43] Predicting Cardiovascular Disease in Psychiatric Patients: Machine Learning with Electronic Health Records
    Bernstorff, M.
    Danielsen, A.
    Dinesen, S.
    EUROPEAN PSYCHIATRY, 2022, 65 : S678 - S678
  • [44] Predicting opioid overdose risk of patients with opioid prescriptions using electronic health records based on temporal deep learning
    Dong, Xinyu
    Deng, Jianyuan
    Hou, Wei
    Rashidian, Sina
    Rosenthal, Richard N.
    Saltz, Mary
    Saltz, Joel H.
    Wang, Fusheng
    JOURNAL OF BIOMEDICAL INFORMATICS, 2021, 116
  • [45] Predicting Hospital Readmission for Campylobacteriosis from Electronic Health Records: A Machine Learning and Text Mining Perspective
    Zhou, Shang-Ming
    Lyons, Ronan A.
    Rahman, Muhammad A.
    Holborow, Alexander
    Brophy, Sinead
    JOURNAL OF PERSONALIZED MEDICINE, 2022, 12 (01):
  • [46] Continual learning of longitudinal health records
    Armstrong, Jacob
    Clifton, David A.
    2022 IEEE-EMBS INTERNATIONAL CONFERENCE ON BIOMEDICAL AND HEALTH INFORMATICS (BHI) JOINTLY ORGANISED WITH THE IEEE-EMBS INTERNATIONAL CONFERENCE ON WEARABLE AND IMPLANTABLE BODY SENSOR NETWORKS (BSN'22), 2022,
  • [47] Predicting Breast Cancer by Applying Deep Learning to Linked Health Records and Mammograms
    Akselrod-Ballin, Ayelet
    Chorev, Michal
    Shoshan, Yoel
    Spiro, Adam
    Hazan, Alon
    Melamed, Roie
    Barkan, Ella
    Herzel, Esma
    Naor, Shaked
    Karavani, Ehud
    Koren, Gideon
    Goldscbmidt, Yaara
    Shalev, Varda
    Rosen-Zvi, Michal
    Guindy, Michal
    RADIOLOGY, 2019, 292 (02) : 331 - 342
  • [48] On the evaluation of synthetic longitudinal electronic health records
    Achterberg, Jim L.
    Haas, Marcel R.
    Spruit, Marco R.
    BMC MEDICAL RESEARCH METHODOLOGY, 2024, 24 (01)
  • [49] Predicting Glaucoma Development With Longitudinal Deep Learning Predictions From
    Lee, Terry
    Jammal, Alessandro A.
    Mariottoni, Eduardo B.
    Medeiros, Felipe A.
    AMERICAN JOURNAL OF OPHTHALMOLOGY, 2021, 225 : 86 - 94
  • [50] Predicting malnutrition from longitudinal patient trajectories with deep learning
    Jin, Boyang Tom
    Choi, Mi Hyun
    Moyer, Meagan F.
    Kim, David A.
    PLOS ONE, 2022, 17 (07):