Predicting hypertension onset from longitudinal electronic health records with deep learning

被引:11
|
作者
Datta, Suparno [1 ,2 ]
Morassi Sasso, Ariane [1 ,2 ]
Kiwit, Nina [1 ]
Bose, Subhronil [1 ]
Nadkarni, Girish [1 ,2 ,3 ]
Miotto, Riccardo [2 ,4 ]
Boettinger, Erwin P. [1 ,2 ,3 ,5 ]
机构
[1] Univ Potsdam, Hasso Plattner Inst, Digital Hlth Ctr, Potsdam, Germany
[2] Icahn Sch Med Mt Sinai, Hasso Plattner Inst Digital Hlth Mt Sinai, New York, NY 10029 USA
[3] Icahn Sch Med Mt Sinai, Dept Med, New York, NY 10029 USA
[4] Icahn Sch Med Mt Sinai, Dept Genet & Genom Sci, New York, NY 10029 USA
[5] Icahn Sch Med Mt Sinai, Windreich Dept Artificial Intelligence & Human Hl, New York, NY 10029 USA
基金
美国国家卫生研究院;
关键词
machine learning; electronic health records; deep learning; hypertension; HIGH BLOOD-PRESSURE; INCIDENT HYPERTENSION; AMERICAN-COLLEGE; RISK; PREVENTION; MANAGEMENT; ADULTS;
D O I
10.1093/jamiaopen/ooac097
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Objective: Hypertension has long been recognized as one of the most important predisposing factors for cardiovascular diseases and mortality. In recent years, machine learning methods have shown potential in diagnostic and predictive approaches in chronic diseases. Electronic health records (EHRs) have emerged as a reliable source of longitudinal data. The aim of this study is to predict the onset of hypertension using modern deep learning (DL) architectures, specifically long short-term memory (LSTM) networks, and longitudinal EHRs. Materials and Methods: We compare this approach to the best performing models reported from previous works, particularly XGboost, applied to aggregated features. Our work is based on data from 233 895 adult patients from a large health system in the United States. We divided our population into 2 distinct longitudinal datasets based on the diagnosis date. To ensure generalization to unseen data, we trained our models on the first dataset (dataset A "train and validation") using cross-validation, and then applied the models to a second dataset (dataset B "test") to assess their performance. We also experimented with 2 different time-windows before the onset of hypertension and evaluated the impact on model performance. Results: With the LSTM network, we were able to achieve an area under the receiver operating characteristic curve value of 0.98 in the "train and validation" dataset A and 0.94 in the "test" dataset B for a prediction time window of 1 year. Lipid disorders, type 2 diabetes, and renal disorders are found to be associated with incident hypertension. Conclusion: These findings show that DL models based on temporal EHR data can improve the identification of patients at high risk of hypertension and corresponding driving factors. In the long term, this work may support identifying individuals who are at high risk for developing hypertension and facilitate earlier intervention to prevent the future development of hypertension.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] LEARNING HEALTHCARE DELIVERY NETWORK WITH LONGITUDINAL ELECTRONIC HEALTH RECORDS DATA
    Sun, Jiehuan
    Liao, Katherine P.
    Cai, Tianxi
    ANNALS OF APPLIED STATISTICS, 2024, 18 (01): : 882 - 898
  • [32] Predicting changes in hypertension control using electronic health records from a chronic disease management program
    Sun, Jimeng
    McNaughton, Candace D.
    Zhang, Ping
    Perer, Adam
    Gkoulalas-Divanis, Aris
    Denny, Joshua C.
    Kirby, Jacqueline
    Lasko, Thomas
    Saip, Alexander
    Malin, Bradley A.
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2014, 21 (02) : 337 - 344
  • [33] Multiobjective Ensemble Deep Learning for Predicting Outcome After Lung Cancer Radiotherapy Using Electronic Health Records
    Wang, R.
    Weng, Y.
    Zhou, Z.
    Chen, L.
    Wang, J.
    MEDICAL PHYSICS, 2019, 46 (06) : E283 - E284
  • [34] Machine Learning for Personalized Medicine: Predicting Primary Myocardial Infarction from Electronic Health Records
    Weiss, Jeremy C.
    Natarajan, Sriraam
    Peissig, Peggy L.
    McCarty, Catherine A.
    Page, David
    AI MAGAZINE, 2012, 33 (04) : 33 - 45
  • [35] Deep learning detects and visualizes bleeding events in electronic health records
    Pedersen, Jannik S.
    Laursen, Martin S.
    Savarimuthu, Thiusius Rajeeth
    Hansen, Rasmus Sogaard
    Alnor, Anne Bryde
    Bjerre, Kristian Voss
    Kjaer, Ina Mathilde
    Gils, Charlotte
    Thorsen, Anne-Sofie Faarvang
    Andersen, Eline Sandvig
    Nielsen, Cathrine Brodsgaard
    Andersen, Lou-Ann Christensen
    Andreas, Soren
    Vinholt, Pernille Just
    RESEARCH AND PRACTICE IN THROMBOSIS AND HAEMOSTASIS, 2021, 5 (04)
  • [36] A Novel Deep Similarity Learning Approach to Electronic Health Records Data
    Gupta, Vagisha
    Sachdeva, Shelly
    Bhalla, Subhash
    IEEE ACCESS, 2020, 8 : 209278 - 209295
  • [37] Predicting Acute Graft-Versus-Host Disease Using Machine Learning and Longitudinal Vital Sign Data From Electronic Health Records
    Tang, Shengpu
    Chappell, Grant T.
    Mazzoli, Amanda
    Tewari, Muneesh
    Choi, Sung Won
    Wiens, Jenna
    JCO CLINICAL CANCER INFORMATICS, 2020, 4 : 128 - 135
  • [38] Deep learning for electronic health records: A comparative review of multiple deep neural architectures
    Solares, Jose Roberto Ayala
    Raimondi, Francesca Elisa Diletta
    Zhu, Yajie
    Rahimian, Fatemeh
    Canoy, Dexter
    Tran, Jenny
    Gomes, Ana Catarina Pinho
    Payberah, Amir H.
    Zottoli, Mariagrazia
    Nazarzadeh, Milad
    Conrad, Nathalie
    Rahimi, Kazem
    Salimi-Khorshidi, Gholamreza
    JOURNAL OF BIOMEDICAL INFORMATICS, 2020, 101
  • [39] Deep representation learning of patient data from Electronic Health Records (EHR): A systematic review
    Si, Yuqi
    Du, Jingcheng
    Li, Zhao
    Jiang, Xiaoqian
    Miller, Timothy
    Wang, Fei
    Zheng, W. Jim
    Roberts, Kirk
    JOURNAL OF BIOMEDICAL INFORMATICS, 2021, 115
  • [40] Deep learning-based prediction of Clostridioides difficile infection caused by antibiotics using longitudinal electronic health records
    Kim, Junmo
    Kim, Joo Seong
    Kim, Sae-Hoon
    Yoo, Sooyoung
    Lee, Jun Kyu
    Kim, Kwangsoo
    NPJ DIGITAL MEDICINE, 2024, 7 (01):