Predicting hypertension onset from longitudinal electronic health records with deep learning

被引：11

作者：

Datta, Suparno ^{[1
,2
]}

Morassi Sasso, Ariane ^{[1
,2
]}

Kiwit, Nina ^{[1
]}

Bose, Subhronil ^{[1
]}

Nadkarni, Girish ^{[1
,2
,3
]}

Miotto, Riccardo ^{[2
,4
]}

Boettinger, Erwin P. ^{[1
,2
,3
,5
]}

机构：

[1] Univ Potsdam, Hasso Plattner Inst, Digital Hlth Ctr, Potsdam, Germany

[2] Icahn Sch Med Mt Sinai, Hasso Plattner Inst Digital Hlth Mt Sinai, New York, NY 10029 USA

[3] Icahn Sch Med Mt Sinai, Dept Med, New York, NY 10029 USA

[4] Icahn Sch Med Mt Sinai, Dept Genet & Genom Sci, New York, NY 10029 USA

[5] Icahn Sch Med Mt Sinai, Windreich Dept Artificial Intelligence & Human Hl, New York, NY 10029 USA

来源：

JAMIA OPEN | 2022年 / 5卷 / 04期

基金：

美国国家卫生研究院;

关键词：

machine learning; electronic health records; deep learning; hypertension; HIGH BLOOD-PRESSURE; INCIDENT HYPERTENSION; AMERICAN-COLLEGE; RISK; PREVENTION; MANAGEMENT; ADULTS;

D O I：

10.1093/jamiaopen/ooac097

中图分类号：

R19 [保健组织与事业（卫生事业管理）];

学科分类号：

摘要：

Objective: Hypertension has long been recognized as one of the most important predisposing factors for cardiovascular diseases and mortality. In recent years, machine learning methods have shown potential in diagnostic and predictive approaches in chronic diseases. Electronic health records (EHRs) have emerged as a reliable source of longitudinal data. The aim of this study is to predict the onset of hypertension using modern deep learning (DL) architectures, specifically long short-term memory (LSTM) networks, and longitudinal EHRs. Materials and Methods: We compare this approach to the best performing models reported from previous works, particularly XGboost, applied to aggregated features. Our work is based on data from 233 895 adult patients from a large health system in the United States. We divided our population into 2 distinct longitudinal datasets based on the diagnosis date. To ensure generalization to unseen data, we trained our models on the first dataset (dataset A "train and validation") using cross-validation, and then applied the models to a second dataset (dataset B "test") to assess their performance. We also experimented with 2 different time-windows before the onset of hypertension and evaluated the impact on model performance. Results: With the LSTM network, we were able to achieve an area under the receiver operating characteristic curve value of 0.98 in the "train and validation" dataset A and 0.94 in the "test" dataset B for a prediction time window of 1 year. Lipid disorders, type 2 diabetes, and renal disorders are found to be associated with incident hypertension. Conclusion: These findings show that DL models based on temporal EHR data can improve the identification of patients at high risk of hypertension and corresponding driving factors. In the long term, this work may support identifying individuals who are at high risk for developing hypertension and facilitate earlier intervention to prevent the future development of hypertension.

引用

页数：10

共 50 条

[21] Predicting sequenced dental treatment plans from electronic dental records using deep learning
Chen, Haifan
Liu, Pufan
Chen, Zhaoxing
Chen, Qingxiao
Wen, Zaiwen
Xie, Ziqing
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2024, 147
[22] Use of Attention Maps to Enrich Discriminability in Deep Learning Prediction Models Using Longitudinal Data from Electronic Health Records
Carrasco-Ribelles, Lucia A.
Cabrera-Bean, Margarita
Llanes-Jurado, Jose
Violan, Concepcion
APPLIED SCIENCES-BASEL, 2025, 15 (01):
[23] Domain Knowledge Guided Deep Learning with Electronic Health Records
Yin, Changchang
Zhao, Rongjian
Qian, Buyue
Lv, Xin
Zhang, Ping
2019 19TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2019), 2019, : 738 - 747
[24] Learning Diagnosis from Electronic Health Records
Barbantan, Ioana
Potolea, Rodica
KDIR: PROCEEDINGS OF THE 8TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT - VOL. 1, 2016, : 344 - 351
[25] Readmission prediction using deep learning on electronic health records
Ashfaq, Awais
Sant'Anna, Anita
Lingman, Markus
Nowaczyk, Slawomir
JOURNAL OF BIOMEDICAL INFORMATICS, 2019, 97
[26] Multimodal deep learning for predicting in-hospital mortality in heart failure patients using longitudinal chest X-rays and electronic health records
Li, Dengao
Xing, Wen
Zhao, Jumin
Shi, Changcheng
Wang, Fei
INTERNATIONAL JOURNAL OF CARDIOVASCULAR IMAGING, 2025, 41 (03): : 427 - 440
[27] Combining deep learning with token selection for patient phenotyping from electronic health records
Yang, Zhen
Dehmer, Matthias
Yli-Harja, Olli
Emmert-Streib, Frank
SCIENTIFIC REPORTS, 2020, 10 (01)
[28] Deep Learning with Heterogeneous Graph Embeddings for Mortality Prediction from Electronic Health Records
Wanyan, Tingyi
Honarvar, Hossein
Azad, Ariful
Ding, Ying
Glicksberg, Benjamin S.
DATA INTELLIGENCE, 2021, 3 (03) : 329 - 339
[29] Deep Learning with Heterogeneous Graph Embeddings for Mortality Prediction from Electronic Health Records
Tingyi Wanyan
Hossein Honarvar
Ariful Azad
Ying Ding
Benjamin SGlicksberg
Data Intelligence, 2021, 3 (03) : 329 - 339
[30] Combining deep learning with token selection for patient phenotyping from electronic health records
Zhen Yang
Matthias Dehmer
Olli Yli-Harja
Frank Emmert-Streib
Scientific Reports, 10

← 1 2 3 4 5 →