Recurrent neural network models (CovRNN) for predicting outcomes of patients with COVID-19 on admission to hospital: model development and validation using electronic health record data

Cited by: 33

Authors:

Rasmy, Laila ^{[1]}

Nigo, Masayuki ^{[2]}

Kannadath, Bijun Sai ^{[4]}

Xie, Ziqian ^{[1]}

Mao, Bingyu ^{[1]}

Patel, Khush ^{[1]}

Zhou, Yujia ^{[1]}

Zhang, Wanheng ^{[3]}

Ross, Angela ^{[1]}

Xu, Hua ^{[1]}

Zhi, Degui ^{[1]}

Affiliations:

[1] Univ Texas Hlth Sci Ctr Houston, Sch Biomed Informat, Houston, TX 77030 USA

[2] Univ Texas Hlth Sci Ctr Houston, McGovern Med Sch, Houston, TX 77030 USA

[3] Univ Texas Hlth Sci Ctr Houston, Sch Publ Hlth, Houston, TX USA

[4] Univ Arizona, Coll Med, Phoenix, AZ USA

Source:

LANCET DIGITAL HEALTH | 2022 / Volume 4 / Issue 06

Keywords:

DOI:

10.1016/S2589-7500(22)00049-8

Chinese Library Classification (CLC) number:

R-058

Subject classification number:

Abstract:

Background: Predicting outcomes of patients with COVID-19 at an early stage is crucial for optimised clinical care and resource management, especially during a pandemic. Although multiple machine learning models have been proposed to address this issue, because of their requirements for extensive data preprocessing and feature engineering, they have not been validated or implemented outside of their original study site. Therefore, we aimed to develop accurate and transferrable predictive models of outcomes on hospital admission for patients with COVID-19.

Methods: In this study, we developed recurrent neural network-based models (CovRNN) to predict the outcomes of patients with COVID-19 by use of available electronic health record data on admission to hospital, without the need for specific feature selection or missing data imputation. CovRNN was designed to predict three outcomes: in-hospital mortality, need for mechanical ventilation, and prolonged hospital stay (>7 days). For in-hospital mortality and mechanical ventilation, CovRNN produced time-to-event risk scores (survival prediction; evaluated by the concordance index) and all-time risk scores (binary prediction; area under the receiver operating characteristic curve [AUROC] was the main metric); we only trained a binary classification model for prolonged hospital stay. For binary classification tasks, we compared CovRNN against traditional machine learning algorithms: logistic regression and light gradient boost machine. Our models were trained and validated on the heterogeneous, deidentified data of 247 960 patients with COVID-19 from 87 US health-care systems derived from the Cerner Real-World COVID-19 Q3 Dataset up to September 2020. We held out the data of 4175 patients from two hospitals for external validation. The remaining 243 785 patients from the 85 health systems were grouped into training (n=170 626), validation (n=24 378), and multi-hospital test (n=48 781) sets. Model performance was evaluated in the multi-hospital test set. The transferability of CovRNN was externally validated by use of deidentified data from 36 140 patients derived from the US-based Optum deidentified COVID-19 electronic health record dataset (version 1015; from January, 2007, to Oct 15, 2020). Exact dates of data extraction were masked by the databases to ensure patient data safety.

Findings: CovRNN binary models achieved AUROCs of 93.0% (95% CI 92.6-93.4) for the prediction of in-hospital mortality, 92.9% (92.6-93.2) for the prediction of mechanical ventilation, and 86.5% (86.2-86.9) for the prediction of a prolonged hospital stay, outperforming light gradient boost machine and logistic regression algorithms. External validation confirmed AUROCs in similar ranges (91.3-97.0% for in-hospital mortality prediction, 91.5-96.0% for the prediction of mechanical ventilation, and 81.0-88.3% for the prediction of prolonged hospital stay). For survival prediction, CovRNN achieved a concordance index of 86.0% (95% CI 85.1-86.9) for in-hospital mortality and 92.6% (92.2-93.0) for mechanical ventilation.

Interpretation: Trained on a large, heterogeneous, real-world dataset, our CovRNN models showed high prediction accuracy and transferability through consistently good performances on multiple external datasets. Our results show the feasibility of a COVID-19 predictive model that delivers high accuracy without the need for complex feature engineering.

Copyright (C) 2022 The Author(s). Published by Elsevier Ltd.
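The abstract reports two evaluation metrics: AUROC for the binary predictions and the concordance index for the time-to-event predictions. As a rough illustration of what these numbers measure, the following minimal Python sketch computes both on made-up toy data. It is not the authors' code; the function names `auroc` and `concordance_index` and all data values are hypothetical, and the concordance index shown is a simplified Harrell-style variant that skips pairs with tied times.

```python
from itertools import combinations


def auroc(labels, scores):
    """Probability that a random positive case scores higher than a
    random negative case; tied scores count as half a win."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


def concordance_index(times, events, risks):
    """Simplified Harrell-style C-index (assumption: tied times are skipped).
    A pair is comparable when the patient with the shorter observed time
    had the event; it is concordant when that patient has the higher risk."""
    concordant = 0.0
    comparable = 0
    for i, j in combinations(range(len(times)), 2):
        # a is the patient with the shorter observed time
        a, b = (i, j) if times[i] < times[j] else (j, i)
        if times[a] == times[b] or not events[a]:
            continue  # tied times, or the earlier patient was censored
        comparable += 1
        if risks[a] > risks[b]:
            concordant += 1.0
        elif risks[a] == risks[b]:
            concordant += 0.5
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


# Toy data, invented purely for illustration.
toy_auroc = auroc([0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8])
toy_cindex = concordance_index(
    times=[2, 5, 7, 9],          # days to event or censoring
    events=[1, 0, 1, 1],         # 1 = event observed, 0 = censored
    risks=[0.9, 0.3, 0.2, 0.6],  # model risk scores
)
print(toy_auroc, toy_cindex)
```

Both metrics are rank-based: they depend only on the ordering of the risk scores, not on their calibration, which is one reason they are commonly used to compare models across sites.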

Citation

Pages: E415-E425

Page count: 11

50 items in total

[21] An Artificial Intelligence Model to Predict the Mortality of COVID-19 Patients at Hospital Admission Time Using Routine Blood Samples: Development and Validation of an Ensemble Model
Ko, Hoon
Chung, Heewon
Kang, Wu Seong
Park, Chul
Kim, Do Wan
Kim, Seong Eun
Chung, Chi Ryang
Ko, Ryoung Eun
Lee, Hooseok
Seo, Jae Ho
Choi, Tae-Young
Jaimes, Rafael
Kim, Kyung Won
Lee, Jinseok
JOURNAL OF MEDICAL INTERNET RESEARCH, 2020, 22 (12)
[22] Development and validation of a nomogram using on admission routine laboratory parameters to predict in-hospital survival of patients with COVID-19
Chen, Hao
Chen, Rudong
Yang, Hongkuan
Wang, Junhong
Hou, Yuyang
Hu, Wei
Yu, Jiasheng
Li, Hua
JOURNAL OF MEDICAL VIROLOGY, 2021, 93 (04) : 2332 - 2339
[23] Development and validation of predictive models for COVID-19 outcomes in a safety-net hospital population
Hao, Boran
Hu, Yang
Sotudian, Shahabeddin
Zad, Zahra
Adams, William G.
Assoumou, Sabrina A.
Hsu, Heather
Mishuris, Rebecca G.
Paschalidis, Ioannis C.
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2022, 29 (07) : 1253 - 1262
[24] Predicting Risk of Hospital Admission in Patients With Suspected COVID-19 in a Community Setting: Protocol for Development and Validation of a Multivariate Risk Prediction Tool
Espinosa-Gonzalez, Ana Belen
Neves, Ana Luisa
Fiorentino, Francesca
Prociuk, Denys
Husain, Laiba
Ramtale, Sonny Christian
Mi, Emma
Mi, Ella
Macartney, Jack
Anand, Sneha N.
Sherlock, Julian
Saravanakumar, Kavitha
Mayer, Erik
de Lusignan, Simon
Greenhalgh, Trisha
Delaney, Brendan C.
JMIR RESEARCH PROTOCOLS, 2021, 10 (05)
[25] Development and validation of multivariable prediction models for adverse COVID-19 outcomes in patients with IBD
Sperger, John
Shah, Kushal S.
Lu, Minxin
Zhang, Xian
Ungaro, Ryan C.
Brenner, Erica J.
Agrawal, Manasi
Colombel, Jean-Frederic
Kappelman, Michael D.
Kosorok, Michael R.
BMJ OPEN, 2021, 11 (11)
[27] Development and validation of prognostic model for predicting mortality of COVID-19 patients in Wuhan, China
Mei, Qi
Wang, Amanda Y.
Bryant, Amy
Yang, Yang
Li, Ming
Wang, Fei
Zhao, Jia Wei
Ma, Ke
Wu, Liang
Chen, Huawen
Luo, Jinlong
Du, Shangming
Halfter, Kathrin
Li, Yong
Kurts, Christian
Hu, Guangyuan
Yuan, Xianglin
Li, Jian
SCIENTIFIC REPORTS, 2020, 10 (01)
[28] Development and validation of a machine learning model predicting illness trajectory and hospital utilization of COVID-19 patients: A nationwide study
Roimi, Michael
Gutman, Rom
Somer, Jonathan
Ben Arie, Asaf
Calman, Ido
Bar-Lavie, Yaron
Gelbshtein, Udi
Liverant-Taub, Sigal
Ziv, Arnona
Eytan, Danny
Gorfine, Malka
Shalit, Uri
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2021, 28 (06) : 1188 - 1196
[29] An ordinal severity scale for COVID-19 retrospective studies using Electronic Health Record data
Khodaverdi, Maryam
Price, Bradley S.
Porterfield, J. Zachary
Bunnell, H. Timothy
Vest, Michael T.
Anzalone, Alfred Jerrod
Harper, Jeremy
Kimble, Wes D.
Moradi, Hamidreza
Hendricks, Brian
Santangelo, Susan L.
Hodder, Sally L.
JAMIA OPEN, 2022, 5 (03)
[30] Multitask Learning With Recurrent Neural Networks for Acute Respiratory Distress Syndrome Prediction Using Only Electronic Health Record Data: Model Development and Validation Study
Lam, Carson
Thapa, Rahul
Maharjan, Jenish
Rahmani, Keyvan
Tso, Chak Foon
Singh, Navan Preet
Chetty, Satish Casie
Mao, Qingqing
JMIR MEDICAL INFORMATICS, 2022, 10 (06)
