Recurrent neural network models (CovRNN) for predicting outcomes of patients with COVID-19 on admission to hospital: model development and validation using electronic health record data

被引:33
|
作者
Rasmy, Laila [1 ]
Nigo, Masayuki [2 ]
Kannadath, Bijun Sai [4 ]
Xie, Ziqian [1 ]
Mao, Bingyu [1 ]
Patel, Khush [1 ]
Zhou, Yujia [1 ]
Zhang, Wanheng [3 ]
Ross, Angela [1 ]
Xu, Hua [1 ]
Zhi, Degui [1 ]
机构
[1] Univ Texas Hlth Sci Ctr Houston, Sch Biomed Informat, Houston, TX 77030 USA
[2] Univ Texas Hlth Sci Ctr Houston, McGovern Med Sch, Houston, TX 77030 USA
[3] Univ Texas Hlth Sci Ctr Houston, Sch Publ Hlth, Houston, TX USA
[4] Univ Arizona, Coll Med, Phoenix, AZ USA
来源
LANCET DIGITAL HEALTH | 2022年 / 4卷 / 06期
关键词
D O I
10.1016/S2589-7500(22)00049-8
中图分类号
R-058 [];
学科分类号
摘要
Background Predicting outcomes of patients with COVID-19 at an early stage is crucial for optimised clinical care and resource management, especially during a pandemic. Although multiple machine learning models have been proposed to address this issue, because of their requirements for extensive data preprocessing and feature engineering, they have not been validated or implemented outside of their original study site. Therefore, we aimed to develop accurate and transferrable predictive models of outcomes on hospital admission for patients with COVID-19. Methods In this study, we developed recurrent neural network-based models (CovRNN) to predict the outcomes of patients with COVID-19 by use of available electronic health record data on admission to hospital, without the need for specific feature selection or missing data imputation. CovRNN was designed to predict three outcomes: in-hospital mortality, need for mechanical ventilation, and prolonged hospital stay (>7 days). For in-hospital mortality and mechanical ventilation, CovRNN produced time-to-event risk scores (survival prediction; evaluated by the concordance index) and all-time risk scores (binary prediction; area under the receiver operating characteristic curve [AUROCJ was the main metric); we only trained a binary classification model for prolonged hospital stay. For binary classification tasks, we compared CovRNN against traditional machine learning algorithms: logistic regression and light gradient boost machine. Our models were trained and validated on the heterogeneous, deidentified data of 247 960 patients with COVID-19 from 87 US health-care systems derived from the Cerner Real-World COVID-19 Q3 Dataset up to September 2020. We held out the data of 4175 patients from two hospitals for external validation. The remaining 243 785 patients from the 85 health systems were grouped into training (n=170 626), validation (n=24378), and multihospital test (n=48 781) sets. Model performance was evaluated in the multi-hospital test set. The transferability of CovRNN was externally validated by use of deidentified data from 36 140 patients derived from the US-based Optum deidentified COVID-19 electronic health record dataset (version 1015; from January, 2007, to Oct 15, 2020). Exact dates of data extraction were masked by the databases to ensure patient data safety. Findings CovRNN binary models achieved AUROCs of 93.0% (95% CI 92.6-93.4) for the prediction of in-hospital mortality, 92.9% (92.6-93.2) for the prediction of mechanical ventilation, and 86.5% (86.2-86.9) for the prediction of a prolonged hospital stay, outperforming light gradient boost machine and logistic regression algorithms. External validation confirmed AUROCs in similar ranges (91.3-97-0% for in-hospital mortality prediction, 91.5-96.0% for the prediction of mechanical ventilation, and 81.0-88.3% for the prediction of prolonged hospital stay). For survival prediction, CovRNN achieved a concordance index of 86.0% (95% CI 85.1-86.9) for in-hospital mortality and 92.6% (92. 2-93-0) for mechanical ventilation. Interpretation Trained on a large, heterogeneous, real-world dataset, our CovRNN models showed high prediction accuracy and transferability through consistently good performances on multiple external datasets. Our results show the feasibility of a COVID-19 predictive model that delivers high accuracy without the need for complex feature engineering. Copyright (C) 2022 The Author(s). Published by Elsevier Ltd.
引用
收藏
页码:E415 / E425
页数:11
相关论文
共 50 条
  • [21] An Artificial Intelligence Model to Predict the Mortality of COVID-19 Patients at Hospital Admission Time Using Routine Blood Samples: Development and Validation of an Ensemble Model
    Ko, Hoon
    Chung, Heewon
    Kang, Wu Seong
    Park, Chul
    Kim, Do Wan
    Kim, Seong Eun
    Chung, Chi Ryang
    Ko, Ryoung Eun
    Lee, Hooseok
    Seo, Jae Ho
    Choi, Tae-Young
    Jaimes, Rafael
    Kim, Kyung Won
    Lee, Jinseok
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2020, 22 (12)
  • [22] Development and validation of a nomogram using on admission routine laboratory parameters to predict in-hospital survival of patients with COVID-19
    Chen, Hao
    Chen, Rudong
    Yang, Hongkuan
    Wang, Junhong
    Hou, Yuyang
    Hu, Wei
    Yu, Jiasheng
    Li, Hua
    JOURNAL OF MEDICAL VIROLOGY, 2021, 93 (04) : 2332 - 2339
  • [23] Development and validation of predictive models for COVID-19 outcomes in a safety-net hospital population
    Hao, Boran
    Hu, Yang
    Sotudian, Shahabeddin
    Zad, Zahra
    Adams, William G.
    Assoumou, Sabrina A.
    Hsu, Heather
    Mishuris, Rebecca G.
    Paschalidis, Ioannis C.
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2022, 29 (07) : 1253 - 1262
  • [24] Predicting Risk of Hospital Admission in Patients With Suspected COVID-19 in a Community Setting: Protocol for Development and Validation of a Multivariate Risk Prediction Tool
    Espinosa-Gonzalez, Ana Belen
    Neves, Ana Luisa
    Fiorentino, Francesca
    Prociuk, Denys
    Husain, Laiba
    Ramtale, Sonny Christian
    Mi, Emma
    Mi, Ella
    Macartney, Jack
    Anand, Sneha N.
    Sherlock, Julian
    Saravanakumar, Kavitha
    Mayer, Erik
    de Lusignan, Simon
    Greenhalgh, Trisha
    Delaney, Brendan C.
    JMIR RESEARCH PROTOCOLS, 2021, 10 (05):
  • [25] Development and validation of multivariable prediction models for adverse COVID-19 outcomes in patients with IBD
    Sperger, John
    Shah, Kushal S.
    Lu, Minxin
    Zhang, Xian
    Ungaro, Ryan C.
    Brenner, Erica J.
    Agrawal, Manasi
    Colombel, Jean-Frederic
    Kappelman, Michael D.
    Kosorok, Michael R.
    BMJ OPEN, 2021, 11 (11):
  • [26] Development and validation of prognostic model for predicting mortality of COVID-19 patients in Wuhan, China
    Qi Mei
    Amanda Y. Wang
    Amy Bryant
    Yang Yang
    Ming Li
    Fei Wang
    Jia Wei Zhao
    Ke Ma
    Liang Wu
    Huawen Chen
    Jinlong Luo
    Shangming Du
    Kathrin Halfter
    Yong Li
    Christian Kurts
    Guangyuan Hu
    Xianglin Yuan
    Jian Li
    Scientific Reports, 10
  • [27] Development and validation of prognostic model for predicting mortality of COVID-19 patients in Wuhan, China
    Mei, Qi
    Wang, Amanda Y.
    Bryant, Amy
    Yang, Yang
    Li, Ming
    Wang, Fei
    Zhao, Jia Wei
    Ma, Ke
    Wu, Liang
    Chen, Huawen
    Luo, Jinlong
    Du, Shangming
    Halfter, Kathrin
    Li, Yong
    Kurts, Christian
    Hu, Guangyuan
    Yuan, Xianglin
    Li, Jian
    SCIENTIFIC REPORTS, 2020, 10 (01)
  • [28] Development and validation of a machine learning model predicting illness trajectory and hospital utilization of COVID-19 patients: A nationwide study
    Roimi, Michael
    Gutman, Rom
    Somer, Jonathan
    Ben Arie, Asaf
    Calman, Ido
    Bar-Lavie, Yaron
    Gelbshtein, Udi
    Liverant-Taub, Sigal
    Ziv, Arnona
    Eytan, Danny
    Gorfine, Malka
    Shalit, Uri
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2021, 28 (06) : 1188 - 1196
  • [29] An ordinal severity scale for COVID-19 retrospective studies using Electronic Health Record data
    Khodaverdi, Maryam
    Price, Bradley S.
    Porterfield, J. Zachary
    Bunnell, H. Timothy
    Vest, Michael T.
    Anzalone, Alfred Jerrod
    Harper, Jeremy
    Kimble, Wes D.
    Moradi, Hamidreza
    Hendricks, Brian
    Santangelo, Susan L.
    Hodder, Sally L.
    JAMIA OPEN, 2022, 5 (03)
  • [30] Multitask Learning With Recurrent Neural Networks for Acute Respiratory Distress Syndrome Prediction Using Only Electronic Health Record Data: Model Development and Validation Study
    Lam, Carson
    Thapa, Rahul
    Maharjan, Jenish
    Rahmani, Keyvan
    Tso, Chak Foon
    Singh, Navan Preet
    Chetty, Satish Casie
    Mao, Qingqing
    JMIR MEDICAL INFORMATICS, 2022, 10 (06)