Early Sepsis Prediction Using Ensemble Learning With Deep Features and Artificial Features Extracted From Clinical Electronic Health Records

被引:22
|
作者
He, Zhengling [1 ,2 ]
Du, Lidong [2 ,3 ]
Zhang, Pengfei [1 ,2 ]
Zhao, Rongjian [2 ,3 ]
Chen, Xianxiang [1 ,2 ,3 ]
Fang, Zhen [1 ,2 ,3 ]
机构
[1] Univ Chinese Acad Sci, Beijing, Peoples R China
[2] Chinese Acad Sci, Aerosp Informat Res Inst, State Key Lab Transducer Technol, Beijing, Peoples R China
[3] Chinese Acad Med Sci, Res Unit Personalized Management Chron Resp Dis, Beijing, Peoples R China
关键词
clinical electronic health records; ensemble learning; long short-term memory neural network; sepsis;
D O I
10.1097/CCM.0000000000004644
中图分类号
R4 [临床医学];
学科分类号
1002 ; 100602 ;
摘要
Objectives: Sepsis is caused by infection and subsequent overreaction of immune system and will severely threaten human life. The early prediction is important for the treatment of sepsis. This report aims to develop an early prediction method for sepsis 6 hours ahead on the basis of clinical electronic health records. Data Sources: Challenge data are released by PhysioNet/Computing in Cardiology Challenge 2019 and obtained from ICU patients in three separate hospital systems. Part of the data from two datasets, including 40,336 subjects, are publicly available, and the remaining are used as hidden test set. A normalized utility score defined by the organizing committee is used for model performance evaluation. Study Selection: The supervised machine learning is applied to tackle this challenge. Specifically, we establish the prediction model under the framework of ensemble learning by integrating the artificial features based on clinical prior knowledge of sepsis with deep features automatically extracted by long short-term memory neural network. Data Extraction: Forty clinical variables, including eight vital signs, 26 laboratory values, and six demographics, were measured and recorded once an hour for each individual, and the binary label (0 or 1) was simultaneously provided for each item. Data Synthesis: The proposed model was evaluated by 30-fold cross-validation. The sensitivity, specificity, and normalized utility score were 0.641 +/- 0.022, 0.844 +/- 0.007, and 0.401 +/- 0.019 on publicly available datasets, respectively. The final normalized utility score our team (UCAS_DataMiner) has obtained was 0.313 on full hidden test set (0.406, 0.373, and -0.215 on test set A, B, and C, respectively). Conclusions: We realized a 6-hour ahead early-onset prediction of sepsis on the basis of clinical electronic health record by ensemble learning. The results indicated the proposed model functioned well in the early prediction of sepsis. In particular, ensemble learning had a significant (p < 0.01) improvement than any single model in performance.
引用
收藏
页码:E1337 / E1342
页数:6
相关论文
共 50 条
  • [21] Enhanced early prediction of Li-ion battery degradation using multicycle features and an ensemble deep learning model
    Safitri, Meilia
    Adji, Teguh Bharata
    Cahyadi, Adha Imam
    RESULTS IN ENGINEERING, 2025, 25
  • [22] Prediction of Endometrial Carcinoma Using the Combination of Electronic Health Records and an Ensemble Machine Learning Method
    Wang, Wenwen
    Xu, Yang
    Yuan, Suzhen
    Li, Zhiying
    Zhu, Xin
    Zhou, Qin
    Shen, Wenfeng
    Wang, Shixuan
    FRONTIERS IN MEDICINE, 2022, 9
  • [23] A Regularized Deep Learning Approach for Clinical Risk Prediction of Acute Coronary Syndrome Using Electronic Health Records
    Huang, Zhengxing
    Dong, Wei
    Duan, Huilong
    Liu, Jiquan
    IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2018, 65 (05) : 956 - 968
  • [24] Software defect prediction using ensemble learning on selected features
    Laradji, Issam H.
    Alshayeb, Mohammad
    Ghouti, Lahouari
    INFORMATION AND SOFTWARE TECHNOLOGY, 2015, 58 : 388 - 402
  • [25] Prediction of Sepsis and In-Hospital Mortality Using Electronic Health Records
    Khojandi, Anahita
    Tansakul, Varisara
    Li, Xueping
    Koszalinski, Rebecca S.
    Paiva, William
    METHODS OF INFORMATION IN MEDICINE, 2018, 57 (04) : 185 - 193
  • [26] Deep Learning with Heterogeneous Graph Embeddings for Mortality Prediction from Electronic Health Records
    Wanyan, Tingyi
    Honarvar, Hossein
    Azad, Ariful
    Ding, Ying
    Glicksberg, Benjamin S.
    DATA INTELLIGENCE, 2021, 3 (03) : 329 - 339
  • [27] Deep Learning with Heterogeneous Graph Embeddings for Mortality Prediction from Electronic Health Records
    Tingyi Wanyan
    Hossein Honarvar
    Ariful Azad
    Ying Ding
    Benjamin SGlicksberg
    Data Intelligence, 2021, 3 (03) : 329 - 339
  • [28] Selecting relevant features from the electronic health record for clinical code prediction
    Scheurwegs, Elyne
    Cule, Boris
    Luyckx, Kim
    Luyten, Leon
    Daelemans, Walter
    JOURNAL OF BIOMEDICAL INFORMATICS, 2017, 74 : 92 - 103
  • [29] Postprediction Inference for Clinical Characteristics Extracted With Machine Learning on Electronic Health Records
    Sondhi, Arjun
    Rich, Alexander S.
    Wang, Siruo
    Leek, Jeffery T.
    JCO CLINICAL CANCER INFORMATICS, 2023, 7
  • [30] Postprediction Inference for Clinical Characteristics Extracted With Machine Learning on Electronic Health Records
    Sondhi, Arjun
    Rich, Alexander S.
    Wang, Siruo
    Leek, Jeffery T.
    JCO CLINICAL CANCER INFORMATICS, 2023, 7 : e2200174