Prediction of sepsis mortality in ICU patients using machine learning methods

被引:5
|
作者
Gao, Jiayi [1 ]
Lu, Yuying [1 ]
Ashrafi, Negin [1 ]
Domingo, Ian [2 ]
Alaei, Kamiar [3 ]
Pishgar, Maryam [1 ]
机构
[1] Univ Southern Calif, Dept Ind Syst Engn, 3715 McClintock Ave, Los Angeles, CA 90089 USA
[2] Univ Calif Irvine, Dept Informat & Comp Sci, Inner Ring Rd, Irvine, CA 92697 USA
[3] Calif State Univ Long Beach, Dept Hlth Sci, 1250 Bellflower Blvd,HHS2-117, Long Beach, CA 90840 USA
关键词
Sepsis; Random Forest; Machine learning; MIMIC-IV; ICU mortality;
D O I
10.1186/s12911-024-02630-z
中图分类号
R-058 [];
学科分类号
摘要
ProblemSepsis, a life-threatening condition, accounts for the deaths of millions of people worldwide. Accurate prediction of sepsis outcomes is crucial for effective treatment and management. Previous studies have utilized machine learning for prognosis, but have limitations in feature sets and model interpretability.AimThis study aims to develop a machine learning model that enhances prediction accuracy for sepsis outcomes using a reduced set of features, thereby addressing the limitations of previous studies and enhancing model interpretability.MethodsThis study analyzes intensive care patient outcomes using the MIMIC-IV database, focusing on adult sepsis cases. Employing the latest data extraction tools, such as Google BigQuery, and following stringent selection criteria, we selected 38 features in this study. This selection is also informed by a comprehensive literature review and clinical expertise. Data preprocessing included handling missing values, regrouping categorical variables, and using the Synthetic Minority Over-sampling Technique (SMOTE) to balance the data. We evaluated several machine learning models: Decision Trees, Gradient Boosting, XGBoost, LightGBM, Multilayer Perceptrons (MLP), Support Vector Machines (SVM), and Random Forest. The Sequential Halving and Classification (SHAC) algorithm was used for hyperparameter tuning, and both train-test split and cross-validation methodologies were employed for performance and computational efficiency.ResultsThe Random Forest model was the most effective, achieving an area under the receiver operating characteristic curve (AUROC) of 0.94 with a confidence interval of +/- 0.01. This significantly outperformed other models and set a new benchmark in the literature. The model also provided detailed insights into the importance of various clinical features, with the Sequential Organ Failure Assessment (SOFA) score and average urine output being highly predictive. SHAP (Shapley Additive Explanations) analysis further enhanced the model's interpretability, offering a clearer understanding of feature impacts.ConclusionThis study demonstrates significant improvements in predicting sepsis outcomes using a Random Forest model, supported by advanced machine learning techniques and thorough data preprocessing. Our approach provided detailed insights into the key clinical features impacting sepsis mortality, making the model both highly accurate and interpretable. By enhancing the model's practical utility in clinical settings, we offer a valuable tool for healthcare professionals to make data-driven decisions, ultimately aiming to minimize sepsis-induced fatalities.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Interpretable machine learning-based prediction of 28-day mortality in ICU patients with sepsis: a multicenter retrospective study
    Shen, Li
    Wu, Jiaqiang
    Lan, Jianger
    Chen, Chao
    Wang, Yi
    Li, Zhiping
    FRONTIERS IN CELLULAR AND INFECTION MICROBIOLOGY, 2025, 14
  • [32] Early prediction of mortality at sepsis diagnosis time in critically ill patients by using interpretable machine learning
    Yi-Wei Cheng
    Po-Chih Kuo
    Shih-Hong Chen
    Yu-Ting Kuo
    Tyng-Luh Liu
    Wing-Sum Chan
    Kuang-Cheng Chan
    Yu-Chang Yeh
    Journal of Clinical Monitoring and Computing, 2024, 38 : 271 - 279
  • [33] Mortality prediction for ICU patients combining just-in-time learning and extreme learning machine
    Ding, Yangyang
    Wang, Youqing
    Zhou, Donghua
    NEUROCOMPUTING, 2018, 281 : 12 - 19
  • [34] Identification of Risk Factors and Prediction of Sepsis in Pregnancy Using Machine Learning Methods
    Kopanitsa, Georgy
    Metsker, Oleg
    Paskoshev, David
    Greschischeva, Sofia
    10TH INTERNATIONAL YOUNG SCIENTISTS CONFERENCE IN COMPUTATIONAL SCIENCE (YSC2021), 2021, 193 : 393 - 401
  • [35] Mortality Prediction In Heart Transplant Recipients Using Machine Learning Methods
    Carcamo, Francisco javier jimenez
    Rubens, Muni
    Saxena, Anshul
    Koerfer, Daniel
    Gonzalez, Pablo gordilo
    Ali, Shozab
    Vaidean, Georgetta
    Chaparro, Sandra
    Fontana, Marianna
    Kumar, Varun
    Sheridan, Paige
    Rubens, Lexie
    Toyip, Astra
    Boehme, Amelia
    Bender, Shaun
    Capocelli, Kelley
    Danese, David
    JOURNAL OF CARDIAC FAILURE, 2025, 31 (01)
  • [36] Machine Learning Approaches-Driven for Mortality Prediction for Patients Undergoing Craniotomy in ICU
    Yu, Ronguo
    Wang, Shaobo
    Xu, Jingqing
    Wang, Qiqi
    He, Xinjun
    Li, Jun
    Shang, Xiuling
    Chen, Han
    Liu, Youjun
    BRAIN INJURY, 2021, 35 (14) : 1658 - 1664
  • [37] A machine learning-based prediction of hospital mortality in mechanically ventilated ICU patients
    Li, Hexin
    Ashrafi, Negin
    Kang, Chris
    Zhao, Guanlan
    Chen, Yubing
    Pishgar, Maryam
    Rathnayake, Upaka
    PLOS ONE, 2024, 19 (09):
  • [38] Early Prediction of Sepsis using Machine Learning
    Shankar, Anuraag
    Diwan, Mufaddal
    Singh, Snigdha
    Nahrpurawala, Husain
    Bhowmick, Tanusri
    2021 11TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING (CONFLUENCE 2021), 2021, : 837 - 842
  • [39] Early Prediction of ICU Mortality in Patients with Acute Hypoxemic Respiratory Failure Using Machine Learning: The MEMORIAL Study
    Villar, Jesus
    Gonzalez-Martin, Jesus M.
    Fernandez, Cristina
    Anon, Jose M.
    Ferrando, Carlos
    Mora-Ordonez, Juan M.
    Martinez, Domingo
    Mosteiro, Fernando
    Ambros, Alfonso
    Fernandez, Lorena
    Murcia, Isabel
    Vidal, Anxela
    Pestana, David
    Romera, Miguel A.
    Montiel, Raquel
    Dominguez-Berrot, Ana M.
    Soler, Juan A.
    Gomez-Bentolila, Estrella
    Steyerberg, Ewout W.
    Szakmany, Tamas
    JOURNAL OF CLINICAL MEDICINE, 2025, 14 (05)
  • [40] A comparison of machine learning models versus clinical evaluation for mortality prediction in patients with sepsis
    van Doorn, William P. T. M.
    Stassen, Patricia M.
    Borggreve, Hella F.
    Schalkwijk, Maaike J.
    Stoffers, Judith
    Bekers, Otto
    Meex, Steven J. R.
    PLOS ONE, 2021, 16 (01):