Machine learning-based survival analysis approaches for predicting the risk of pneumonia post-stroke discharge

被引:0
|
作者
Lee, Chang -Ching [1 ]
Su, Sheng-You [2 ]
Sung, Sheng-Feng [3 ,4 ]
机构
[1] Chia Yi Christian Hosp, Ditmanson Med Fdn, Dept Internal Med, Div Pulm Med, Chiayi, Taiwan
[2] Chia Yi Christian Hosp, Ditmanson Med Fdn, Clin Med Res Ctr, Dept Med Res, Chiayi, Taiwan
[3] Chia Yi Christian Hosp, Ditmanson Med Fdn, Dept Internal Med, Div Neurol, 539 Zhongxiao Rd, Chiayi 60002, Taiwan
[4] Min Hwei Jr Coll Hlth Care Management, Dept Beauty & Hlth Care, Tainan, Taiwan
关键词
Machine learning; Pneumonia; Prediction; Risk -scoring system; Stroke; Survival analysis; STROKE REGISTRY; SCORE; MODELS; CODES; TOOL;
D O I
10.1016/j.ijmedinf.2024.105422
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Background: Post-stroke pneumonia (PSP) is common among stroke patients. PSP occurring after hospital discharge continues to increase the risk of poor functional outcomes and death among stroke survivors. Currently, there is no prediction model specifically designed to predict the occurrence of PSP beyond the acute stage of stroke. This study aimed to explore the use of machine learning (ML) methods in predicting the risk of PSP after hospital discharge. Methods: This study analyzed data from 5,754 hospitalized stroke patients. The dataset was randomly divided into a training set and a holdout test set, with a ratio of 80:20. Several clinical and laboratory variables were utilized as predictors and different ML algorithms were employed to model time-to-event data. The ML model's predictive performance was compared to existing risk-scoring systems. A model-agnostic method based on Shapley additive explanations was utilized to interpret the ML model. Results: The study found that 5.7% of the study patients experienced pneumonia within one year after discharge. Based on repeated 5-fold cross-validation on the training set, the random survival forest (RSF) model had the highest C-index among the various ML algorithms and traditional Cox regression analysis. The final RSF model achieved a C-index of 0.787 (95% confidence interval: 0.737-0.840) on the holdout test set, outperforming five existing risk-scoring systems. The top three important predictors were the Glasgow Coma Scale score, age, and length of hospital stay. Conclusions: The RSF model demonstrated superior discriminative ability compared to other ML algorithms and traditional Cox regression analysis, suggesting a non-linear relationship between predictors and outcomes. The developed ML model can be integrated into the hospital information system to provide personalized risk assessments.
引用
收藏
页数:9
相关论文
共 50 条
  • [41] A machine learning-based analysis for predicting fragility curve parameters of buildings
    Dabiri, Hamed
    Faramarzi, Asaad
    Dall 'Asta, Andrea
    Tondi, Emanuele
    Micozzi, Fabio
    JOURNAL OF BUILDING ENGINEERING, 2022, 62
  • [42] In-hospital risk prediction for post-stroke depression after discharge: development and validation of the Post-Stroke Depression Prediction Scale
    de Man-van Ginkel, J. M.
    Hafsteinsdottir, T. B.
    Lindeman, E.
    Ettema, R. G. A.
    Grobbee, D. E.
    SChuurmans, M. J.
    CEREBROVASCULAR DISEASES, 2014, 37 : 45 - 45
  • [43] Neuroimaging mechanisms of acupuncture on functional reorganization for post-stroke motor improvement: a machine learning-based functional magnetic resonance imaging study
    Lu, Mengxin
    Du, Zhongming
    Zhao, Jiping
    Jiang, Lan
    Liu, Ruoyi
    Zhang, Muzhao
    Xu, Tianjiao
    Wei, Jingpei
    Wang, Wei
    Xu, Lingling
    Guo, Haijiao
    Chen, Chen
    Yu, Xin
    Tan, Zhongjian
    Fang, Jiliang
    Zou, Yihuai
    FRONTIERS IN NEUROSCIENCE, 2023, 17
  • [44] Establishment and verification of a nomogram model for predicting the risk of post-stroke depression
    Luo, Shihang
    Zhang, Wenrui
    Mao, Rui
    Huang, Xia
    Liu, Fan
    Liao, Qiao
    Sun, Dongren
    Chen, Hengshu
    Zhang, Jingyuan
    Tian, Fafa
    PEERJ, 2023, 11
  • [45] POST-STROKE DEPRESSION IN PRIMARY SUPPORT PERSONS - PREDICTING THOSE AT RISK
    TOMPKINS, CA
    RAU, MT
    SCHULZ, R
    JOURNAL OF CONSULTING AND CLINICAL PSYCHOLOGY, 1988, 56 (04) : 502 - 508
  • [46] Multiple imputation integrated to machine learning: predicting post-stroke recovery of ambulation after intensive inpatient rehabilitation
    Finocchi, Alice
    Campagnini, Silvia
    Mannini, Andrea
    Doronzio, Stefano
    Baccini, Marco
    Hakiki, Bahia
    Bardi, Donata
    Grippo, Antonello
    Macchi, Claudio
    Solano, Jorge Navarro
    Baccini, Michela
    Cecchi, Francesca
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [47] Nomogram model for predicting the risk of post-stroke depression based on clinical characteristics and DNA methylation
    Luo, Shihang
    Liu, Fan
    Liao, Qiao
    Chen, Hengshu
    Zhang, Tongtong
    Mao, Rui
    PEERJ, 2023, 11
  • [48] Risk Factors for Post-Stroke Pneumonia in a Patient Population with Subacute Stroke: A Retrospective Cohort Study
    Tashima, Hiroyuki
    Ito, Mari
    Kawakami, Michiyuki
    Ishii, Ryota
    Miyazaki, Yuta
    Akimoto, Tomonori
    Tsujikawa, Masahiro
    Kobayashi, Keigo
    Kondo, Kunitsugu
    Tsuji, Tetsuya
    JOURNAL OF CLINICAL MEDICINE, 2023, 12 (18)
  • [49] Machine learning in the prediction of post-stroke cognitive impairment: a systematic review and meta-analysis
    Li, XiaoSheng
    Chen, Zongning
    Jiao, Hexian
    Wang, BinYang
    Yin, Hui
    Chen, LuJia
    Shi, Hongling
    Yin, Yong
    Qin, Dongdong
    FRONTIERS IN NEUROLOGY, 2023, 14
  • [50] Clinical characteristics and risk factor analysis of Pneumocystis jirovecii pneumonia in patients with CKD: a machine learning-based approach
    Cai, Xiao-Yu
    Cheng, Yi-Chun
    Ge, Shu-Wang
    Xu, Gang
    EUROPEAN JOURNAL OF CLINICAL MICROBIOLOGY & INFECTIOUS DISEASES, 2023, 42 (03) : 323 - 338