Machine learning-based survival analysis approaches for predicting the risk of pneumonia post-stroke discharge

被引:0
|
作者
Lee, Chang -Ching [1 ]
Su, Sheng-You [2 ]
Sung, Sheng-Feng [3 ,4 ]
机构
[1] Chia Yi Christian Hosp, Ditmanson Med Fdn, Dept Internal Med, Div Pulm Med, Chiayi, Taiwan
[2] Chia Yi Christian Hosp, Ditmanson Med Fdn, Clin Med Res Ctr, Dept Med Res, Chiayi, Taiwan
[3] Chia Yi Christian Hosp, Ditmanson Med Fdn, Dept Internal Med, Div Neurol, 539 Zhongxiao Rd, Chiayi 60002, Taiwan
[4] Min Hwei Jr Coll Hlth Care Management, Dept Beauty & Hlth Care, Tainan, Taiwan
关键词
Machine learning; Pneumonia; Prediction; Risk -scoring system; Stroke; Survival analysis; STROKE REGISTRY; SCORE; MODELS; CODES; TOOL;
D O I
10.1016/j.ijmedinf.2024.105422
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Background: Post-stroke pneumonia (PSP) is common among stroke patients. PSP occurring after hospital discharge continues to increase the risk of poor functional outcomes and death among stroke survivors. Currently, there is no prediction model specifically designed to predict the occurrence of PSP beyond the acute stage of stroke. This study aimed to explore the use of machine learning (ML) methods in predicting the risk of PSP after hospital discharge. Methods: This study analyzed data from 5,754 hospitalized stroke patients. The dataset was randomly divided into a training set and a holdout test set, with a ratio of 80:20. Several clinical and laboratory variables were utilized as predictors and different ML algorithms were employed to model time-to-event data. The ML model's predictive performance was compared to existing risk-scoring systems. A model-agnostic method based on Shapley additive explanations was utilized to interpret the ML model. Results: The study found that 5.7% of the study patients experienced pneumonia within one year after discharge. Based on repeated 5-fold cross-validation on the training set, the random survival forest (RSF) model had the highest C-index among the various ML algorithms and traditional Cox regression analysis. The final RSF model achieved a C-index of 0.787 (95% confidence interval: 0.737-0.840) on the holdout test set, outperforming five existing risk-scoring systems. The top three important predictors were the Glasgow Coma Scale score, age, and length of hospital stay. Conclusions: The RSF model demonstrated superior discriminative ability compared to other ML algorithms and traditional Cox regression analysis, suggesting a non-linear relationship between predictors and outcomes. The developed ML model can be integrated into the hospital information system to provide personalized risk assessments.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Predicting Post-stroke Hospital Discharge Disposition Using Interpretable Machine Learning Approaches
    Cho, Jin
    Alharin, Alnour
    Hu, Zhen
    Fell, Nancy
    Sartipi, Mina
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 4817 - 4822
  • [2] Machine learning-based predictive model for post-stroke dementia
    Wei, Zemin
    Li, Mengqi
    Zhang, Chenghui
    Miao, Jinli
    Wang, Wenmin
    Fan, Hong
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2024, 24 (01)
  • [3] Predicting post-stroke activities of daily living through a machine learning-based approach on initiating rehabilitation
    Lin, Wan-Yin
    Chen, Chun-Hsien
    Tseng, Yi-Ju
    Tsai, Yu-Ting
    Chang, Ching-Yu
    Wang, Hsin-Yao
    Chen, Chih-Kuang
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2018, 111 : 159 - 164
  • [4] Predicting post-stroke pneumonia using deep neural network approaches
    Ge, Yanqiu
    Wang, Qinghua
    Wang, Li
    Wu, Honghu
    Peng, Chen
    Wang, Jiajing
    Xu, Yuan
    Xiong, Gang
    Zhang, Yaoyun
    Yi, Yingping
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2019, 132
  • [5] Predicting post-stroke pneumonia: the PANTHERIS score
    Harms, H.
    Grittner, U.
    Droege, H.
    Meisel, A.
    ACTA NEUROLOGICA SCANDINAVICA, 2013, 128 (03): : 178 - 184
  • [6] Machine Learning-Based Model for Prediction of Early Post-Stroke Fatigue in Patients With Stroke: A Longitudinal Study
    Wu, Yu
    Zhou, Depeng
    Fornah, Lovel
    Liu, Jian
    Zhao, Jun
    Wu, Shicai
    NEUROREHABILITATION AND NEURAL REPAIR, 2025,
  • [7] An Exploration of Machine Learning Methods for Predicting Post-stroke Aphasia Recovery
    Lai, Sha
    Billot, Anne
    Varkanitsa, Maria
    Braun, Emily J.
    Rapp, Brenda
    Parrish, Todd B.
    Kurani, Ajay S.
    Higgins, James
    Caplan, David
    Thompson, Cynthia K.
    Kiran, Swathi
    Betke, Margrit
    Ishwar, Prakash
    THE 14TH ACM INTERNATIONAL CONFERENCE ON PERVASIVE TECHNOLOGIES RELATED TO ASSISTIVE ENVIRONMENTS, PETRA 2021, 2021, : 556 - 564
  • [8] Logistic regression analysis and machine learning for predicting post-stroke gait independence: a retrospective study
    Miyazaki, Yuta
    Kawakami, Michiyuki
    Kondo, Kunitsugu
    Hirabe, Akiko
    Kamimoto, Takayuki
    Akimoto, Tomonori
    Hijikata, Nanako
    Tsujikawa, Masahiro
    Honaga, Kaoru
    Suzuki, Kanjiro
    Tsuji, Tetsuya
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [9] Post-stroke Anxiety Analysis via Machine Learning Methods
    Wang, Jirui
    Zhao, Defeng
    Lin, Meiqing
    Huang, Xinyu
    Shang, Xiuli
    FRONTIERS IN AGING NEUROSCIENCE, 2021, 13
  • [10] Development and validation of an interpretable machine learning model for predicting post-stroke epilepsy
    Yu, Yue
    Chen, Zhibin
    Yang, Yong
    Zhang, Jiajun
    Wang, Yan
    EPILEPSY RESEARCH, 2024, 205