Machine learning-based risk prediction model for pertussis in children: a multicenter retrospective study

被引:0
|
作者
Xie, Juan [1 ]
Ma, Run-wei [2 ]
Feng, Yu-jing [3 ]
Qiao, Yuan [4 ]
Zhu, Hong-yan [5 ]
Tao, Xing-ping [6 ]
Chen, Wen-juan [7 ]
Liu, Cong-yun [8 ]
Li, Tan [9 ]
Liu, Kai [10 ]
Cheng, Li-ming [1 ]
机构
[1] Kunming Childrens Hosp, Dept Anesthesiol, Kunming, Yunnan, Peoples R China
[2] Kunming Med Univ, Fuwai Yunnan Hosp, Chinese Acad Med Sci, Dept Cardiac Surg,Cardiovasc Hosp, Kunming, Yunnan, Peoples R China
[3] Wenshan Maternal & Child Hlth Care Hosp, Comprehens Pediat, Wenshan, Yunnan, Peoples R China
[4] Chuxiong Yi Autonomous Prefecture Peoples Hosp, Comprehens Pediat & Neonatol, Chuxiong, Yunnan, Peoples R China
[5] Qujing Maternal & Child Hlth Hosp, Pediat Resp Dept, Qujing, Yunnan, Peoples R China
[6] Kaiyuan Peoples Hosp, Dept Pediat, Kaiyuan, Peoples R China
[7] Yuxi Childrens Hosp, Dept Pediat & Emergency, Yuxi, Yunnan, Peoples R China
[8] Baoshan Peoples Hosp, Comprehens Pediat & Pulm & Crit Care Med, Baoshan, Yunnan, Peoples R China
[9] Kunming Childrens Hosp, Dept Resp Med, Kunming, Yunnan, Peoples R China
[10] Kunming Childrens Hosp, Comprehens Pediat & Pulm & Crit Care Med, Shulin St 28, Kunming 650000, Yunnan, Peoples R China
关键词
Public health; PDW-MPV-RATIO; SII; Calibration curves; Lasso regression; Random forest; Online deployment; Pertussis; VACCINE;
D O I
10.1186/s12879-025-10797-7
中图分类号
R51 [传染病];
学科分类号
100401 ;
摘要
BackgroundPertussis is a highly contagious respiratory disease. Even though vaccination has reduced the incidence, cases have resurfaced in certain regions due to immune escape and waning vaccine efficacy. Identifying high-risk patients to mitigate transmission and avert complications promptly is imperative. Nevertheless, the current diagnostic methods, including PCR and bacterial culture, are time-consuming and expensive. Some studies have attempted to develop risk prediction models based on multivariate data, but their performance can be improved. Therefore, this study aims to further optimize and expand the risk assessment tool to more efficiently identify high-risk individuals and compensate for the shortcomings of existing diagnostic methods.ObjectiveThe aim of this study was to develop a pertussis risk prediction model that is both efficient and has good generalization ability, applicable to different datasets. The model was constructed using machine learning techniques based on multicenter data and screened for key features. The performance and generalization ability of the model were evaluated by deploying it on an online platform. At the same time, this study aims to provide a rapid and accurate auxiliary diagnostic tool for clinical practice to help identify high-risk patients in a timely manner, optimize early intervention strategies, reduce the risk of complications and reduce transmission, thereby improving the efficiency of public health management.MethodsFirst, data from 1085 suspected pertussis patients from 7 centers were collected, and ten key features were analyzed using the lasso regression and Boruta algorithm: PDW-MPV-RATIO, SII, white blood cells, platelet distribution width, mean platelet volume, lymphocytes, cough duration, vaccination, fever, and lytic lymphocytes.Eight models were then trained and validated to assess their performance and to confirm their generalization ability with external datasets based on these features. Finally, an online platform was constructed for clinicians to use the models in real time.ResultsThe random forest model demonstrated excellent discrimination ability in the validation set, with an AUC of 0.98, and an AUC of 0.97 in the external validation set. Calibration curve and decision curve analysis showed that the model had high accuracy in predicting low-to-medium risk patients, which could help clinicians avoid unnecessary interventions, especially in resource-limited settings. The application of this model can help optimize the early identification and management of high-risk patients and improve clinical decision-making.ConclusionThe pertussis prediction model devised in this study was validated using multicenter data, exhibited high prediction performance, and was successfully implemented online. Future research should broaden the data sources and incorporate dynamic data to enhance the model's accuracy and applicability.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Machine learning-based prediction of pulmonary embolism to reduce unnecessary computed tomography scans in gastrointestinal cancer patients: a retrospective multicenter study
    Kim, Joo Seong
    Kwon, Doyun
    Kim, Kyungdo
    Lee, Sang Hyub
    Lee, Seung-Bo
    Kim, Kwangsoo
    Kim, Dongmin
    Lee, Min Woo
    Park, Namyoung
    Choi, Jin Ho
    Jang, Eun Sun
    Cho, In Rae
    Paik, Woo Hyun
    Lee, Jun Kyu
    Ryu, Ji Kon
    Kim, Yong-Tae
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [32] Machine learning-based diagnostic model for preoperative differentiation between xanthogranulomatous cholecystitis and gallbladder carcinoma: a multicenter retrospective cohort study
    Fu, Tianwei
    Bao, Yating
    Zhong, Zhihan
    Gao, Zhenyu
    Ye, Taiwei
    Zhang, Chengwu
    Jing, Huang
    Xiao, Zunqiang
    FRONTIERS IN ONCOLOGY, 2024, 14
  • [33] Machine Learning-based Nomograms for Predicting Clinical Stages of Initial Prostate Cancer: A Multicenter Retrospective Study
    Chen, Luyao
    Fu, Zhehong
    Dong, Qianxi
    Zheng, Fuchun
    Wang, Zhipeng
    Li, Sheng
    Zhan, Xiangpeng
    Dong, Wentao
    Song, Yanping
    Xu, Songhui
    Fu, Bin
    Xiong, Situ
    UROLOGY, 2024, 194 : 180 - 188
  • [34] Machine Learning-Based Sample Misidentification Error Detection in Clinical Laboratory Tests: A Retrospective Multicenter Study
    Seok, Hyeon Seok
    Yu, Shinae
    Shin, Kyung-Hwa
    Lee, Woochang
    Chun, Sail
    Kim, Sollip
    Shin, Hangsik
    CLINICAL CHEMISTRY, 2024, 70 (10) : 1256 - 1267
  • [35] Prediction of Cognitive Impairment Risk among Older Adults: A Machine Learning-Based Comparative Study and Model Development
    Li, Jianwei
    Li, Jie
    Zhu, Huafang
    Liu, Mengyu
    Li, Tengfei
    He, Yeke
    Xu, Yuan
    Huang, Fen
    Qin, Qirong
    DEMENTIA AND GERIATRIC COGNITIVE DISORDERS, 2024, 53 (04) : 169 - 179
  • [36] A Machine Learning-Based Algorithm for the Prediction of Intensive Care Unit Delirium (PRIDE): Retrospective Study
    Hur, Sujeong
    Ko, Ryoung-Eun
    Yoo, Junsang
    Ha, Juhyung
    Cha, Won Chul
    Chung, Chi Ryang
    JMIR MEDICAL INFORMATICS, 2021, 9 (07)
  • [37] Development and validation of machine learning-based models for prediction of adolescent idiopathic scoliosis: A retrospective study
    Lv, Zheng
    Lv, Wen
    Wang, Lei
    Ou, Jiayuan
    MEDICINE, 2023, 102 (14) : E33441
  • [38] A Machine Learning-Based Prediction Model for Acute Kidney Injury in Patients With Community-Acquired Pneumonia: Multicenter Validation Study
    Ma, Mengqing
    Chen, Caimei
    Chen, Dawei
    Zhang, Hao
    Du, Xia
    Sun, Qing
    Fan, Li
    Kong, Huiping
    Chen, Xueting
    Cao, Changchun
    Wan, Xin
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2024, 26
  • [39] Machine Learning-Based Model for Prediction of Outcomes in Acute Stroke
    Heo, JoonNyung
    Yoon, Jihoon G.
    Park, Hyungjong
    Kim, Young Dae
    Nam, Hyo Suk
    Heo, Ji Hoe
    STROKE, 2019, 50 (05) : 1263 - 1265
  • [40] Machine learning-based prediction model for the efficacy and safety of statins
    Xiong, Yu
    Liu, Xiaoyang
    Wang, Qing
    Zhao, Li
    Kong, Xudong
    Da, Chunhe
    Meng, Zuohuan
    Qu, Leilei
    Xia, Qinfang
    Liu, Lihong
    Li, Pengmei
    FRONTIERS IN PHARMACOLOGY, 2024, 15