Machine learning-based risk prediction model for pertussis in children: a multicenter retrospective study

被引:0
|
作者
Xie, Juan [1 ]
Ma, Run-wei [2 ]
Feng, Yu-jing [3 ]
Qiao, Yuan [4 ]
Zhu, Hong-yan [5 ]
Tao, Xing-ping [6 ]
Chen, Wen-juan [7 ]
Liu, Cong-yun [8 ]
Li, Tan [9 ]
Liu, Kai [10 ]
Cheng, Li-ming [1 ]
机构
[1] Kunming Childrens Hosp, Dept Anesthesiol, Kunming, Yunnan, Peoples R China
[2] Kunming Med Univ, Fuwai Yunnan Hosp, Chinese Acad Med Sci, Dept Cardiac Surg,Cardiovasc Hosp, Kunming, Yunnan, Peoples R China
[3] Wenshan Maternal & Child Hlth Care Hosp, Comprehens Pediat, Wenshan, Yunnan, Peoples R China
[4] Chuxiong Yi Autonomous Prefecture Peoples Hosp, Comprehens Pediat & Neonatol, Chuxiong, Yunnan, Peoples R China
[5] Qujing Maternal & Child Hlth Hosp, Pediat Resp Dept, Qujing, Yunnan, Peoples R China
[6] Kaiyuan Peoples Hosp, Dept Pediat, Kaiyuan, Peoples R China
[7] Yuxi Childrens Hosp, Dept Pediat & Emergency, Yuxi, Yunnan, Peoples R China
[8] Baoshan Peoples Hosp, Comprehens Pediat & Pulm & Crit Care Med, Baoshan, Yunnan, Peoples R China
[9] Kunming Childrens Hosp, Dept Resp Med, Kunming, Yunnan, Peoples R China
[10] Kunming Childrens Hosp, Comprehens Pediat & Pulm & Crit Care Med, Shulin St 28, Kunming 650000, Yunnan, Peoples R China
关键词
Public health; PDW-MPV-RATIO; SII; Calibration curves; Lasso regression; Random forest; Online deployment; Pertussis; VACCINE;
D O I
10.1186/s12879-025-10797-7
中图分类号
R51 [传染病];
学科分类号
100401 ;
摘要
BackgroundPertussis is a highly contagious respiratory disease. Even though vaccination has reduced the incidence, cases have resurfaced in certain regions due to immune escape and waning vaccine efficacy. Identifying high-risk patients to mitigate transmission and avert complications promptly is imperative. Nevertheless, the current diagnostic methods, including PCR and bacterial culture, are time-consuming and expensive. Some studies have attempted to develop risk prediction models based on multivariate data, but their performance can be improved. Therefore, this study aims to further optimize and expand the risk assessment tool to more efficiently identify high-risk individuals and compensate for the shortcomings of existing diagnostic methods.ObjectiveThe aim of this study was to develop a pertussis risk prediction model that is both efficient and has good generalization ability, applicable to different datasets. The model was constructed using machine learning techniques based on multicenter data and screened for key features. The performance and generalization ability of the model were evaluated by deploying it on an online platform. At the same time, this study aims to provide a rapid and accurate auxiliary diagnostic tool for clinical practice to help identify high-risk patients in a timely manner, optimize early intervention strategies, reduce the risk of complications and reduce transmission, thereby improving the efficiency of public health management.MethodsFirst, data from 1085 suspected pertussis patients from 7 centers were collected, and ten key features were analyzed using the lasso regression and Boruta algorithm: PDW-MPV-RATIO, SII, white blood cells, platelet distribution width, mean platelet volume, lymphocytes, cough duration, vaccination, fever, and lytic lymphocytes.Eight models were then trained and validated to assess their performance and to confirm their generalization ability with external datasets based on these features. Finally, an online platform was constructed for clinicians to use the models in real time.ResultsThe random forest model demonstrated excellent discrimination ability in the validation set, with an AUC of 0.98, and an AUC of 0.97 in the external validation set. Calibration curve and decision curve analysis showed that the model had high accuracy in predicting low-to-medium risk patients, which could help clinicians avoid unnecessary interventions, especially in resource-limited settings. The application of this model can help optimize the early identification and management of high-risk patients and improve clinical decision-making.ConclusionThe pertussis prediction model devised in this study was validated using multicenter data, exhibited high prediction performance, and was successfully implemented online. Future research should broaden the data sources and incorporate dynamic data to enhance the model's accuracy and applicability.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Machine learning-based prediction of acute severity in infants hospitalized for bronchiolitis: a multicenter prospective study
    Raita, Yoshihiko
    Camargo, Carlos A., Jr.
    Macias, Charles G.
    Mansbach, Jonathan M.
    Piedra, Pedro A.
    Porter, Stephen C.
    Teach, Stephen J.
    Hasegawa, Kohei
    SCIENTIFIC REPORTS, 2020, 10 (01)
  • [22] A machine learning-based prediction model for gout in hyperuricemics: a nationwide cohort study
    Brikman, Shay
    Serfaty, Liel
    Abuhasira, Ran
    Schlesinger, Naomi
    Bieber, Amir
    Rappoport, Nadav
    RHEUMATOLOGY, 2024, 63 (09) : 2411 - 2417
  • [23] Machine learning-based ensemble model for groundwater quality prediction: A case study
    Jose, Annie
    Yasala, Srinivas
    WATER PRACTICE AND TECHNOLOGY, 2024, 19 (06) : 2364 - 2375
  • [24] Machine learning-based prognostic subgrouping of glioblastoma: A multicenter study
    Akbari, Hamed
    Bakas, Spyridon
    Sako, Chiharu
    Kazerooni, Anahita Fathi
    Villanueva-Meyer, Javier
    Garcia, Jose A.
    Mamourian, Elizabeth
    Liu, Fang
    Cao, Quy
    Shinohara, Russell T.
    Baid, Ujjwal
    Getka, Alexander
    Pati, Sarthak
    Singh, Ashish
    Calabrese, Evan
    Chang, Susan
    Rudie, Jeffrey
    Sotiras, Aristeidis
    LaMontagne, Pamela
    Marcus, Daniel S.
    Milchenko, Mikhail
    Nazeri, Arash
    Balana, Carmen
    Capellades, Jaume
    Puig, Josep
    Badve, Chaitra
    Barnholtz-Sloan, Jill S.
    Sloan, Andrew E.
    Vadmal, Vachan
    Waite, Kristin
    Ak, Murat
    Colen, Rivka R.
    Park, Yae Won
    Ahn, Sung Soo
    Chang, Jong Hee
    Choi, Yoon Seong
    Lee, Seung-Koo
    Alexander, Gregory S.
    Ali, Ayesha S.
    Dicker, Adam P.
    Flanders, Adam E.
    Liem, Spencer
    Lombardo, Joseph
    Shi, Wenyin
    Shukla, Gaurav
    Griffith, Brent
    Poisson, Laila M.
    Rogers, Lisa R.
    Kotrotsou, Aikaterini
    Booth, Thomas C.
    NEURO-ONCOLOGY, 2025,
  • [25] Machine learning-based risk prediction model for cardiovascular disease using a hybrid dataset
    Kanagarathinam, Karthick
    Sankaran, Durairaj
    Manikandan, R.
    DATA & KNOWLEDGE ENGINEERING, 2022, 140
  • [26] Building a machine learning-based risk prediction model for second-trimester miscarriage
    Qi, Sangsang
    Zheng, Shi
    Lu, Mengdan
    Chen, Aner
    Chen, Yanbo
    Fu, Xianhu
    BMC PREGNANCY AND CHILDBIRTH, 2024, 24 (01)
  • [27] Developing a machine learning-based flood risk prediction model for the Indus Basin in Pakistan
    Khan, Mehran
    Khan, Afed Ullah
    Ullah, Basir
    Khan, Sunaid
    WATER PRACTICE AND TECHNOLOGY, 2024, 19 (06) : 2213 - 2225
  • [28] Machine learning-based prediction of postoperative mortality in emergency colorectal surgery: A retrospective, multicenter cohort study using Tokushukai medical database
    Akabane, Shota
    Miyake, Katsunori
    Iwagami, Masao
    Tanabe, Kazunari
    Takagi, Toshio
    HELIYON, 2023, 9 (09)
  • [29] A machine learning-based universal outbreak risk prediction tool
    Zhang, Tianyu
    Rabhi, Fethi
    Chen, Xin
    Paik, Hye-young
    Macintyre, Chandini Raina
    COMPUTERS IN BIOLOGY AND MEDICINE, 2024, 169
  • [30] Machine Learning-Based Risk Prediction of Discharge Status for Sepsis
    Cai, Kaida
    Lou, Yuqing
    Wang, Zhengyan
    Yang, Xiaofang
    Zhao, Xin
    ENTROPY, 2024, 26 (08)