A data-driven eXtreme gradient boosting machine learning model to predict COVID-19 transmission with meteorological drivers

被引:10
|
作者
Rahman, Md Siddikur [1 ]
Chowdhury, Arman Hossain [1 ]
机构
[1] Begum Rokeya Univ, Dept Stat, Rangpur, Bangladesh
来源
PLOS ONE | 2022年 / 17卷 / 09期
关键词
TEMPERATURE; HUMIDITY;
D O I
10.1371/journal.pone.0273319
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
COVID-19 pandemic has become a global major public health concern. Examining the meteorological risk factors and accurately predicting the incidence of the COVID-19 pandemic is an extremely important challenge. Therefore, in this study, we analyzed the relationship between meteorological factors and COVID-19 transmission in SAARC countries. We also compared the predictive accuracy of Autoregressive Integrated Moving Average (ARIMAX) and eXtreme Gradient Boosting (XGBoost) methods for precise modelling of COVID-19 incidence. We compiled a daily dataset including confirmed COVID-19 case counts, minimum and maximum temperature (degrees C), relative humidity (%), surface pressure (kPa), precipitation (mm/day) and maximum wind speed (m/s) from the onset of the disease to January 29, 2022, in each country. The data were divided into training and test sets. The training data were used to fit ARIMAX model for examining significant meteorological risk factors. All significant factors were then used as covariates in ARIMAX and XGBoost models to predict the COVID-19 confirmed cases. We found that maximum temperature had a positive impact on the COVID-19 transmission in Afghanistan (beta = 11.91, 95% CI: 4.77, 19.05) and India (beta = 0.18, 95% CI: 0.01, 0.35). Surface pressure had a positive influence in Pakistan (beta = 25.77, 95% CI: 7.85, 43.69) and Sri Lanka (beta = 411.63, 95% CI: 49.04, 774.23). We also found that the XGBoost model can help improve prediction of COVID-19 cases in SAARC countries over the ARIMAX model. The study findings will help the scientific communities and policymakers to establish a more accurate early warning system to control the spread of the pandemic.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] DATA-DRIVEN HYPERPARAMETER OPTIMIZED EXTREME GRADIENT BOOSTING MACHINE LEARNING MODEL FOR SOLAR RADIATION FORECASTING
    Kumar, Mantosh
    Namrata, Kumari
    Kumar, Nishant
    ADVANCES IN ELECTRICAL AND ELECTRONIC ENGINEERING, 2022, 20 (04) : 549 - 559
  • [2] Clinical data-driven approach to identifying COVID-19 and influenza from a gradient-boosting model
    Chi, Duong Thi Kim
    Lang, Tran Van
    Nguyen, Thanh Q.
    COGENT ENGINEERING, 2023, 10 (01):
  • [3] Mortality predictors in patients with COVID-19 pneumonia: a machine learning approach using eXtreme Gradient Boosting model
    N. Casillas
    A. M. Torres
    M. Moret
    A. Gómez
    J. M. Rius-Peris
    J. Mateo
    Internal and Emergency Medicine, 2022, 17 : 1929 - 1939
  • [4] Mortality predictors in patients with COVID-19 pneumonia: a machine learning approach using eXtreme Gradient Boosting model
    Casillas, N.
    Torres, A. M.
    Moret, M.
    Gomez, A.
    Rius-Peris, J. M.
    Mateo, J.
    INTERNAL AND EMERGENCY MEDICINE, 2022, 17 (07) : 1929 - 1939
  • [5] A data-driven model to describe and forecast the dynamics of COVID-19 transmission
    Paiva, Henrique Mohallem
    Magalhaes Afonso, Rubens Junqueira
    de Oliveira, Igor Luppi
    Garcia, Gabriele Fernandes
    PLOS ONE, 2020, 15 (07):
  • [6] Development of gradient boosting-assisted machine learning data-driven model for free chlorine residual prediction
    Helm, Wiley
    Zhong, Shifa
    Reid, Elliot
    Igou, Thomas
    Chen, Yongsheng
    FRONTIERS OF ENVIRONMENTAL SCIENCE & ENGINEERING, 2024, 18 (02)
  • [7] A gradient boosting machine learning approach in modeling the impact of temperature and humidity on the transmission rate of COVID-19 in India
    Shrivastav, Lokesh Kumar
    Jha, Sunil Kumar
    APPLIED INTELLIGENCE, 2021, 51 (05) : 2727 - 2739
  • [8] A gradient boosting machine learning approach in modeling the impact of temperature and humidity on the transmission rate of COVID-19 in India
    Lokesh Kumar Shrivastav
    Sunil Kumar Jha
    Applied Intelligence, 2021, 51 : 2727 - 2739
  • [9] A population data-driven workflow for COVID-19 modeling and learning
    Ozik, Jonathan
    Wozniak, Justin M.
    Collier, Nicholson
    Macal, Charles M.
    Binois, Mickael
    INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2021, 35 (05): : 483 - 499
  • [10] Early Triage of COVID-19 patients exploiting Data-Driven Strategies and Machine Learning Techniques
    Park, Ji-Sung
    Kim, Gun-Woo
    Seok, Hyeri
    Shin, Hong Ju
    Lee, Dong-Ho
    2022 INTERNATIONAL CONFERENCE ON ELECTRONICS, INFORMATION, AND COMMUNICATION (ICEIC), 2022,