Forecasting COVID-19 cases using dynamic time warping and incremental machine learning methods

被引:6
|
作者
Miralles-Pechuan, Luis [1 ]
Kumar, Ankit [2 ]
Suarez-Cetrulo, Andres L. [2 ]
机构
[1] Technol Univ Dublin, Sch Comp Sci, Dublin, Ireland
[2] Univ Coll Dublin, Ctr Appl Data Analyt Res CeADAR, Dublin, Ireland
关键词
COVID-19; prediction; dynamic time warping; epidemiology curve; incremental machine learning; time series similarity measures;
D O I
10.1111/exsy.13237
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The investment of time and resources for developing better strategies is key to dealing with future pandemics. In this work, we recreated the situation of COVID-19 across the year 2020, when the pandemic started spreading worldwide. We conducted experiments to predict the coronavirus cases for the 50 countries with the most cases during 2020. We compared the performance of state-of-the-art machine learning algorithms, such as long-short-term memory networks, against that of online incremental machine learning algorithms. To find the best strategy, we performed experiments to test three different approaches. In the first approach (single-country), we trained each model using data only from the country we were predicting. In the second one (multiple-country), we trained a model using the data from the 50 countries, and we used that model to predict each of the 50 countries. In the third experiment, we first applied clustering to calculate the nine most similar countries to the country that we were predicting. We consider two countries to be similar if the differences between the curve that represents the COVID-19 time series are small. To do so, we used time series similarity measures (TSSM) such as Euclidean Distance (ED) and Dynamic Time Warping (DTW). TSSM return a real value that represents the distance between the points in two time series which can be interpreted as how similar they are. Then, we trained the models with the data from the nine more similar countries to the one that was predicted and the predicted one. We used the model ARIMA as a baseline for our results. Results show that the idea of using TSSM is a very effective approach. By using it with the ED, the obtained RMSE in the single-country and multiple-country approaches was reduced by 74.21% and 74.70%, respectively. And by using the DTW, the RMSE was reduced by 74.89% and 75.36%. The main advantage of our methodology is that it is very simple and fast to apply since it is only based on time series data, as opposed to more complex methodologies that require a deep and thorough study to consider the number of parameters involved in the spread of the virus and their corresponding values. We made our code public to allow other researchers to explore our proposed methodology.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] Forecasting of COVID-19 cases using deep learning models: Is it reliable and practically significant?
    Devaraj, Jayanthi
    Elavarasan, Rajvikram Madurai
    Pugazhendhi, Rishi
    Shafiullah, G. M.
    Ganesan, Sumathi
    Jeysree, Ajay Kaarthic
    Khan, Irfan Ahmad
    Hossain, Eklas
    RESULTS IN PHYSICS, 2021, 21
  • [42] Forecasting gasoline consumption using machine learning algorithms during COVID-19 pandemic
    Ceylan, Zeynep
    Akbulut, Derya
    Bayturk, Engin
    ENERGY SOURCES PART A-RECOVERY UTILIZATION AND ENVIRONMENTAL EFFECTS, 2022, 46 (01) : 16623 - 16641
  • [43] Machine Learning Methods on COVID-19 Situation Prediction
    Yang, Zhihao
    Chen, Kang'an
    2020 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND COMPUTER ENGINEERING (ICAICE 2020), 2020, : 78 - 83
  • [44] Clustering and Modelling of the Top 30 Cryptocurrency Prices Using Dynamic Time Warping and Machine Learning Methods
    Stastny, Tomas
    Koudelka, Jiri
    Bilkova, Diana
    Marek, Lubos
    MATHEMATICS, 2022, 10 (19)
  • [45] Forecasting COVID-19 cases using time series modeling and association rule mining
    Rachasak Somyanonthanakul
    Kritsasith Warin
    Watchara Amasiri
    Karicha Mairiang
    Chatchai Mingmalairak
    Wararit Panichkitkosolkul
    Krittin Silanun
    Thanaruk Theeramunkong
    Surapon Nitikraipot
    Siriwan Suebnukarn
    BMC Medical Research Methodology, 22
  • [46] Forecasting COVID-19 cases using time series modeling and association rule mining
    Somyanonthanakul, Rachasak
    Warin, Kritsasith
    Amasiri, Watchara
    Mairiang, Karicha
    Mingmalairak, Chatchai
    Panichkitkosolkul, Wararit
    Silanun, Krittin
    Theeramunkong, Thanaruk
    Nitikraipot, Surapon
    Suebnukarn, Siriwan
    BMC MEDICAL RESEARCH METHODOLOGY, 2022, 22 (01)
  • [47] Dynamic time series modelling and forecasting of COVID-19 in Norway
    Bardsen, Gunnar
    Nymoen, Ragnar
    INTERNATIONAL JOURNAL OF FORECASTING, 2025, 41 (01) : 251 - 269
  • [48] Modelling and forecasting new cases of Covid-19 in Nigeria: Comparison of regression, ARIMA and machine learning models
    Busari, S. I.
    Samson, T. K.
    SCIENTIFIC AFRICAN, 2022, 18
  • [49] Prediction and analysis of COVID-19 daily new cases and cumulative cases: times series forecasting and machine learning models
    Wang, Yanding
    Yan, Zehui
    Wang, Ding
    Yang, Meitao
    Li, Zhiqiang
    Gong, Xinran
    Wu, Di
    Zhai, Lingling
    Zhang, Wenyi
    Wang, Yong
    BMC INFECTIOUS DISEASES, 2022, 22 (SUPPL 1)
  • [50] Prediction and analysis of COVID-19 daily new cases and cumulative cases: times series forecasting and machine learning models
    Yanding Wang
    Zehui Yan
    Ding Wang
    Meitao Yang
    Zhiqiang Li
    Xinran Gong
    Di Wu
    Lingling Zhai
    Wenyi Zhang
    Yong Wang
    BMC Infectious Diseases, 22