Forecasting COVID-19 cases using dynamic time warping and incremental machine learning methods

被引:6
|
作者
Miralles-Pechuan, Luis [1 ]
Kumar, Ankit [2 ]
Suarez-Cetrulo, Andres L. [2 ]
机构
[1] Technol Univ Dublin, Sch Comp Sci, Dublin, Ireland
[2] Univ Coll Dublin, Ctr Appl Data Analyt Res CeADAR, Dublin, Ireland
关键词
COVID-19; prediction; dynamic time warping; epidemiology curve; incremental machine learning; time series similarity measures;
D O I
10.1111/exsy.13237
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The investment of time and resources for developing better strategies is key to dealing with future pandemics. In this work, we recreated the situation of COVID-19 across the year 2020, when the pandemic started spreading worldwide. We conducted experiments to predict the coronavirus cases for the 50 countries with the most cases during 2020. We compared the performance of state-of-the-art machine learning algorithms, such as long-short-term memory networks, against that of online incremental machine learning algorithms. To find the best strategy, we performed experiments to test three different approaches. In the first approach (single-country), we trained each model using data only from the country we were predicting. In the second one (multiple-country), we trained a model using the data from the 50 countries, and we used that model to predict each of the 50 countries. In the third experiment, we first applied clustering to calculate the nine most similar countries to the country that we were predicting. We consider two countries to be similar if the differences between the curve that represents the COVID-19 time series are small. To do so, we used time series similarity measures (TSSM) such as Euclidean Distance (ED) and Dynamic Time Warping (DTW). TSSM return a real value that represents the distance between the points in two time series which can be interpreted as how similar they are. Then, we trained the models with the data from the nine more similar countries to the one that was predicted and the predicted one. We used the model ARIMA as a baseline for our results. Results show that the idea of using TSSM is a very effective approach. By using it with the ED, the obtained RMSE in the single-country and multiple-country approaches was reduced by 74.21% and 74.70%, respectively. And by using the DTW, the RMSE was reduced by 74.89% and 75.36%. The main advantage of our methodology is that it is very simple and fast to apply since it is only based on time series data, as opposed to more complex methodologies that require a deep and thorough study to consider the number of parameters involved in the spread of the virus and their corresponding values. We made our code public to allow other researchers to explore our proposed methodology.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Forecasting of Covid-19 Cases Using Machine Learning Approach
    Kumar, Sachin
    Veer, Karan
    CURRENT RESPIRATORY MEDICINE REVIEWS, 2020, 16 (04) : 240 - 245
  • [2] Forecasting COVID-19 new cases using deep learning methods
    Xu, Lu
    Magar, Rishikesh
    Farimani, Amir Barati
    COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 144
  • [3] Forecasting of COVID-19 Cases in India Using Machine Learning: A Critical Analysis
    Nagvanshi, Suraj Singh
    Kaur, Inderjeet
    PROCEEDINGS OF THIRD DOCTORAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE, DOSCI 2022, 2023, 479 : 593 - 601
  • [4] Statistical Machine and Deep Learning Methods for Forecasting of Covid-19
    Juneja, Mamta
    Saini, Sumindar Kaur
    Kaur, Harleen
    Jindal, Prashant
    WIRELESS PERSONAL COMMUNICATIONS, 2024, 138 (01) : 497 - 524
  • [5] The Number of Confirmed Cases of Covid-19 by using Machine Learning: Methods and Challenges
    Ahmad, Amir
    Garhwal, Sunita
    Ray, Santosh Kumar
    Kumar, Gagan
    Malebary, Sharaf Jameel
    Barukab, Omar Mohammed
    ARCHIVES OF COMPUTATIONAL METHODS IN ENGINEERING, 2021, 28 (04) : 2645 - 2653
  • [6] The Number of Confirmed Cases of Covid-19 by using Machine Learning: Methods and Challenges
    Amir Ahmad
    Sunita Garhwal
    Santosh Kumar Ray
    Gagan Kumar
    Sharaf Jameel Malebary
    Omar Mohammed Barukab
    Archives of Computational Methods in Engineering, 2021, 28 : 2645 - 2653
  • [7] FOCOMO : Forecasting and monitoring the worldwide spread of COVID-19 using machine learning methods
    Agrawal, Prateek
    Madaan, Vishu
    Roy, Aditya
    Kumari, Rajani
    Deore, Harshal
    JOURNAL OF INTERDISCIPLINARY MATHEMATICS, 2021, 24 (02) : 443 - 466
  • [8] Time series forecasting of new cases and new deaths rate for COVID-19 using deep learning methods
    Ayoobi, Nooshin
    Sharifrazi, Danial
    Alizadehsani, Roohallah
    Shoeibi, Afshin
    Gorriz, Juan M.
    Moosaei, Hossein
    Khosravi, Abbas
    Nahavandi, Saeid
    Chofreh, Abdoulmohammad Gholamzadeh
    Goni, Feybi Ariani
    Klemes, Jiri Jaromir
    Mosavi, Amir
    RESULTS IN PHYSICS, 2021, 27
  • [9] Machine Learning Techniques and Forecasting Methods for Analyzing and Predicting Covid-19
    Alshabeeb, Israa Ali
    Azeez, Ruaa Majeed
    Shakir, Wafaa Mohammed Ridha
    INTERNATIONAL JOURNAL OF MATHEMATICS AND COMPUTER SCIENCE, 2022, 17 (01): : 413 - 424
  • [10] Comparative study of machine learning methods for COVID-19 transmission forecasting
    Dairi, Abdelkader
    Harrou, Fouzi
    Zeroual, Abdelhafid
    Hittawe, Mohamad Mazen
    Sun, Ying
    JOURNAL OF BIOMEDICAL INFORMATICS, 2021, 118