Forecasting COVID-19 cases using dynamic time warping and incremental machine learning methods

被引:6
|
作者
Miralles-Pechuan, Luis [1 ]
Kumar, Ankit [2 ]
Suarez-Cetrulo, Andres L. [2 ]
机构
[1] Technol Univ Dublin, Sch Comp Sci, Dublin, Ireland
[2] Univ Coll Dublin, Ctr Appl Data Analyt Res CeADAR, Dublin, Ireland
关键词
COVID-19; prediction; dynamic time warping; epidemiology curve; incremental machine learning; time series similarity measures;
D O I
10.1111/exsy.13237
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The investment of time and resources for developing better strategies is key to dealing with future pandemics. In this work, we recreated the situation of COVID-19 across the year 2020, when the pandemic started spreading worldwide. We conducted experiments to predict the coronavirus cases for the 50 countries with the most cases during 2020. We compared the performance of state-of-the-art machine learning algorithms, such as long-short-term memory networks, against that of online incremental machine learning algorithms. To find the best strategy, we performed experiments to test three different approaches. In the first approach (single-country), we trained each model using data only from the country we were predicting. In the second one (multiple-country), we trained a model using the data from the 50 countries, and we used that model to predict each of the 50 countries. In the third experiment, we first applied clustering to calculate the nine most similar countries to the country that we were predicting. We consider two countries to be similar if the differences between the curve that represents the COVID-19 time series are small. To do so, we used time series similarity measures (TSSM) such as Euclidean Distance (ED) and Dynamic Time Warping (DTW). TSSM return a real value that represents the distance between the points in two time series which can be interpreted as how similar they are. Then, we trained the models with the data from the nine more similar countries to the one that was predicted and the predicted one. We used the model ARIMA as a baseline for our results. Results show that the idea of using TSSM is a very effective approach. By using it with the ED, the obtained RMSE in the single-country and multiple-country approaches was reduced by 74.21% and 74.70%, respectively. And by using the DTW, the RMSE was reduced by 74.89% and 75.36%. The main advantage of our methodology is that it is very simple and fast to apply since it is only based on time series data, as opposed to more complex methodologies that require a deep and thorough study to consider the number of parameters involved in the spread of the virus and their corresponding values. We made our code public to allow other researchers to explore our proposed methodology.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] A Machine Learning Univariate Time series Model for Forecasting COVID-19 Confirmed Cases: A Pilot Study in Botswana
    Mphale, Ofaletse
    Okike, Ezekiel U.
    Rafifing, Neo
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2022, 22 (01): : 225 - 233
  • [22] Machine Learning Datamining Methods To Predict Fore Coming Covid-19 Cases
    Preethi, B. Meena
    Radha, P.
    INTERNATIONAL JOURNAL OF LIFE SCIENCE AND PHARMA RESEARCH, 2022, 12 : 150 - 154
  • [23] Prediction and forecasting of worldwide corona virus (COVID-19) outbreak using time series and machine learning
    Jain, Priyank
    Sahu, Shriya
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (26):
  • [24] Machine Learning and Classical Forecasting Methods Based Decision Support Systems for COVID-19
    Unlu, Ramazan
    Namli, Ersin
    CMC-COMPUTERS MATERIALS & CONTINUA, 2020, 64 (03): : 1383 - 1399
  • [25] Estimating COVID-19 Cases Using Machine Learning Regression Algorithms
    Deshwal, Vaishali
    Kumar, Vimal
    Shukla, Rati
    Yadav, Vikash
    RECENT ADVANCES IN ELECTRICAL & ELECTRONIC ENGINEERING, 2022, 15 (05) : 390 - 400
  • [26] An Improved COVID-19 Forecasting by Infectious Disease Modelling Using Machine Learning
    Ahmad, Hafiz Farooq
    Khaloofi, Huda
    Azhar, Zahra
    Algosaibi, Abdulelah
    Hussain, Jamil
    APPLIED SCIENCES-BASEL, 2021, 11 (23):
  • [27] Data analysis of Covid-19 pandemic and short-term cumulative case forecasting using machine learning time series methods
    Balli, Serkan
    CHAOS SOLITONS & FRACTALS, 2021, 142
  • [28] Predicting the negative conversion time of nonsevere COVID-19 patients using machine learning methods
    Ye, Jiru
    Shao, Xiaonan
    Yang, Yong
    Zhu, Feng
    JOURNAL OF MEDICAL VIROLOGY, 2023, 95 (04)
  • [29] COVID-19 Influence: A General Analysis using Machine Learning Methods
    Chen, Yanxiong
    Mi, Zixuan
    Xiao, Zaichu
    Zhang, Yunqi
    Proceedings - 2021 3rd International Conference on Machine Learning, Big Data and Business Intelligence, MLBDBI 2021, 2021, : 284 - 290
  • [30] Gravitational search algorithm-extreme learning machine for COVID-19 active cases forecasting
    Huang, Boyu
    Song, Youyi
    Cui, Zhihan
    Dou, Haowen
    Jiang, Dazhi
    Zhou, Teng
    Qin, Jing
    IET SOFTWARE, 2023, 17 (04) : 554 - 565