Forecasting COVID-19 cases using dynamic time warping and incremental machine learning methods

被引:6
|
作者
Miralles-Pechuan, Luis [1 ]
Kumar, Ankit [2 ]
Suarez-Cetrulo, Andres L. [2 ]
机构
[1] Technol Univ Dublin, Sch Comp Sci, Dublin, Ireland
[2] Univ Coll Dublin, Ctr Appl Data Analyt Res CeADAR, Dublin, Ireland
关键词
COVID-19; prediction; dynamic time warping; epidemiology curve; incremental machine learning; time series similarity measures;
D O I
10.1111/exsy.13237
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The investment of time and resources for developing better strategies is key to dealing with future pandemics. In this work, we recreated the situation of COVID-19 across the year 2020, when the pandemic started spreading worldwide. We conducted experiments to predict the coronavirus cases for the 50 countries with the most cases during 2020. We compared the performance of state-of-the-art machine learning algorithms, such as long-short-term memory networks, against that of online incremental machine learning algorithms. To find the best strategy, we performed experiments to test three different approaches. In the first approach (single-country), we trained each model using data only from the country we were predicting. In the second one (multiple-country), we trained a model using the data from the 50 countries, and we used that model to predict each of the 50 countries. In the third experiment, we first applied clustering to calculate the nine most similar countries to the country that we were predicting. We consider two countries to be similar if the differences between the curve that represents the COVID-19 time series are small. To do so, we used time series similarity measures (TSSM) such as Euclidean Distance (ED) and Dynamic Time Warping (DTW). TSSM return a real value that represents the distance between the points in two time series which can be interpreted as how similar they are. Then, we trained the models with the data from the nine more similar countries to the one that was predicted and the predicted one. We used the model ARIMA as a baseline for our results. Results show that the idea of using TSSM is a very effective approach. By using it with the ED, the obtained RMSE in the single-country and multiple-country approaches was reduced by 74.21% and 74.70%, respectively. And by using the DTW, the RMSE was reduced by 74.89% and 75.36%. The main advantage of our methodology is that it is very simple and fast to apply since it is only based on time series data, as opposed to more complex methodologies that require a deep and thorough study to consider the number of parameters involved in the spread of the virus and their corresponding values. We made our code public to allow other researchers to explore our proposed methodology.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] Analysis on novel coronavirus (COVID-19) using machine learning methods
    Yadav, Milind
    Perumal, Murukessan
    Srinivas, M.
    CHAOS SOLITONS & FRACTALS, 2020, 139
  • [32] COVID-19 Influence: A General Analysis using Machine Learning Methods
    Chen, Yanxiong
    Mi, Zixuan
    Xiao, Zaichu
    Zhang, Yunqi
    2021 3RD INTERNATIONAL CONFERENCE ON MACHINE LEARNING, BIG DATA AND BUSINESS INTELLIGENCE (MLBDBI 2021), 2021, : 284 - 290
  • [33] Machine Learning Algorithms for Forecasting COVID 19 Confirmed Cases in America
    Jojoa Acosta, Mario Fernando
    Garcia-Zapirain, Begona
    2020 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT 2020), 2020,
  • [34] Machine learning Models to Predict COVID-19 Cases
    Alshabana, Ghadah
    Tran, Thao
    Saadati, Marjan
    George, Michael Thompson
    Chitimalla, Ashritha
    2022 IEEE INTERNATIONAL IOT, ELECTRONICS AND MECHATRONICS CONFERENCE (IEMTRONICS), 2022, : 223 - 229
  • [35] Forecasting COVID-19 Cases in Morocco: A Deep Learning Approach
    Hankar, Mustapha
    Birjali, Marouane
    Beni-Hssane, Abderrahim
    NETWORKING, INTELLIGENT SYSTEMS AND SECURITY, 2022, 237 : 845 - 857
  • [36] Machine Learning Model for Identification of Covid-19 Future Forecasting
    Anitha, N.
    Soundarajan, C.
    Swathi, V
    Tamilselvan, M.
    INNOVATIONS IN BIO-INSPIRED COMPUTING AND APPLICATIONS, IBICA 2021, 2022, 419 : 286 - 295
  • [37] A machine learning forecasting model for COVID-19 pandemic in India
    Sujath, R.
    Chatterjee, Jyotir Moy
    Hassanien, Aboul Ella
    STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT, 2020, 34 (07) : 959 - 972
  • [38] A machine learning forecasting model for COVID-19 pandemic in India
    R. Sujath
    Jyotir Moy Chatterjee
    Aboul Ella Hassanien
    Stochastic Environmental Research and Risk Assessment, 2020, 34 : 959 - 972
  • [39] COVID-19 Spread Forecasting, Mathematical Methods vs. Machine Learning, Moscow Case
    Pavlyutin, Matvey
    Samoyavcheva, Marina
    Kochkarov, Rasul
    Pleshakova, Ekaterina
    Korchagin, Sergey
    Gataullin, Timur
    Nikitin, Petr
    Hidirova, Mohiniso
    MATHEMATICS, 2022, 10 (02)
  • [40] Predicting reduction of COVID-19 cases in India Using Machine Learning Algorithm
    Welekar, Rashmi
    Tapase, Sharvari
    Bajaj, Shubhi
    Pande, Isha
    Verma, Abhishek
    Katpatal, Ashutosh
    Mishra, Vaibhav
    BIOSCIENCE BIOTECHNOLOGY RESEARCH COMMUNICATIONS, 2020, 13 (14): : 189 - 192