Early Time Series Classification Using Reinforcement Learning for Pre-Symptomatic Covid-19 Screening From Imbalanced Health Tracker Data

Cited by: 0
Authors
Sarwar, Atifa [1]
Almadani, Abdulsalam [1]
Agu, Emmanuel O. [1]
Affiliations
[1] Worcester Polytech Inst, Worcester, MA 01609 USA
Keywords
COVID-19; Accuracy; Feature extraction; Heart rate; Time series analysis; Physiology; Infectious diseases; Testing; Reinforcement learning; Pandemics; Infectious Diseases; Covid-19; Passive Screening; Physiological signs; Early Time Series Classification; Class Imbalance; Reinforcement Learning; Health Tracker;
DOI
10.1109/JBHI.2024.3509630
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Early detection of infectious diseases such as Covid-19 can limit transmission and curb pandemics. This study proposes EarlyDetect, an end-to-end framework for early Covid-19 detection using heart rate and step data collected passively from consumer-grade health trackers. A key challenge in early Covid-19 detection is determining the optimal amount of historical data (e.g., past days) a machine learning model should analyze to achieve the earliest possible, yet accurate, infection detection. Leveraging Reinforcement Learning-based Early Time Series Classification, EarlyDetect extracts 45 digital biomarkers (daily steps, daytime/nighttime HR, mesor, sedentary time) and feeds them into a deep Multi-layer Perceptron neural network model trained using a Double Deep Q-Network. At each iteration, EarlyDetect dynamically decides whether to wait for more data or to classify the window of data observed so far. A novel reward function ensures early yet accurate classification in imbalanced class distributions. Using heart rate and step values over a 72-hour lookback window, EarlyDetect achieves an accuracy of 0.8 (95% CI: 0.71-0.89), an AUC-ROC of 0.73 (95% CI: 0.6-0.86), and an earliness of 0.07 (95% CI: 0.05-0.09), thus requiring up to 86% less data than existing methods while predicting Covid-19 status 50% earlier (smaller detection window). Performance on two Covid-19 datasets was encouraging, identifying 61% and 46% of Covid+ cases before the coronavirus reached peak transmissibility. EarlyDetect is a significant advancement in early infectious disease screening, and is the first method to dynamically determine an optimal lookback window size for Covid-19 detection from physiological signs on imbalanced datasets using Reinforcement Learning-based Early Time Series Classification.
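The wait-or-classify decision described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not give the reward function, the state encoding, or the earliness formula, so `reward`, its `pos_weight` and `wait_cost` parameters, the forced decision at the end of the series, and the definition of earliness as the fraction of the series observed before classifying are all assumptions made for illustration. `threshold_policy` is a hypothetical stand-in for the trained network. Only `double_dqn_target` follows a standard, well-known rule: the online network selects the next action and the target network evaluates it.

```python
from typing import Callable, Sequence, Tuple

WAIT, PREDICT_NEG, PREDICT_POS = 0, 1, 2


def reward(action: int, label: int, t: int, horizon: int,
           pos_weight: float = 5.0, wait_cost: float = 0.1) -> float:
    """Hypothetical reward (an assumption, not the paper's function).

    Waiting costs a little more at each step. A prediction on the minority
    (positive) class is weighted more heavily than one on the majority class,
    and a wrong prediction is penalised by the same weight.
    """
    if action == WAIT:
        return -wait_cost * t / horizon
    predicted = 1 if action == PREDICT_POS else 0
    weight = pos_weight if label == 1 else 1.0
    return weight if predicted == label else -weight


def double_dqn_target(r: float, done: bool,
                      q_online_next: Sequence[float],
                      q_target_next: Sequence[float],
                      gamma: float = 0.99) -> float:
    """Standard Double DQN target: online net picks, target net scores."""
    if done:
        return r
    best = max(range(len(q_online_next)), key=lambda a: q_online_next[a])
    return r + gamma * q_target_next[best]


def run_episode(series: Sequence[float], label: int,
                policy: Callable[[Sequence[float]], int]
                ) -> Tuple[int, float, float]:
    """Reveal one time step at a time until the policy commits to a class.

    Returns (predicted class, earliness, total reward). Earliness is taken
    here to be the fraction of the series observed (assumed definition).
    """
    horizon = len(series)
    total = 0.0
    for t in range(1, horizon + 1):
        action = policy(series[:t])
        if t == horizon and action == WAIT:
            action = PREDICT_NEG  # forced decision at the end (assumption)
        total += reward(action, label, t, horizon)
        if action != WAIT:
            return (1 if action == PREDICT_POS else 0), t / horizon, total
    raise ValueError("series must contain at least one observation")


def threshold_policy(window: Sequence[float]) -> int:
    """Toy stand-in for the trained network: flag a rise of 5 or more."""
    if len(window) >= 2 and window[-1] - window[0] >= 5.0:
        return PREDICT_POS
    return WAIT


if __name__ == "__main__":
    # Made-up resting heart-rate values, for illustration only.
    print(run_episode([60, 61, 66, 70, 72, 71], 1, threshold_policy))
    print(run_episode([60, 60, 61, 60], 0, threshold_policy))
```

In this sketch the time-scaled waiting cost pushes the agent toward earlier decisions, while the larger weight on positive cases keeps it from always predicting the majority class. Both are one plausible way to balance earliness and accuracy under class imbalance, not the balance the paper reports.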
Pages: 2246-2256
Page count: 11