Early Time Series Classification Using Reinforcement Learning for Pre-Symptomatic Covid-19 Screening From Imbalanced Health Tracker Data

被引:0
|
作者
Sarwar, Atifa [1 ]
Almadani, Abdulsalam [1 ]
Agu, Emmanuel O. [1 ]
机构
[1] Worcester Polytech Inst, Worcester, MA 01609 USA
关键词
COVID-19; Accuracy; Feature extraction; Heart rate; Time series analysis; Physiology; Infectious diseases; Testing; Reinforcement learning; Pandemics; Infectious Diseases; Covid-19; Passive Screening; Physiological signs; Early Time Series Classification; Class Imbalance; Reinforcement Learning; Health Tracker;
D O I
10.1109/JBHI.2024.3509630
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Early detection of infectious diseases such as Covid-19 can limit transmission and curb pandemics. This study proposes EarlyDetect, an end-to-end framework for early Covid-19 detection using heart rate and step data collected passively from consumer-grade health trackers. A key challenge in early Covid-19 detection is determining the optimal amount of historical data (e.g., past days) a machine learning model should analyze to achieve the earliest possible, yet accurate, infection detection. Leveraging Reinforcement Learning-based Early Time Series Classification, EarlyDetect extracts 45 digital biomarkers (daily steps, daytime/nighttime HR, mesor, sedentary time), and feeds them into a deep Multi-layer Perceptron neural network model trained using Double Deep Q-Network. At each iteration, EarlyDetect dynamically decides whether to wait for more data or proceed with classifying the window of data observed so far. A novel reward function ensures early yet accurate classification in imbalanced class distributions. Using heart rate and steps values over 72 hours lookback window, EarlyDetect achieves an accuracy of 0.8 (95% CI 0.71-0.89), AUC-ROC of 0.73 (95% CI: 0.6-0.86), an earliness of 0.07 (95% CI: 0.05-0.09), thus requiring up to 86% less data than existing methods while predicting Covid-19 status 50% earlier (smaller detection window). Performance on two Covid-19 datasets was encouraging, identifying 61% and 46% of Covid+ cases before the coronavirus reached peak transmissibility. EarlyDetect is a significant advancement in early infectious disease screening, and is the first method to dynamically determine an optimal lookback window size for Covid-19 detection from physiological signs on imbalanced datasets using Reinforcement Learning-based Early Time Series Classification.
引用
收藏
页码:2246 / 2256
页数:11
相关论文
共 50 条
  • [31] Classification of COVID-19 from tuberculosis and pneumonia using deep learning techniques
    Venkataramana, Lokeswari
    Prasad, D. Venkata Vara
    Saraswathi, S.
    Mithumary, C. M.
    Karthikeyan, R.
    Monika, N.
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2022, 60 (09) : 2681 - 2691
  • [32] Early risk assessment for COVID-19 patients from emergency department data using machine learning
    Frank S. Heldt
    Marcela P. Vizcaychipi
    Sophie Peacock
    Mattia Cinelli
    Lachlan McLachlan
    Fernando Andreotti
    Stojan Jovanović
    Robert Dürichen
    Nadezda Lipunova
    Robert A. Fletcher
    Anne Hancock
    Alex McCarthy
    Richard A. Pointon
    Alexander Brown
    James Eaton
    Roberto Liddi
    Lucy Mackillop
    Lionel Tarassenko
    Rabia T. Khan
    Scientific Reports, 11
  • [33] Early risk assessment for COVID-19 patients from emergency department data using machine learning
    Heldt, Frank S.
    Vizcaychipi, Marcela P.
    Peacock, Sophie
    Cinelli, Mattia
    McLachlan, Lachlan
    Andreotti, Fernando
    Jovanovic, Stojan
    Durichen, Robert
    Lipunova, Nadezda
    Fletcher, Robert A.
    Hancock, Anne
    McCarthy, Alex
    Pointon, Richard A.
    Brown, Alexander
    Eaton, James
    Liddi, Roberto
    Mackillop, Lucy
    Tarassenko, Lionel
    Khan, Rabia T.
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [34] COVID-19 early detection for imbalanced or low number of data using a regularized cost-sensitive CapsNet
    Malihe Javidi
    Saeid Abbaasi
    Sara Naybandi Atashi
    Mahdi Jampour
    Scientific Reports, 11
  • [35] COVID-19 early detection for imbalanced or low number of data using a regularized cost-sensitive CapsNet
    Javidi, Malihe
    Abbaasi, Saeid
    Naybandi Atashi, Sara
    Jampour, Mahdi
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [36] Time Series Analysis of COVID-19 Data- A study from Northern India
    Semwal, Jayanti
    Bahuguna, Abhinav
    Sharma, Neha
    Dikshit, Rajiv Kumar
    Bijalwan, Rajeev
    Augustine, Piyush
    INDIAN JOURNAL OF COMMUNITY HEALTH, 2022, 34 (02) : 202 - 206
  • [37] Detection of COVID-19 Using Protein Sequence Data via Machine Learning Classification Approach
    Aminah, Siti
    Ardaneswari, Gianinna
    Husnah, Mufarrido
    Deori, Ghani
    Prasetyo, Handi Bagus
    JOURNAL OF APPLIED MATHEMATICS, 2023, 2023
  • [38] Results of a hospitalization policy of asymptomatic and pre-symptomatic COVID-19-positive long-term care facility residents in the province of Salzburg—a report from the AGMT COVID-19 Registry
    Florian Huemer
    Gabriel Rinnerthaler
    Benedikt Jörg
    Patrick Morre
    Birgit Stegbuchner
    Elisabeth Proksch
    Stefanie Fleimisch
    Hannes Oberkofler
    Iris Kremser
    Richard Greil
    Alexander Egle
    GeroScience, 2021, 43 : 1877 - 1897
  • [39] Capturing asymmetry in COVID-19 counts using an improved skewness measure for time series data *
    Bapat, Sudeep R.
    METHODSX, 2023, 11
  • [40] Prediction of Bontang City COVID-19 Data Time Series Using the Facebook Prophet Method
    Kasturi, Kurnia
    Putera, M. Ihsan Alfani
    Natasia, Sri Rahayu
    2021 4TH INTERNATIONAL SEMINAR ON RESEARCH OF INFORMATION TECHNOLOGY AND INTELLIGENT SYSTEMS (ISRITI 2021), 2020,