Deep imputation of missing values in time series health data: A review with benchmarking

被引:13
|
作者
Kazijevs, Maksims [1 ]
Samad, Manar D. [1 ]
机构
[1] Tennessee State Univ, Dept Comp Sci, Nashville, TN 37209 USA
基金
美国国家卫生研究院;
关键词
Time series; Multivariate data; Longitudinal imputation; Cross-sectional imputation; Missing value imputation; Deep neural network; Electronic health records; Sensor data;
D O I
10.1016/j.jbi.2023.104440
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The imputation of missing values in multivariate time series (MTS) data is critical in ensuring data quality and producing reliable data-driven predictive models. Apart from many statistical approaches, a few recent studies have proposed state-of-the-art deep learning methods to impute missing values in MTS data. However, the evaluation of these deep methods is limited to one or two data sets, low missing rates, and completely random missing value types. This survey performs six data-centric experiments to benchmark state-of-the-art deep imputation methods on five time series health data sets. Our extensive analysis reveals that no single imputation method outperforms the others on all five data sets. The imputation performance depends on data types, individual variable statistics, missing value rates, and types. Deep learning methods that jointly perform cross-sectional (across variables) and longitudinal (across time) imputations of missing values in time series data yield statistically better data quality than traditional imputation methods. Although computationally expensive, deep learning methods are practical given the current availability of high-performance computing resources, especially when data quality and sample size are of paramount importance in healthcare informatics. Our findings highlight the importance of data-centric selection of imputation methods to optimize data-driven predictive models.
引用
收藏
页数:18
相关论文
共 50 条
  • [41] Missing Value Imputation of Time-Series Air-Quality Data via Deep Neural Networks
    Kim, Taesung
    Kim, Jinhee
    Yang, Wonho
    Lee, Hunjoo
    Choo, Jaegul
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2021, 18 (22)
  • [42] Binned Data Provide Better Imputation of Missing Time Series Data from Wearables
    Chakrabarti, Shweta
    Biswas, Nupur
    Karnani, Khushi
    Padul, Vijay
    Jones, Lawrence D.
    Kesari, Santosh
    Ashili, Shashaanka
    SENSORS, 2023, 23 (03)
  • [43] Deep Learning Approach for Imputation of Missing Values in Actigraphy Data: Algorithm Development Study
    Jang, Jong-Hwan
    Choi, Junggu
    Roh, Hyun Woong
    Son, Sang Joon
    Hong, Chang Hyung
    Kim, Eun Young
    Kim, Tae Young
    Yoon, Dukyong
    JMIR MHEALTH AND UHEALTH, 2020, 8 (07):
  • [44] Multivariate Time Series Missing Data Imputation Using Recurrent Denoising Autoencoder
    Zhang, Jianye
    Yin, Peng
    2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 760 - 764
  • [45] Visual Imputation Analytics for Missing Time-Series Data in Bayesian Network
    Yeon, Hanbyul
    Son, Hyesook
    Jang, Yun
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP 2020), 2020, : 303 - 310
  • [46] REGRESSION IMPUTATION OF MISSING VALUES IN LONGITUDINAL DATA SETS
    SCHNEIDERMAN, ED
    KOWALSKI, CJ
    WILLIS, SM
    INTERNATIONAL JOURNAL OF BIO-MEDICAL COMPUTING, 1993, 32 (02): : 121 - 133
  • [47] Robust imputation method for missing values in microarray data
    Yoon, Dankyu
    Lee, Eun-Kyung
    Park, Taesung
    BMC BIOINFORMATICS, 2007, 8 (Suppl 2)
  • [48] Treatment of missing values with imputation for the analysis of otologic data
    Laurikkala, J
    Kentala, E
    Juhola, M
    Pyykkö, I
    MEDICAL INFORMATICS EUROPE '99, 1999, 68 : 428 - 431
  • [49] Robust imputation method for missing values in microarray data
    Dankyu Yoon
    Eun-Kyung Lee
    Taesung Park
    BMC Bioinformatics, 8
  • [50] Imputation of missing values in multi-view data
    van Loon, Wouter
    de Vos, Frank
    de Vos, Frank
    Koini, Marisa
    Schmidt, Reinhold
    de Rooij, Mark
    INFORMATION FUSION, 2024, 111