A comparison of missing-data procedures for ARIMA time-series analysis

被引:58
|
作者
Velicer, WF
Colby, SM
机构
[1] Univ Rhode Isl, Canc Prevent Res Ctr, Kingston, RI 02881 USA
[2] Brown Univ, Providence, RI 02912 USA
关键词
missing data; ARIMA models; time-series analysis; autocorrelation;
D O I
10.1177/0013164404272502
中图分类号
G44 [教育心理学];
学科分类号
0402 ; 040202 ;
摘要
Missing data are a common practical problem for longitudinal designs. Time-series analysis is a longitudinal method that involves a large number of observations on a single unit. Four different missing-data methods (deletion, mean substitution, mean of adjacent observations, and maximum likelihood estimation) were evaluated. Computer-generated time-series data of length 100 were generated for 50 different conditions representing five levels ofautocorrelation, two levels of slope, and five levels of proportion of missing data. Methods were compared with respect to the accuracy of estimation for four parameters (level, error variance, degree of autocorrelation, and slope). The choice of method had a major impact on the analysis. The maximum likelihood very accurately estimated all four parameters under all conditions tested. The mean of the series was the least accurate approach. Statistical methods such as the maximum likelihood procedure represent a superior approach to missing data.
引用
收藏
页码:596 / 615
页数:20
相关论文
共 50 条
  • [41] Optimal Frequency-domain Analysis for Spacecraft Time Series: Introducing the Missing-data Multitaper Power Spectrum Estimator
    Dodson-Robinson, Sarah
    Haley, Charlotte
    ASTRONOMICAL JOURNAL, 2024, 167 (01):
  • [42] Effects of missing data imputation methods on univariate blood pressure time series data analysis and forecasting with ARIMA and LSTM
    Niako, Nicholas
    Melgarejo, Jesus D.
    Maestre, Gladys E.
    Vatcheva, Kristina P.
    BMC MEDICAL RESEARCH METHODOLOGY, 2024, 24 (01)
  • [43] INTERPOLATING MISSING VALUES IN A TIME-SERIES
    DAMSLETH, E
    SCANDINAVIAN JOURNAL OF STATISTICS, 1980, 7 (01) : 33 - 39
  • [44] COMPARISON OF REGRESSION AND TIME-SERIES METHODS FOR SYNTHESIZING MISSING STREAMFLOW RECORDS
    BEAUCHAMP, JJ
    DOWNING, DJ
    RAILSBACK, SF
    WATER RESOURCES BULLETIN, 1989, 25 (05): : 961 - 975
  • [45] Visual Imputation Analytics for Missing Time-Series Data in Bayesian Network
    Yeon, Hanbyul
    Son, Hyesook
    Jang, Yun
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP 2020), 2020, : 303 - 310
  • [46] THE ESTIMATION OF MISSING OBSERVATIONS IN RELATED TIME-SERIES DATA - FURTHER RESULTS
    BROWN, KC
    KADIYALA, KR
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 1985, 14 (04) : 973 - 981
  • [47] Distance measures for effective clustering of ARIMA time-series
    Kalpakis, K
    Gada, D
    Puttagunta, V
    2001 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2001, : 273 - 280
  • [48] ARE THERE APPLICATIONS OF TIME-SERIES ANALYSIS (ARIMA-MODEL) IN THE EEG-RESEARCH
    SPIEL, G
    SPIEL, C
    INTERNATIONAL JOURNAL OF PSYCHOPHYSIOLOGY, 1986, 4 (03) : 261 - 262
  • [49] Population-Level Administration of AlcoholEdu for College: An ARIMA Time-Series Analysis
    Wyatt, Todd M.
    DeJong, William
    Dixon, Elizabeth
    JOURNAL OF HEALTH COMMUNICATION, 2013, 18 (08) : 898 - 912
  • [50] Time-series analysis with neural networks and ARIMA-neural network hybrids
    Hansen, JV
    Nelson, RD
    JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2003, 15 (03) : 315 - 330