Augmenting energy time-series for data-efficient imputation of missing values

Cited by: 17
Authors
Liguori, Antonio [1]
Markovic, Romana [2]
Ferrando, Martina [3]
Frisch, Jerome [1]
Causone, Francesco [3]
van Treeck, Christoph [1]
Affiliations
[1] Rhein Westfal TH Aachen, E3D Inst Energy Efficiency & Sustainable Bldg, Mathieustr 30, D-52074 Aachen, Germany
[2] Karlsruhe Inst Technol, Bldg Sci Grp, Englerstr 7, D-76131 Karlsruhe, Germany
[3] Politecn Milan, Dept Energy, Via Lambruschini 4, I-20156 Milan, Italy
Keywords
Missing data; Data augmentation; Data scarcity; Building energy data; Deep learning; REPRESENTATIONS; NETWORK
DOI
10.1016/j.apenergy.2023.120701
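Chinese Library Classification
TE [Petroleum and Natural Gas Industry]; TK [Energy and Power Engineering]
Discipline classification codes
0807; 0820
Abstract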
This study explores the applicability of data augmentation techniques for reconstructing missing energy time-series in limited data regimes. In particular, multiple synthetic copies of a relatively small training dataset are stacked together with pseudo-random noise. First, an existing convolutional denoising autoencoder from a previous work is selected as the base imputation model of this study. Then, an optimal augmentation rate, which minimizes the training set of the model, is chosen based on preliminary results obtained from one building. The results showed that augmenting a nine-day training set 80 times could reduce the initial average root mean squared error (RMSE) by 37% and 48% for the continuous and random missing scenarios, respectively. Additionally, the augmented model outperformed the benchmark methods, with 23% and 12% lower average RMSE. No additional tuning or calibration costs were incurred for the existing base imputation model. Therefore, the presented data augmentation technique could significantly reduce the high computational costs associated with deep learning models.
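The augmentation step described in the abstract (stacking noisy synthetic copies of a small training set) can be illustrated with a minimal sketch. Everything specific here is an assumption for illustration and is not taken from the paper: the function names `augment_with_noise` and `rmse`, the Gaussian noise model, the `noise_std` value, the 24-value daily windows, and the reading of "augmenting 80 times" as the original set plus 80 noisy copies.

```python
import math
import random


def augment_with_noise(windows, n_copies, noise_std=0.05, seed=0):
    """Return the original windows followed by n_copies noisy copies of them.

    Assumptions (not from the paper): additive Gaussian noise with standard
    deviation noise_std, applied independently to every value.
    """
    rng = random.Random(seed)  # seeded pseudo-random generator for repeatability
    augmented = [list(w) for w in windows]  # keep the original samples first
    for _ in range(n_copies):
        for w in windows:
            augmented.append([x + rng.gauss(0.0, noise_std) for x in w])
    return augmented


def rmse(y_true, y_pred):
    """Root mean squared error between two equally long sequences."""
    squared = [(t - p) ** 2 for t, p in zip(y_true, y_pred)]
    return math.sqrt(sum(squared) / len(squared))


# Placeholder data: nine daily windows of 24 hourly values (synthetic, not real readings).
data_rng = random.Random(1)
train = [[data_rng.random() for _ in range(24)] for _ in range(9)]

augmented = augment_with_noise(train, n_copies=80)
print(len(augmented), len(augmented[0]))  # 729 24
```

Under these assumptions the nine original windows grow to 9 x 81 = 729 training samples, while the imputation model itself is left unchanged; `rmse` is the metric one would compare before and after augmentation.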
Pages: 18