Augmenting energy time-series for data-efficient imputation of missing values

Cited by: 17
Authors
Liguori, Antonio [1]
Markovic, Romana [2]
Ferrando, Martina [3]
Frisch, Jerome [1]
Causone, Francesco [3]
van Treeck, Christoph [1]
Affiliations
[1] Rhein Westfal TH Aachen, E3D Inst Energy Efficiency & Sustainable Bldg, Mathieustr 30, D-52074 Aachen, Germany
[2] Karlsruhe Inst Technol, Bldg Sci Grp, Englerstr 7, D-76131 Karlsruhe, Germany
[3] Politecn Milan, Dept Energy, Via Lambruschini 4, I-20156 Milan, Italy
Keywords
Missing data; Data augmentation; Data scarcity; Building energy data; Deep learning; REPRESENTATIONS; NETWORK
DOI
10.1016/j.apenergy.2023.120701
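Chinese Library Classification
TE [Petroleum and natural gas industry]; TK [Energy and power engineering]
Discipline classification codes
0807; 0820
Abstract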
This study explores the applicability of data augmentation techniques for reconstructing missing energy time-series in limited data regimes. In particular, multiple synthetic copies of a relatively small training dataset are stacked together with pseudo-random noise. First, an existing convolutional denoising autoencoder is selected from a previous work, as the base imputation model of this study. Then, an optimal augmentation rate, which minimizes the training set of the model, is chosen based on the preliminary results obtained from one building. The results proved that, augmenting 80 times a nine days-long training set could reduce the initial average root mean squared error (RMSE) by 37% and 48%, for continuous and random missing scenarios. Additionally, the augmented model outperformed the benchmark methods with 23% and 12% lower average RMSE. No additional tuning or calibration costs were required for the existing base imputation model. Therefore, the presented data augmentation technique could significantly reduce the expensive computational costs associated with deep learning models.
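The augmentation step described in the abstract (stacking noisy synthetic copies of a small training set) can be illustrated with a minimal sketch. This is not the authors' implementation: the Gaussian noise model, the noise level, the choice to keep the original series alongside the copies, the sample values, and the function names `augment` and `rmse` are all assumptions made for illustration.

```python
import math
import random


def augment(series, copies, noise_std, seed=0):
    """Return the original series followed by `copies` noisy copies.

    Assumption: zero-mean Gaussian noise is added to every point. The
    abstract only says "pseudo-random noise", so the noise model is a guess.
    """
    rng = random.Random(seed)  # seeded so the augmentation is reproducible
    stacked = [list(series)]   # assumption: the original series is kept
    for _ in range(copies):
        stacked.append([x + rng.gauss(0.0, noise_std) for x in series])
    return stacked


def rmse(truth, estimate):
    """Root mean squared error between two equally long sequences."""
    squared = [(t - e) ** 2 for t, e in zip(truth, estimate)]
    return math.sqrt(sum(squared) / len(squared))


# Hypothetical load readings; the values and units are illustrative only.
train = [1.2, 1.0, 0.9, 1.4, 2.1, 2.8, 3.0, 2.5]

# An augmentation rate of 80, as reported in the abstract, with an
# assumed noise standard deviation of 0.05.
stacked = augment(train, copies=80, noise_std=0.05)
```

Here `stacked` holds 81 series of equal length (the original plus 80 copies), which would then serve as the enlarged training set for the imputation model. `rmse` is the error metric named in the abstract, applied to an imputed sequence and its ground truth.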
Pages: 18
Related papers
50 in total
  • [41] ESTIMATION OF TIME-SERIES MODELS IN THE PRESENCE OF MISSING DATA
    DUNSMUIR, W
    ROBINSON, PM
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1981, 76 (375) : 560 - 568
  • [42] Nonlinear time-series prediction with missing and noisy data
    Tresp, V
    Hofmann, R
    NEURAL COMPUTATION, 1998, 10 (03) : 731 - 747
  • [43] A multitaper spectral estimator for time-series with missing data
    Chave, Alan D.
    GEOPHYSICAL JOURNAL INTERNATIONAL, 2019, 218 (03) : 2165 - 2178
  • [44] Data Imputation for Multivariate Time Series Sensor Data With Large Gaps of Missing Data
    Wu, Rui
    Hamshaw, Scott D.
    Yang, Lei
    Kincaid, Dustin W.
    Etheridge, Randall
    Ghasemkhani, Amir
    IEEE SENSORS JOURNAL, 2022, 22 (11) : 10671 - 10683
  • [45] Imputation of continuous missing values in profile data
    Yang, Luo
    Wang, Kaibo
    QUALITY AND RELIABILITY ENGINEERING INTERNATIONAL, 2022, 38 (07) : 3644 - 3662
  • [46] Imputation strategies for missing data in environmental time series for an unlucky situation
    Mendola, D
    INNOVATIONS IN CLASSIFICATION, DATA SCIENCE, AND INFORMATION SYSTEMS, 2005, : 275 - 282
  • [47] MISSING OBSERVATIONS IN TIME-SERIES
    ABRAHAM, B
    COMMUNICATIONS IN STATISTICS PART A-THEORY AND METHODS, 1981, 10 (16) : 1643 - 1653
  • [48] ContrAttNet: Contribution and attention approach to multivariate time-series data imputation
    Yin, Yunfei
    Huang, Caihao
    Bao, Xianjian
    NETWORK-COMPUTATION IN NEURAL SYSTEMS, 2024,
  • [49] Time-Series Forecasting to Fill Missing Data in IoT Sensor Data
    Rosero-Montalvo, Paul D.
    Tozun, Pinar
    Hernandez, Wilmar
    IEEE SENSORS LETTERS, 2023, 7 (09)
  • [50] Application of a multi-stage neural network approach for time-series landfill gas modeling with missing data imputation
    Fallah, Bahareh
    Ng, Kelvin Tsun Wai
    Hoang Lan Vu
    Torabi, Farshid
    WASTE MANAGEMENT, 2020, 116 : 66 - 78