Augmenting energy time-series for data-efficient imputation of missing values

被引:17
|
作者
Liguori, Antonio [1 ]
Markovic, Romana [2 ]
Ferrando, Martina [3 ]
Frisch, Jerome [1 ]
Causone, Francesco [3 ]
van Treeck, Christoph [1 ]
机构
[1] Rhein Westfal TH Aachen, E3D Inst Energy Efficiency & Sustainable Bldg, Mathieustr 30, D-52074 Aachen, Germany
[2] Karlsruhe Inst Technol, Bldg Sci Grp, Englerstr 7, D-76131 Karlsruhe, Germany
[3] Politecn Milan, Dept Energy, Via Lambruschini 4, I-20156 Milan, Italy
关键词
Missing data; Data augmentation; Data scarcity; Building energy data; Deep learning; REPRESENTATIONS; NETWORK;
D O I
10.1016/j.apenergy.2023.120701
中图分类号
TE [石油、天然气工业]; TK [能源与动力工程];
学科分类号
0807 ; 0820 ;
摘要
This study explores the applicability of data augmentation techniques for reconstructing missing energy time -series in limited data regimes. In particular, multiple synthetic copies of a relatively small training dataset are stacked together with pseudo-random noise. First, an existing convolutional denoising autoencoder is selected from a previous work, as the base imputation model of this study. Then, an optimal augmentation rate, which minimizes the training set of the model, is chosen based on the preliminary results obtained from one building. The results proved that, augmenting 80 times a nine days-long training set could reduce the initial average root mean squared error (RMSE) by 37% and 48%, for continuous and random missing scenarios. Additionally, the augmented model outperformed the benchmark methods with 23% and 12% lower average RMSE. No additional tuning or calibration costs were required for the existing base imputation model. Therefore, the presented data augmentation technique could significantly reduce the expensive computational costs associated with deep learning models.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] Imputation of missing data in time series for air pollutants
    Junger, W. L.
    de Leon, A. Ponce
    ATMOSPHERIC ENVIRONMENT, 2015, 102 : 96 - 104
  • [22] Missing Data Imputation in Time Series by Evolutionary Algorithms
    Figueroa Garcia, Juan C.
    Kalenatic, Dusko
    Lopez Bello, Cesar Amilcar
    ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS, PROCEEDINGS: WITH ASPECTS OF ARTIFICIAL INTELLIGENCE, 2008, 5227 : 275 - +
  • [23] A NOTE ON LIKELIHOOD ESTIMATION OF MISSING VALUES IN TIME-SERIES
    PENA, D
    TIAO, GC
    AMERICAN STATISTICIAN, 1991, 45 (03): : 212 - 213
  • [24] MISSING VALUES IN TIME-SERIES AND IMPLICATIONS ON AUTOCORRELATION ANALYSIS
    LIMONARD, CBG
    ANALYTICA CHIMICA ACTA-COMPUTER TECHNIQUES AND OPTIMIZATION, 1978, 2 (02): : 133 - 140
  • [25] On the Imputation of Missing Values in Univariate PM10 Time Series
    Albano, G.
    La Rocca, M.
    Perna, C.
    COMPUTER AIDED SYSTEMS THEORY - EUROCAST 2017, PT II, 2018, 10672 : 12 - 19
  • [26] Trend Tests in Time Series with Missing Values: a Case Study with Imputation
    Rosario Ramos, M.
    Cordeiro, Clara
    11TH INTERNATIONAL CONFERENCE OF NUMERICAL ANALYSIS AND APPLIED MATHEMATICS 2013, PTS 1 AND 2 (ICNAAM 2013), 2013, 1558 : 1909 - 1912
  • [27] Time-series analysis if data are randomly missing
    Broersen, PMT
    Bos, R
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2006, 55 (01) : 79 - 84
  • [28] What to Do about Missing Values in Time-Series Cross-Section Data
    Honaker, James
    King, Gary
    AMERICAN JOURNAL OF POLITICAL SCIENCE, 2010, 54 (02) : 561 - 581
  • [29] Missing Value Imputation of Time-Series Air-Quality Data via Deep Neural Networks
    Kim, Taesung
    Kim, Jinhee
    Yang, Wonho
    Lee, Hunjoo
    Choo, Jaegul
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2021, 18 (22)
  • [30] Time Series Data and Recent Imputation Techniques for Missing Data: A Review
    Zainuddin, Aznilinda
    Hairuddin, Muhammad Asraf
    Yassin, Ahmad Ihsan Mohd
    Abd Latiff, Zatul Iffah
    Azhar, Aziemah
    2022 INTERNATIONAL CONFERENCE ON GREEN ENERGY, COMPUTING AND SUSTAINABLE TECHNOLOGY (GECOST), 2022, : 346 - 350