Evaluation of Missing Data Imputation Methods for an Enhanced Distributed PV Generation Prediction

被引:8
|
作者
Sundararajan, Aditya [1 ]
Sarwat, Arif I. [1 ]
机构
[1] Florida Int Univ, Miami, FL 33199 USA
基金
美国国家科学基金会;
关键词
Distributed PV; Missing data; Data processing; Imputation methods; PV Generation Prediction; INCOMPLETE DATA;
D O I
10.1007/978-3-030-32520-6_43
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To effectively predict generation of distributed photovoltaic (PV) systems, three parameters are critical: irradiance, ambient temperature, and module temperature. However, their completeness cannot be guaranteed because of issues in data acquisition. Many methods in literature address missingness, but their applicability varies with missingness mechanism. Exploration of methods to impute missing data in PV systems is lacking. This paper conducts statistical analyses to understand missingness mechanism in data of a real grid-tied 1.4MW PV system at Miami, and compares the imputation performance of different methods: random imputation, multiple imputation using expectation-maximization, kNN, and random forests, using error metrics and size effect measures. Imputed values are used in a multilayer perceptron to predict and compare PV generation with observed values. Results show that values imputed using kNN and random forests have the least differences in proportions and help utilities make more accurate prediction of generation for distribution planning.
引用
收藏
页码:590 / 609
页数:20
相关论文
共 50 条
  • [21] A new analytical framework for missing data imputation and classification with uncertainty: Missing data imputation and heart failure readmission prediction
    Hu, Zhiyong
    Du, Dongping
    PLOS ONE, 2020, 15 (09):
  • [22] A comparison of imputation methods for the consecutive missing temperature data
    Kim, Hee-Kyung
    Kang, In-Kyeong
    Lee, Jae-Won
    Lee, Yung-Seop
    KOREAN JOURNAL OF APPLIED STATISTICS, 2016, 29 (03) : 549 - 557
  • [23] Some imputation methods for missing data in sample surveys
    Singh, G. N.
    Maurya, S.
    Khetan, M.
    Kadilar, Cem
    Hacettepe Journal of Mathematics and Statistics, 2016, 45 (06): : 1865 - 1880
  • [24] Ensemble imputation methods for missing software engineering data
    Twala, B
    Cartwright, M
    2005 11TH INTERNATIONAL SYMPOSIUM ON SOFTWARE METRICS (METRICS), 2005, : 268 - 277
  • [25] Application and Comparison of Imputation Methods for Missing Degradation Data
    Fan, Ye
    Sun, Fuqiang
    Jiang, Tongmin
    ENGINEERING ASSET MANAGEMENT - SYSTEMS, PROFESSIONAL PRACTICES AND CERTIFICATION, 2015, : 1607 - 1614
  • [26] New imputation methods for missing data using quantiles
    Munoz, J. F.
    Rueda, M.
    JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2009, 232 (02) : 305 - 317
  • [27] Comparison of imputation methods for missing laboratory data in medicine
    Waljee, Akbar K.
    Mukherjee, Ashin
    Singal, Amit G.
    Zhang, Yiwei
    Warren, Jeffrey
    Balis, Ulysses
    Marrero, Jorge
    Zhu, Ji
    Higgins, Peter D. R.
    BMJ OPEN, 2013, 3 (08):
  • [28] Imputation Methods for Multiple Regression with Missing Heteroscedastic Data
    Asif, Muhammad
    Samart, Klairung
    THAILAND STATISTICIAN, 2022, 20 (01): : 1 - 15
  • [29] Missing data imputation methods and their performance with biodistance analyses
    Kenyhercz, Michael W.
    Passalacqua, Nicholas V.
    AMERICAN JOURNAL OF PHYSICAL ANTHROPOLOGY, 2015, 156 : 185 - 185
  • [30] Evaluating Imputation Methods for Missing Data in a MCI Dataset
    Gomez-Valades Batanero, Alba
    Rincon Zamorano, Mariano
    Martinez Tomas, Rafael
    Guerrero Martin, Juan
    ARTIFICIAL INTELLIGENCE IN NEUROSCIENCE: AFFECTIVE ANALYSIS AND HEALTH APPLICATIONS, PT I, 2022, 13258 : 446 - 454