Site-Specific Deterministic Temperature and Dew Point Forecasts with Explainable and Reliable Machine Learning

Cited by: 1
Authors
Han, Mengmeng [1]
Leeuwenburg, Tennessee [2]
Murphy, Brad [2]
Affiliations
[1] Bureau of Meteorology, 32 Turbot St, Brisbane, Qld 4000, Australia
[2] Bureau of Meteorology, 700 Collins St, Docklands, Vic 3008, Australia
Source
APPLIED SCIENCES-BASEL | 2024 / Vol. 14 / Issue 14
Keywords
weather forecast; gradient boosting decision tree; machine learning; XGBoost; NWP post-processing; SHAP
DOI
10.3390/app14146314
Chinese Library Classification (CLC) number
O6 [Chemistry]
Discipline classification code
0703
Abstract
Site-specific weather forecasts are essential for accurate prediction of power demand and are consequently of great interest to energy operators. However, current numerical weather prediction (NWP) models lack the fine-scale detail needed to capture all important characteristics of localised real-world sites. Instead, they provide weather information representing a rectangular gridbox (usually kilometres in size). Even after post-processing and bias correction, area-averaged information is usually not optimal for specific sites. Prior work on site-optimised forecasts has focused on linear methods, weighted consensus averaging, and time-series methods, among others. Recent developments in machine learning (ML) have prompted increasing interest in applying ML to this problem. In this study, we investigate the feasibility of optimising forecasts at sites using gradient boosted decision trees, a popular ML model, as implemented in the XGBoost package (v1.7.3) for Python. Regression trees were trained on historical NWP output and site observations to predict temperature and dew point at multiple sites across Australia. We developed a working ML framework, named "Multi-SiteBoost", and initial test results show a significant improvement over gridded values from bias-corrected NWP models. The improvement from XGBoost (0.1-0.6 °C, a 4-27% improvement in temperature) is comparable with that of non-ML methods reported in the literature. Using insights provided by SHapley Additive exPlanations (SHAP), this study also tests various approaches to understanding the ML predictions and increasing the reliability of ML-generated forecasts.
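The idea of correcting a gridded forecast towards a site with boosted regression trees can be illustrated with a toy sketch. This is not the authors' Multi-SiteBoost framework and it does not use XGBoost: it is a minimal pure-Python gradient boosting loop over one-split trees (stumps), run on synthetic data. The site offset, the noise levels, the two features (hour of day and gridded temperature) and every function name are assumptions made up for the illustration.

```python
import math
import random


def make_sample(rng):
    """Synthetic (features, gridded forecast, site observation). All values are made up."""
    hour = rng.randrange(24)
    nwp_temp = 20.0 + 6.0 * math.cos((hour - 15) * math.pi / 12.0) + rng.gauss(0.0, 2.0)
    # Assumed site effect: the site runs colder than the gridbox at night.
    site_offset = -1.5 if (hour < 6 or hour >= 20) else 0.5
    obs_temp = nwp_temp + site_offset + rng.gauss(0.0, 0.5)
    return (float(hour), nwp_temp), nwp_temp, obs_temp


def fit_stump(X, residuals):
    """Return (feature, threshold, left_mean, right_mean) minimising squared error."""
    best = None
    for j in range(len(X[0])):
        values = sorted(set(row[j] for row in X))
        for lo, hi in zip(values, values[1:]):
            thr = (lo + hi) / 2.0
            left = [r for row, r in zip(X, residuals) if row[j] <= thr]
            right = [r for row, r in zip(X, residuals) if row[j] > thr]
            lmean = sum(left) / len(left)
            rmean = sum(right) / len(right)
            sse = (sum((r - lmean) ** 2 for r in left)
                   + sum((r - rmean) ** 2 for r in right))
            if best is None or sse < best[0]:
                best = (sse, j, thr, lmean, rmean)
    return best[1:]


def fit_boosted(X, y, n_rounds=20, learning_rate=0.3):
    """Gradient boosting for squared error: each stump fits the current residuals."""
    base = sum(y) / len(y)
    preds = [base] * len(y)
    stumps = []
    for _ in range(n_rounds):
        residuals = [t - p for t, p in zip(y, preds)]
        j, thr, lmean, rmean = fit_stump(X, residuals)
        # Store leaf values already scaled by the learning rate.
        stump = (j, thr, learning_rate * lmean, learning_rate * rmean)
        stumps.append(stump)
        preds = [p + (stump[2] if row[j] <= thr else stump[3])
                 for p, row in zip(preds, X)]
    return base, stumps


def predict(base, stumps, row):
    """Sum the base value and every stump's contribution for one feature row."""
    return base + sum(lv if row[j] <= thr else rv for j, thr, lv, rv in stumps)


rng = random.Random(0)
train = [make_sample(rng) for _ in range(150)]
test = [make_sample(rng) for _ in range(100)]

# Learn the correction (observation minus gridded forecast), not the temperature itself.
X_train = [features for features, _, _ in train]
y_train = [obs - nwp for _, nwp, obs in train]
base, stumps = fit_boosted(X_train, y_train)

mae_raw = sum(abs(obs - nwp) for _, nwp, obs in test) / len(test)
mae_boosted = sum(abs(obs - (nwp + predict(base, stumps, features)))
                  for features, nwp, obs in test) / len(test)
print(f"MAE, gridded forecast:  {mae_raw:.2f} degrees C")
print(f"MAE, boosted correction: {mae_boosted:.2f} degrees C")
print(f"Relative improvement:    {100.0 * (mae_raw - mae_boosted) / mae_raw:.0f}%")
```

The sketch learns the site-minus-grid difference rather than the temperature itself, which keeps the toy problem small. Whether the study frames its regression target this way is not stated in the abstract.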
Pages: 26