Super ensemble learning for daily streamflow forecasting: large-scale demonstration and comparison with multiple machine learning algorithms

被引:0
|
作者
Hristos Tyralis
Georgia Papacharalampous
Andreas Langousis
机构
[1] National Technical University of Athens,Department of Water Resources and Environmental Engineering, School of Civil Engineering
[2] Elefsina Air Base,Air Force Support Command, Hellenic Air Force
[3] University of Patras,Department of Civil Engineering, School of Engineering
来源
Neural Computing and Applications | 2021年 / 33卷
关键词
Combining forecasts; Ensemble learning; Hydrology; Stacking;
D O I
暂无
中图分类号
学科分类号
摘要
Daily streamflow forecasting through data-driven approaches is traditionally performed using a single machine learning algorithm. Existing applications are mostly restricted to examination of few case studies, not allowing accurate assessment of the predictive performance of the algorithms involved. Here, we propose super learning (a type of ensemble learning) by combining 10 machine learning algorithms. We apply the proposed algorithm in one-step-ahead forecasting mode. For the application, we exploit a big dataset consisting of 10-year long time series of daily streamflow, precipitation and temperature from 511 basins. The super ensemble learner improves over the performance of the linear regression algorithm by 20.06%, outperforming the “hard to beat in practice” equal weight combiner. The latter improves over the performance of the linear regression algorithm by 19.21%. The best performing individual machine learning algorithm is neural networks, which improves over the performance of the linear regression algorithm by 16.73%, followed by extremely randomized trees (16.40%), XGBoost (15.92%), loess (15.36%), random forests (12.75%), polyMARS (12.36%), MARS (4.74%), lasso (0.11%) and support vector regression (− 0.45%). Furthermore, the super ensemble learner outperforms exponential smoothing and autoregressive integrated moving average (ARIMA). These latter two models improve over the performance of the linear regression algorithm by 13.89% and 8.77%, respectively. Based on the obtained large-scale results, we propose super ensemble learning for daily streamflow forecasting.
引用
收藏
页码:3053 / 3068
页数:15
相关论文
共 50 条
  • [31] Comparison of Machine Learning Algorithms for Daily Runoff Forecasting with Global Rainfall Products in Algeria
    Bounab, Rayane
    Boutaghane, Hamouda
    Boulmaiz, Tayeb
    Tramblay, Yves
    ATMOSPHERE, 2025, 16 (02)
  • [32] A Survey on Large-Scale Machine Learning
    Wang, Meng
    Fu, Weijie
    He, Xiangnan
    Hao, Shijie
    Wu, Xindong
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (06) : 2574 - 2594
  • [33] Randomized algorithms for large-scale dictionary learning
    Wu, Gang
    Yang, Jiali
    NEURAL NETWORKS, 2024, 179
  • [34] Comparative Performance of Machine Learning Ensemble Algorithms for Forecasting Cryptocurrency Prices
    Derbentsev, V
    Babenko, V
    Khrustalev, K.
    Obruch, H.
    Khrustalova, S.
    INTERNATIONAL JOURNAL OF ENGINEERING, 2021, 34 (01): : 140 - 148
  • [35] Forecasting the risk at infractions: an ensemble comparison of machine learning approach
    Li, Lei
    Wu, Desheng
    INDUSTRIAL MANAGEMENT & DATA SYSTEMS, 2022, 122 (01) : 1 - 19
  • [36] Forecasting performance comparison of two hybrid machine learning models for cooling load of a large-scale commercial building
    Zhou Xuan
    Zi Xuehui
    Liang Liequan
    Fan Zubing
    Yan Junwei
    Pan Dongmei
    JOURNAL OF BUILDING ENGINEERING, 2019, 21 : 64 - 73
  • [37] Exploration of the Stacking Ensemble Machine Learning Algorithm for Cheating Detection in Large-Scale Assessment
    Zhou, Todd
    Jiao, Hong
    EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 2023, 83 (04) : 831 - 854
  • [38] Ensemble learning for large-scale crowd flow prediction
    Karbovskii, Vladislav
    Lees, Michael
    Presbitero, Alva
    Kurilkin, Alexey
    Voloshin, Daniil
    Derevitskii, Ivan
    Karsakov, Andrey
    Sloot, Peter M. A.
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 106
  • [39] A Comparison of Platforms for Implementing and Running Very Large Scale Machine Learning Algorithms
    Cai, Zhuhua
    Gao, Zekai J.
    Luo, Shangyu
    Perez, Luis L.
    Vagena, Zografoula
    Jermaine, Christopher
    SIGMOD'14: PROCEEDINGS OF THE 2014 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2014, : 1371 - 1382
  • [40] Efficient Machine Learning On Large-Scale Graphs
    Erickson, Parker
    Lee, Victor E.
    Shi, Feng
    Tang, Jiliang
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 4788 - 4789