Super ensemble learning for daily streamflow forecasting: large-scale demonstration and comparison with multiple machine learning algorithms

被引：0

作者：

Hristos Tyralis

Georgia Papacharalampous

Andreas Langousis

机构：

[1] National Technical University of Athens,Department of Water Resources and Environmental Engineering, School of Civil Engineering

[2] Elefsina Air Base,Air Force Support Command, Hellenic Air Force

[3] University of Patras,Department of Civil Engineering, School of Engineering

来源：

Neural Computing and Applications | 2021年 / 33卷

关键词：

Combining forecasts; Ensemble learning; Hydrology; Stacking;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Daily streamflow forecasting through data-driven approaches is traditionally performed using a single machine learning algorithm. Existing applications are mostly restricted to examination of few case studies, not allowing accurate assessment of the predictive performance of the algorithms involved. Here, we propose super learning (a type of ensemble learning) by combining 10 machine learning algorithms. We apply the proposed algorithm in one-step-ahead forecasting mode. For the application, we exploit a big dataset consisting of 10-year long time series of daily streamflow, precipitation and temperature from 511 basins. The super ensemble learner improves over the performance of the linear regression algorithm by 20.06%, outperforming the “hard to beat in practice” equal weight combiner. The latter improves over the performance of the linear regression algorithm by 19.21%. The best performing individual machine learning algorithm is neural networks, which improves over the performance of the linear regression algorithm by 16.73%, followed by extremely randomized trees (16.40%), XGBoost (15.92%), loess (15.36%), random forests (12.75%), polyMARS (12.36%), MARS (4.74%), lasso (0.11%) and support vector regression (− 0.45%). Furthermore, the super ensemble learner outperforms exponential smoothing and autoregressive integrated moving average (ARIMA). These latter two models improve over the performance of the linear regression algorithm by 13.89% and 8.77%, respectively. Based on the obtained large-scale results, we propose super ensemble learning for daily streamflow forecasting.

引用

页码：3053 / 3068

页数：15

共 50 条

[31] Comparison of Machine Learning Algorithms for Daily Runoff Forecasting with Global Rainfall Products in Algeria
Bounab, Rayane
Boutaghane, Hamouda
Boulmaiz, Tayeb
Tramblay, Yves
ATMOSPHERE, 2025, 16 (02)
[32] A Survey on Large-Scale Machine Learning
Wang, Meng
Fu, Weijie
He, Xiangnan
Hao, Shijie
Wu, Xindong
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (06) : 2574 - 2594
[33] Randomized algorithms for large-scale dictionary learning
Wu, Gang
Yang, Jiali
NEURAL NETWORKS, 2024, 179
[34] Comparative Performance of Machine Learning Ensemble Algorithms for Forecasting Cryptocurrency Prices
Derbentsev, V
Babenko, V
Khrustalev, K.
Obruch, H.
Khrustalova, S.
INTERNATIONAL JOURNAL OF ENGINEERING, 2021, 34 (01): : 140 - 148
[35] Forecasting the risk at infractions: an ensemble comparison of machine learning approach
Li, Lei
Wu, Desheng
INDUSTRIAL MANAGEMENT & DATA SYSTEMS, 2022, 122 (01) : 1 - 19
[36] Forecasting performance comparison of two hybrid machine learning models for cooling load of a large-scale commercial building
Zhou Xuan
Zi Xuehui
Liang Liequan
Fan Zubing
Yan Junwei
Pan Dongmei
JOURNAL OF BUILDING ENGINEERING, 2019, 21 : 64 - 73
[37] Exploration of the Stacking Ensemble Machine Learning Algorithm for Cheating Detection in Large-Scale Assessment
Zhou, Todd
Jiao, Hong
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 2023, 83 (04) : 831 - 854
[38] Ensemble learning for large-scale crowd flow prediction
Karbovskii, Vladislav
Lees, Michael
Presbitero, Alva
Kurilkin, Alexey
Voloshin, Daniil
Derevitskii, Ivan
Karsakov, Andrey
Sloot, Peter M. A.
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 106
[39] A Comparison of Platforms for Implementing and Running Very Large Scale Machine Learning Algorithms
Cai, Zhuhua
Gao, Zekai J.
Luo, Shangyu
Perez, Luis L.
Vagena, Zografoula
Jermaine, Christopher
SIGMOD'14: PROCEEDINGS OF THE 2014 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2014, : 1371 - 1382
[40] Efficient Machine Learning On Large-Scale Graphs
Erickson, Parker
Lee, Victor E.
Shi, Feng
Tang, Jiliang
PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 4788 - 4789

← 1 2 3 4 5 →