Predicting flood stages in watersheds with different scales using hourly rainfall dataset: A high-volume rainfall features empowered machine learning approach

Cited by: 3

Authors:

Qiao, Lei [1]

Livsey, Daniel [2]

Wise, Jarrett [2]

Kadavy, Kem [2]

Hunt, Sherry [2]

Wagner, Kevin [1,3]

Affiliations:

[1] Oklahoma State Univ, Oklahoma Water Resources Ctr, Stillwater, OK 74078 USA

[2] USDA, Agroclimate & Hydraul Res Unit, Agr Res Unit, Stillwater, OK 74075 USA

[3] Oklahoma State Univ, Dept Plant & Soil Sci, Stillwater, OK 74078 USA

Source:

SCIENCE OF THE TOTAL ENVIRONMENT | 2024 / Vol. 950

Keywords:

Lake level prediction; Surface runoff; Streamflow; Machine learning; Feature engineering; Hydrological modeling; UNCERTAINTY; MODEL;

DOI:

10.1016/j.scitotenv.2024.175231

Chinese Library Classification (CLC) number:

X [Environmental Science, Safety Science];

Discipline classification code:

08 ; 0830 ;

Abstract:

Accurate prediction of instantaneous high lake water levels and flood flows (flood stages) from micro-catchments to big river basins is critical for flood forecasting. Lake Carl Blackwell, a small-watershed reservoir in the south-central USA, served as the primary case study because of its rich historical dataset. Because both current and previous rainfall contribute to the reservoir's water body, a series of hourly rainfall features was created to maximize predictive power: total rainfall amounts in the current hour, the past 2 h, 3 h, ..., 600 h, in addition to previous-day lake levels. Notably, the rainfall features are the accumulated rainfall amounts from the present back to previous hours rather than the rainfall amount in any specific hour. Random Forest Regression (RFR) was used to score the features' importance and predict the flood stages, along with Neural Network Multi-layer Perceptron Regression (NN-MLP), Support Vector Regression (SVR), Extreme Gradient Boosting (XGBoost), and ordinary multivariate linear regression (MLR), together with the dimension-reduced linear models of Principal Component Regression (PCR) and Partial Least Squares Regression (PLSR). The prediction accuracy for the lake flood stages can be as high as 0.95 in R², 0.11 ft in mean absolute error (MAE), and 0.21 ft in root mean square error (RMSE) for the testing dataset by the RFR (NN-MLP performed equally well), with small accuracy decreases for the other two non-linear algorithms, XGBoost and SVR. The linear regressions with dimension reduction had the lowest accuracy. Furthermore, our approach demonstrated high accuracy and broad applicability for surface runoff and streamflow predictions across three different-sized watersheds in the region, from micro-catchment to big river basin, with the predictive power of earlier rainfall increasing for larger watersheds and decreasing for smaller ones.
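The accumulated-rainfall features described in the abstract can be illustrated with a minimal sketch. It is not the authors' code: the function name `accumulated_rainfall_features`, the sample values, and the choice to truncate windows that reach back before the start of the record are all assumptions made for illustration. Each feature is the rainfall summed from the current hour back over the trailing k hours, computed here with prefix sums so that many window lengths (for example 1 to 600 h) stay cheap.

```python
from itertools import accumulate


def accumulated_rainfall_features(hourly_rain, windows):
    """Return one feature row per hour: rainfall summed over the trailing k hours.

    hourly_rain: hourly rainfall amounts, oldest first (units are up to the caller).
    windows: window lengths in hours, e.g. range(1, 601).
    Assumption of this sketch: a window reaching back before the start of
    the record is truncated to the hours that are available.
    """
    # prefix[i] holds the total rainfall of the first i hours.
    prefix = [0.0] + list(accumulate(float(r) for r in hourly_rain))
    windows = list(windows)
    rows = []
    for t in range(len(hourly_rain)):
        end = t + 1  # the window includes the current hour
        rows.append([prefix[end] - prefix[max(0, end - k)] for k in windows])
    return rows


# Hypothetical five-hour record; the windows are the last 1, 2, and 3 hours.
rain = [0.0, 2.0, 10.0, 0.0, 5.0]
features = accumulated_rainfall_features(rain, windows=[1, 2, 3])
for hour, row in enumerate(features):
    print(hour, row)
```

In a full workflow these rows would be joined with the previous-day lake level and passed to a regressor such as a random forest; that step is omitted here because the abstract does not give model settings.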


Pages: 11

3 items in total

[1] Research on flood forecasting method in mountainous small watersheds based on machine learning for identifying rainfall dynamic spatiotemporal features
Liu, Yuanyuan
Liu, Yesen
Liu, Yang
Liu, Zhengfeng
Yang, Weitao
Hu, Wencai
Shuili Xuebao/Journal of Hydraulic Engineering, 2024, 55 (09): 1009 - 1019
[2] Integrating WRF forecasts at different scales for pluvial flood forecasting using a rainfall threshold approach and a real-time flood model
Young, Adele
Bhattacharya, Biswa
Daniels, Emma
Zevenbergen, Chris
JOURNAL OF HYDROLOGY, 2025, 656
[3] Different Time-Increment Rainfall Prediction Models: a Machine Learning Approach Using Various Input Scenarios
Rahimi, Anas
Yashooa, Noor Kh.
Ahmed, Ali Najah
Sherif, Mohsen
El-shafie, Ahmed
WATER RESOURCES MANAGEMENT, 2024: 1677 - 1696
