Predicting flood stages in watersheds with different scales using hourly rainfall dataset: A high-volume rainfall features empowered machine learning approach

被引:3
|
作者
Qiao, Lei [1 ]
Livsey, Daniel [2 ]
Wise, Jarrett [2 ]
Kadavy, Kem [2 ]
Hunt, Sherry [2 ]
Wagner, Kevin [1 ,3 ]
机构
[1] Oklahoma State Univ, Oklahoma Water Resources Ctr, Stillwater, OK 74078 USA
[2] US DA, Agroclimate & Hydraul Res Unit, Agr Res Unit, Stillwater, OK 74075 USA
[3] Oklahoma State Univ, Dept Plant & Soil Sci, Stillwater, OK 74078 USA
关键词
Lake level prediction; Surface runoff; Streamflow; Machine learning; Feature engineering; Hydrological modeling; UNCERTAINTY; MODEL;
D O I
10.1016/j.scitotenv.2024.175231
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Accurate prediction of instantaneous high lake water levels and flood flows (flood stages) from micro-catchments to big river basins are critical for flood forecasting. Lake Carl Blackwell, a small-watershed reservoir in the southcentral USA, served as a primary case study due to its rich historical dataset. Bearing knowledge that both current and previous rainfall contributes to the reservoirs' water body, a series of hourly rainfall features were created to maximize predicting power, which include total rainfall amounts in the current hour, the past 2 h, 3 h, ..., 600 h in addition to previous-day lake levels. Notedly, the rainfall features are the accumulated rainfall amounts from present to previous hours rather than the rainfall amount in any specific hour. Random Forest Regression (RFR) was used to score the features' importance and predict the flood stages along with Neural Network - Multi-layer Perceptron Regression (NN-MLP), Support Vector Regression (SVR), Extreme Gradient Boosting (XGBoost), and the ordinary multi-variant linear regression (MLR) together with dimension reduced linear models of Principal Component Regression (PCR) and Partial Least Square Regression (PLSR). The prediction accuracy for the lake flood stages can be as high as 0.95 in R2, 2 , 0.11 ft. in mean absolute error (MAE), and 0.21 ft. in root mean square error (RMSE) for the testing dataset by the RFR (NN-MLP performed equally well), with small accuracy decreases by the other two non-linear algorithms of XGBoost and SVR. The linear regressions with dimension reductions had the lowest accuracy. Furthermore, our approach demonstrated high accuracy and broad applicability for surface runoff and streamflow predictions across three different-sized watersheds from micro-catchment to big river basins in the region, with increases of predicting power from earlier rainfall for larger watersheds and vice versa.
引用
收藏
页数:11
相关论文
共 3 条
  • [1] Research on flood forecasting method in mountainous small watersheds based on machine learning for identifying rainfall dynamic spatiotemporal features
    Liu, Yuanyuan
    Liu, Yesen
    Liu, Yang
    Liu, Zhengfeng
    Yang, Weitao
    Hu, Wencai
    Shuili Xuebao/Journal of Hydraulic Engineering, 2024, 55 (09): : 1009 - 1019
  • [2] Integrating WRF forecasts at different scales for pluvial flood forecasting using a rainfall threshold approach and a real-time flood model
    Young, Adele
    Bhattacharya, Biswa
    Daniels, Emma
    Zevenbergen, Chris
    JOURNAL OF HYDROLOGY, 2025, 656
  • [3] Different Time-Increment Rainfall Prediction Models: a Machine Learning Approach Using Various Input Scenarios
    Rahimi, Anas
    Yashooa, Noor Kh.
    Ahmed, Ali Najah
    Sherif, Mohsen
    El-shafie, Ahmed
    WATER RESOURCES MANAGEMENT, 2024, : 1677 - 1696