Flash-Flood Forecasting in an Andean Mountain Catchment - Development of a Step-Wise Methodology Based on the Random Forest Algorithm

被引:71
|
作者
Munoz, Paul [1 ,2 ]
Orellana-Alvear, Johanna [1 ,3 ]
Willems, Patrick [2 ]
Celleri, Rolando [1 ,4 ]
机构
[1] Univ Cuenca, Dept Recursos Hidr & Ciencias Ambientales, Cuenca 010150, Ecuador
[2] Katholieke Univ Leuven, Hydraul Sect, Dept Civil Engn, B-3001 Leuven, Belgium
[3] Univ Marburg, Fac Geog, Lab Climatol & Remote Sensing, D-35032 Marburg, Germany
[4] Univ Cuenca, Fac Ingn, Cuenca 010150, Ecuador
关键词
flash-flood; precipitation-runoff; forecasting; lag analysis; random forest; machine learning; RAINFALL VARIABILITY; RISK-ASSESSMENT; TIME; RUNOFF; SUSCEPTIBILITY; VULNERABILITY; PERFORMANCE; PREDICTION; ACCURACY; AREAS;
D O I
10.3390/w10111519
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Flash-flood forecasting has emerged worldwide due to the catastrophic socio-economic impacts this hazard might cause and the expected increase of its frequency in the future. In mountain catchments, precipitation-runoff forecasts are limited by the intrinsic complexity of the processes involved, particularly its high rainfall variability. While process-based models are hard to implement, there is a potential to use the random forest algorithm due to its simplicity, robustness and capacity to deal with complex data structures. Here a step-wise methodology is proposed to derive parsimonious models accounting for both hydrological functioning of the catchment (e.g., input data, representation of antecedent moisture conditions) and random forest procedures (e.g., sensitivity analyses, dimension reduction, optimal input composition). The methodology was applied to develop short-term prediction models of varying time duration (4, 8, 12, 18 and 24 h) for a catchment representative of the Ecuadorian Andes. Results show that the derived parsimonious models can reach validation efficiencies (Nash-Sutcliffe coefficient) from 0.761 (4-h) to 0.384 (24-h) for optimal inputs composed only by features accounting for 80% of the model's outcome variance. Improvement in the prediction of extreme peak flows was demonstrated (extreme value analysis) by including precipitation information in contrast to the use of pure autoregressive models.
引用
收藏
页数:18
相关论文
共 3 条
  • [1] Influence of Random Forest Hyperparameterization on Short-Term Runoff Forecasting in an Andean Mountain Catchment
    Contreras, Pablo
    Orellana-Alvear, Johanna
    Munoz, Paul
    Bendix, Jorg
    Celleri, Rolando
    ATMOSPHERE, 2021, 12 (02)
  • [2] Flash-flood susceptibility mapping based on XGBoost, random forest and boosted regression trees
    Abedi, Rahebeh
    Costache, Romulus
    Shafizadeh-Moghadam, Hossein
    Pham, Quoc Bao
    GEOCARTO INTERNATIONAL, 2022, 37 (19) : 5479 - 5496
  • [3] Random forest machine learning algorithm based seasonal multi-step ahead short-term solar photovoltaic power output forecasting
    Jogunuri, Sravankumar
    Josh, F. T.
    Stonier, Albert Alexander
    Peter, Geno
    Jayaraj, Jayakumar
    Jaganathan, S.
    Joseph, Jency J.
    Ganji, Vivekananda
    IET RENEWABLE POWER GENERATION, 2024,