Enhancing the streamflow simulation of a process-based hydrological model using machine learning and multi-source data

被引:3
|
作者
Lei, Huajin [1 ,2 ,3 ]
Li, Hongyi [4 ]
Hu, Wanpin [5 ]
机构
[1] Xihua Univ, Sch Energy & Power Engn, Chengdu 610039, Peoples R China
[2] Xihua Univ, Key Lab Fluid Machinery & Engn, Chengdu 610039, Sichuan, Peoples R China
[3] Sichuan Univ, Coll Water Resource & Hydropower, Chengdu 610065, Peoples R China
[4] Chinese Acad Sci, Northwest Inst Ecoenvironm & Resources, Lanzhou 730070, Peoples R China
[5] Sichuan Inst Land Sci & Technol, Dept Nat Resources Sichuan Prov, Chengdu 610065, Peoples R China
关键词
Streamflow simulation; Process-based hydrological model; Machine learning; Hybrid modelling; Jialing River basin; ARTIFICIAL NEURAL-NETWORKS; VARIABLE SELECTION; BTOP MODEL; REGRESSION; UNCERTAINTY; EVAPORATION; PREDICTION; TOPMODEL;
D O I
10.1016/j.ecoinf.2024.102755
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
Streamflow simulation is crucial for flood mitigation, ecological protection, and water resource planning. Process-based hydrological models and machine learning algorithms are the mainstream tools for streamflow simulation. However, their inherent limitations, such as time-consuming and large data requirements, make achieving high-precision simulations challenging. This study developed a hybrid approach to simultaneously improve the accuracy and computational efficiency of streamflow simulation, which integrates Block-wise use of the TOPMODEL (BTOP) model into the eXtreme Gradient Boosting (XGBoost), i.e., BTOP_XGB. In this approach, BTOP generates simulated streamflow using the Latin hypercube sampling algorithm instead of the time-consuming calibration algorithms to reduce computational costs. Then, XGBoost combines BTOP simulated streamflow with multi-source data to reduce simulation errors. In which, serval input variable selection algorithms are employed to choose relevant inputs and remove redundant information for model. The hybrid approach is validated and compared with a standalone model at three hydrological stations in the Jialing River basin, China. The results show that the performance of BTOP_XGB is significantly better than the BTOP and XGBoost models. The NSE of BTOP_XGB at Beibei, Xiaoheba, and Luoduxi stations increases by 54%, 21%, and 83%, respectively. Meanwhile, the computational time of BTOP_XGB is saved by >90% compared to the original calibrated BTOP. BTOP_XGB is less affected by parameter sample sizes and data amounts, demonstrating the robustness of the hybrid model. This study simplifies the complexity of the hydrological model and enhances the stability of machine learning, jointly improving the reliability of streamflow simulation. The hybrid approach provides a potential shortcut for streamflow simulation over basins with large areas or limited observed data.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Combining process-based model and machine learning to predict hydrological regimes in floodplain wetlands under climate change
    Yao, Siyang
    Chen, Cheng
    Chen, Qiuwen
    Zhang, Jianyun
    He, Mengnan
    JOURNAL OF HYDROLOGY, 2023, 626
  • [32] Enhancing hydrological model performance through multi-source open-data utilization in the highly managed, data-scarce basin
    Zhang, Jiayan
    Liu, Zhihong
    Li, Yu
    Dou, Yanhong
    Wang, Mingjun
    Zhou, Huicheng
    Xu, Bo
    JOURNAL OF HYDROLOGY, 2025, 654
  • [33] Improving water status prediction of winter wheat using multi-source data with machine learning
    Shi, Bo
    Yuan, Yifan
    Zhuang, Tingxuan
    Xu, Xuan
    Schmidhalter, Urs
    Ata-UI-Karim, Syed Tahir
    Zhao, Ben
    Liu, Xiaojun
    Tian, Yongchao
    Zhu, Yan
    Cao, Weixing
    Cao, Qiang
    EUROPEAN JOURNAL OF AGRONOMY, 2022, 139
  • [34] Prediction of High-Resolution Soil Moisture Using Multi-source Data and Machine Learning
    Sudhakara, B.
    Bhattacharjee, Shrutilipi
    DISTRIBUTED COMPUTING AND INTELLIGENT TECHNOLOGY, ICDCIT 2024, 2024, 14501 : 282 - 292
  • [35] Emulating process-based water quality modelling in water source reservoirs using machine learning
    Mohammed, Hadi
    Tornyeviadzi, Hoese Michel
    Seidu, Razak
    JOURNAL OF HYDROLOGY, 2022, 609
  • [36] Simulation Credibility Evaluation Based on Multi-source Data Fusion
    Zhou, Yuchen
    Fang, Ke
    Ma, Ping
    Yang, Ming
    METHODS AND APPLICATIONS FOR MODELING AND SIMULATION OF COMPLEX SYSTEMS, 2018, 946 : 18 - 31
  • [37] Regional Forest Carbon Stock Estimation Based on Multi-Source Data and Machine Learning Algorithms
    Zheng, Mingwei
    Wen, Qingqing
    Xu, Fengya
    Wu, Dasheng
    FORESTS, 2025, 16 (03):
  • [38] Enhancing the accuracy of monitoring effective tiller counts of wheat using multi-source data and machine learning derived from consumer drones
    Feng, Ziheng
    Cai, Jiaxiang
    Wu, Ke
    Li, Yahui
    Yuan, Xinru
    Duan, Jianzhao
    He, Li
    Feng, Wei
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2025, 232
  • [39] Estimation of Nitrogen Content in Winter Wheat Based on Multi-Source Data Fusion and Machine Learning
    Ding, Fan
    Li, Changchun
    Zhai, Weiguang
    Fei, Shuaipeng
    Cheng, Qian
    Chen, Zhen
    AGRICULTURE-BASEL, 2022, 12 (11):
  • [40] Process based calibration of a continental-scale hydrological model using soil moisture and streamflow data
    Bajracharya, Ajay Ratna
    Ahmed, Mohamed Ismaiel
    Stadnyk, Tricia
    Asadzadeh, Masoud
    JOURNAL OF HYDROLOGY-REGIONAL STUDIES, 2023, 47