A comparative study on urban waterlogging susceptibility assessment based on multiple data-driven models

被引:4
|
作者
Han, Feifei [1 ]
Yu, Jingshan [1 ,2 ]
Zhou, Guihuan [1 ]
Li, Shuang [1 ]
Sun, Tong [1 ]
机构
[1] Beijing Normal Univ, Coll Water Sci, Beijing Key Lab Urban Hydrol Cycle & Sponge City T, Beijing 100875, Peoples R China
[2] Shandong Acad Environm Planning, State Environm Protect Key Lab Land & Sea Ecol Gov, Jinan 250101, Peoples R China
基金
中国国家自然科学基金;
关键词
Urban waterlogging susceptibility; Machine learning; MaxEnt; DISO; Uncertainty analysis; SUPPORT VECTOR MACHINE; SIMULATION; ALGORITHM; CLASSIFICATION; OPTIMIZATION; FOREST; AREA;
D O I
10.1016/j.jenvman.2024.121166
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Accurate identification of urban waterlogging areas and assessing waterlogging susceptibility are crucial for preventing and controlling hazards. Data-driven models are utilized to forecast waterlogging areas by establishing intricate relationships between explanatory variables and waterlogging states. This approach tackles the constraints of mechanistic models, which are frequently complex and unable to incorporate socio-economic factors. Previous research predominantly employed single-type data-driven models to predict waterlogging locations and evaluation of their effectiveness. There is a scarcity of comprehensive performance comparisons and uncertainty analyses of different types of models, as well as a lack of interpretability analysis. The chosen study area was the central area of Beijing, which is prone to waterlogging. Given the high manpower, time, and economic costs associated with collecting waterlogging information, the waterlogging point distribution map released by the Beijing Water Affairs Bureau was selected as labeled samples. Twelve factors affecting waterlogging susceptibility were chosen as explanatory variables to construct Random Forest (RF), Support Vector Machine with Radial Basis Function (SVM-RBF), Particle Swarm Optimization-Weakly Labeled Support Vector Machine (PSO-WELLSVM), and Maximum Entropy (MaxEnt). The utilization of diverse single evaluation indicators (such as F-score, Kappa, AUC, etc.) to assess the model performance may yield conflicting results. The Distance between Indices of Simulation and Observation (DISO) was chosen as a comprehensive measure to assess the model's performance in predicting waterlogging points. PSO-WELLSVM exhibited the highest performance with a DISOtest value of 0.63, outperforming MaxEnt (0.78), which excelled in identifying areas highly susceptible to waterlogging, including extremely high susceptibility zones. The SVM-RBF and RF models demonstrated suboptimal performance and exhibited overfitting. The examination of waterlogging susceptibility distribution maps predicted by the four models revealed significant spatial differences due to variations in computational principles and input parameter complexities. The integration of four WSAMs based on logistic regression has been shown to significantly decrease the uncertainty of a single data-driven model and identify the most flood-prone areas. To improve the interpretability of the data model, a geographical detector was incorporated to demonstrate the explanatory capacity of 12 variables and the process of waterlogging. Building Density (BD) exhibits the highest explanatory power in relation to explain waterlogging susceptibility (Q value = 0.202), followed by Distance to Road, Frequency of Heavy Rainstorms (FHR), DEM, etc. The interaction between BD and FHR results in a nonlinear increase in the explanatory power of waterlogging susceptibility. The presence of waterlogging susceptibility risk in the research area can be attributed to the interactions of multiple factors.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] A comparative study of an expert knowledge-based model and two data-driven models for landslide susceptibility mapping
    Zhu, A-Xing
    Miao, Yamin
    Wang, Rongxun
    Zhu, Tongxin
    Deng, Yongcui
    Liu, Junzhi
    Yang, Lin
    Qin, Cheng-Zhi
    Hong, Haoyuan
    CATENA, 2018, 166 : 317 - 327
  • [2] Comparative Study of Data-Driven Models in Motor RUL Estimation
    Banerjee, Ahin
    Gupta, Sanjay K.
    Putcha, Chandrasekhar
    ASCE-ASME JOURNAL OF RISK AND UNCERTAINTY IN ENGINEERING SYSTEMS PART A-CIVIL ENGINEERING, 2022, 8 (01):
  • [3] A Comparative Study of Data-Driven Models for Travel Destination Characterization
    Dietz, Linus W.
    Sertkan, Mete
    Myftija, Saadi
    Thimbiri Palage, Sameera
    Neidhardt, Julia
    Woerndl, Wolfgang
    FRONTIERS IN BIG DATA, 2022, 5
  • [4] A Comparative Study of Data-driven Models for Groundwater Level Forecasting
    Sarma, R.
    Singh, S. K.
    WATER RESOURCES MANAGEMENT, 2022, 36 (08) : 2741 - 2756
  • [5] A Comparative Study of Data-driven Models for Groundwater Level Forecasting
    R. Sarma
    S. K. Singh
    Water Resources Management, 2022, 36 : 2741 - 2756
  • [6] Towards urban flood susceptibility mapping using data-driven models in Berlin, Germany
    Seleem, Omar
    Ayzel, Georgy
    de Souza, Arthur Costa Tomaz
    Bronstert, Axel
    Heistermann, Maik
    GEOMATICS NATURAL HAZARDS & RISK, 2022, 13 (01) : 1640 - 1662
  • [7] Study on spatiotemporal dynamic characteristics of precipitation and causes of waterlogging based on a data-driven framework
    Han, Feifei
    Zhang, Xueyu
    Yu, Jingshan
    Xu, Shugao
    Zhou, Guihuan
    Li, Shuang
    SCIENCE OF THE TOTAL ENVIRONMENT, 2024, 913
  • [8] Comparative review of data-driven landslide susceptibility models: case study in the Eastern Andes mountain range of Colombia
    Calderon-Guevara, Wilmar
    Sanchez-Silva, Mauricio
    Nitescu, Bogdan
    Villarraga, Daniel F.
    NATURAL HAZARDS, 2022, 113 (02) : 1105 - 1132
  • [9] Comparative review of data-driven landslide susceptibility models: case study in the Eastern Andes mountain range of Colombia
    Wilmar Calderón-Guevara
    Mauricio Sánchez-Silva
    Bogdan Nitescu
    Daniel F. Villarraga
    Natural Hazards, 2022, 113 : 1105 - 1132
  • [10] A comparative study of Data-driven Prognostic Approaches: Stochastic and Statistical Models
    Li, Rui
    Verhagen, Wim J. C.
    Curran, Richard
    2018 IEEE INTERNATIONAL CONFERENCE ON PROGNOSTICS AND HEALTH MANAGEMENT (ICPHM), 2018,