Landslide Modeling in a Tropical Mountain Basin Using Machine Learning Algorithms and Shapley Additive Explanations

Cited by: 8
Authors
Vega, Johnny [1]
Sepulveda-Murillo, Fabio Humberto [2]
Parra, Melissa [1]
Affiliations
[1] Univ Medellin, Fac Ingn, Medellin, Colombia
[2] Univ Medellin, Fac Ciencias Basicas, Medellin, Colombia
Source
Keywords
Colombian Andes; landslides; machine learning; SHAP; statistical methods; susceptibility; DECISION TREE; FUZZY MULTICRITERIA; FREQUENCY RATIO; RANDOM FOREST; SUSCEPTIBILITY; SYSTEM; AREA
DOI
10.1177/11786221231195824
Chinese Library Classification
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
Landslides are a geological hazard commonly induced by rainfall, earthquakes, deforestation, or human activity. They cause loss of human life every year, especially on highlands and mountain slopes, with serious impacts that threaten communities and their infrastructure. The incidence and recurrence of landslides are conditioned by several factors related to soil properties, geological structure, climatic conditions, soil cover, and water flow. Colombia is one of the countries most affected by this type of natural hazard, as well as by floods; together these are the natural phenomena that pose the most severe risks to communities. In this work, we combined a statistical analysis of landslide conditioning factors, Machine Learning Algorithms (MLA), and a Geographic Information System (GIS) to evaluate a flexible and agile methodology for estimating landslide susceptibility and delineating areas prone to landslide occurrence. The MLA were validated in a case study in the "La Liboriana" River basin, located in the Municipality of Salgar in the Colombian Andes, where Landslide Susceptibility Maps (LSMs) were obtained. The MLA results hold potential for regional landslide mapping and can support strategies aimed at minimizing impacts on human lives, infrastructure, and the natural environment. Building on these findings, proactive measures can be devised to safeguard vulnerable areas, mitigate risks, and protect communities. Seven supervised MLA were employed: two regression algorithms (Logistic) and five decision tree algorithms (Recursive Partitioning and Regression Trees [RPART], Conditional Inference Trees [CTREE], Random Forest [RF], Ranger, and Extreme Gradient Boosting Algorithm [XGBoost]). An LSM was produced for each MLA. Considering different performance metrics, the RF model yielded the best classification performance, with an area under the receiver operating characteristic (ROC) curve of 95% and an accuracy of 90%, providing the most representative results. Finally, the contribution of each landslide conditioning factor to the RF model's predictions is explained using the SHAP method.
Pages: 20