Landslide Modeling in a Tropical Mountain Basin Using Machine Learning Algorithms and Shapley Additive Explanations

被引:8
|
作者
Vega, Johnny [1 ]
Sepulveda-Murillo, Fabio Humberto [2 ]
Parra, Melissa [1 ]
机构
[1] Univ Medellin, Fac Ingn, Medellin, Colombia
[2] Univ Medellin, Fac Ciencias Basicas, Medellin, Colombia
来源
关键词
Colombian Andes; landslides; machine learning; SHAP; statistical methods; susceptibility; DECISION TREE; FUZZY MULTICRITERIA; FREQUENCY RATIO; RANDOM FOREST; SUSCEPTIBILITY; SYSTEM; AREA;
D O I
10.1177/11786221231195824
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Landslides are a geological hazard commonly induced by rainfall, earthquakes, deforestation, or human activity causing loss of human life every year specially on highlands or mountain slopes with serious impacts that threaten communities and its infrastructure. The incidence and recurrence of landslides are conditioned by several aspects related to soil properties, geological structure, climatic conditions, soil cover, and water flow. Precisely, Colombia is one of the most affected by this type of natural hazard, as well as by floods, since they are the natural phenomena that bring with them the most severe risks for communities. In this work, we articulated the statistical approach of the landslide conditioning factors, Machine Learning Algorithms (MLA), and Geographic Information System (GIS), evaluating a flexible and agile methodology to estimate the landslide susceptibility defining areas prone to the landslide occurrence. The MLA were validated in a case study in the "La Liboriana" River basin, located in the Municipality of Salgar in the Colombian mountains Andes where Landslide Susceptibility Maps (LSMs) were obtained. The obtained MLA results hold immense potential in the field of regional landslide mapping, facilitating the development of effective strategies aimed at minimizing the devastating impacts on human lives, infrastructure, and the natural environment. By leveraging these findings, proactive measures can be devised to safeguard vulnerable areas, mitigate risks, and ensure the safety and well-being of communities. Seven supervised MLA were employed, two regression algorithms (Logistic) and five decision tree algorithms (Recursive Partitioning and Regression Trees [RPART], Conditional Inference Trees [CTREE], Random Forest [RF], Ranger, and Extreme Gradient Boosting Algorithm [XGBoost]). The LSMs were produced for each MLA. Considering different performance metrics, the RF model yields the best classification accuracy with an area under receiver operating characteristic (ROC) curve of 95% and 90% of accuracy, providing the most representative results. Finally, the contribution of each landslide conditioning factor on predictions with RF model is explained using the SHAP method.
引用
收藏
页数:20
相关论文
共 50 条
  • [31] Assessing the Suitability of Boosting Machine-Learning Algorithms for Classifying Arsenic-Contaminated Waters: A Novel Model-Explainable Approach Using SHapley Additive exPlanations
    Ibrahim, Bemah
    Ewusi, Anthony
    Ahenkorah, Isaac
    WATER, 2022, 14 (21)
  • [32] Improved Prediction of Total Energy Consumption and Feature Analysis in Electric Vehicles Using Machine Learning and Shapley Additive Explanations Method
    Pokharel, Sugam
    Sah, Pradip
    Ganta, Deepak
    WORLD ELECTRIC VEHICLE JOURNAL, 2021, 12 (03):
  • [33] Explanation of Machine Learning Models Using Improved Shapley Additive Explanation
    Nohara, Yasunobu
    Matsumoto, Koutarou
    Soejima, Hidehisa
    Nakashima, Naoki
    ACM-BCB'19: PROCEEDINGS OF THE 10TH ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY AND HEALTH INFORMATICS, 2019, : 546 - 546
  • [34] Study on biomass and polymer catalytic co-pyrolysis product characteristics using machine learning and shapley additive explanations (SHAP)
    Qi, Jingwei
    Wang, Yijie
    Xu, Pengcheng
    Huhe, Taoli
    Ling, Xiang
    Yuan, Haoran
    Chen, Yong
    Li, Jiadong
    FUEL, 2025, 380
  • [35] Comparison of Explainable Machine-Learning Models for Decision-Making in Health Intensive Care Using SHapley Additive exPlanations
    Vidal, Igor Pereira
    Pereira, Marluce Rodrigues
    Freire, Andre Pimenta
    Resende, Uanderson
    Maziero, Erick Galani
    PROCEEDINGS OF THE 19TH BRAZILIAN SYMPOSIUM ON INFORMATION SYSTEMS, 2023, : 300 - 307
  • [36] Explaining anomalies detected by autoencoders using Shapley Additive Explanations
    Antwarg, Liat
    Miller, Ronnie Mindlin
    Shapira, Bracha
    Rokach, Lior
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 186
  • [37] Identification of Smartwatch-Collected Lifelog Variables Affecting Body Mass Index in Middle-Aged People Using Regression Machine Learning Algorithms and SHapley Additive Explanations
    Kim, Jiyong
    Lee, Jiyoung
    Park, Minseo
    APPLIED SCIENCES-BASEL, 2022, 12 (08):
  • [38] Explaining deep learning-based activity schedule models using SHapley Additive exPlanations
    Koushik, Anil
    Manoj, M.
    Nezamuddin, N.
    TRANSPORTATION LETTERS-THE INTERNATIONAL JOURNAL OF TRANSPORTATION RESEARCH, 2024,
  • [39] Interpretable machine learning with tree-based shapley additive explanations: Application to metabolomics datasets for binary classification
    Bifarin, Olatomiwa O.
    PLOS ONE, 2023, 18 (05):
  • [40] A model for predicting academic performance on standardised tests for lagging regions based on machine learning and Shapley additive explanations
    Suaza-Medina, Mario
    Penabaena-Niebles, Rita
    Jubiz-Diaz, Maria
    SCIENTIFIC REPORTS, 2024, 14 (01):