Landslide Susceptibility Mapping Methods Coupling with Statistical Methods, Machine Learning Models and Clustering Algorithms

被引:0
|
作者
Wang Q. [1 ]
Xiong J. [1 ,2 ]
Cheng W. [2 ]
Cui X. [1 ]
Pang Q. [4 ,5 ]
Liu J. [3 ]
Chen W. [1 ]
Tang H. [1 ]
Song N. [1 ]
机构
[1] School of Civil Engineering and Geomatics, Southwest Petroleum University, Chengdu
[2] State Key Laboratory of Resource and Environmental Information System, Institute of Geographic and Natural Resources Research, Chinese Academy of Sciences, Beijing
[3] School of Geography and Planning, Sun Yat-Sen University, Guangzhou
[4] Sichuan Academy of Safety Science and Technology, Chengdu
[5] Sichuan Anxin Kechuang Technology Co., Ltd., Chengdu
基金
国家重点研发计划;
关键词
certainty factor; clustering algorithm; frequency ratio; information value; landslide susceptibility; Ningnan County; random forest;
D O I
10.12082/dqxxkx.2024.230427
中图分类号
学科分类号
摘要
Landslides frequently occur in the mountainous areas of western China. Accurate mapping of landslide susceptibility is essential for geohazard management. Integrated models combining statistical methods and machine learning models have been widely applied to landslide susceptibility mapping. However, further optimization of their results is still worth investigation. This study proposes a comprehensive assessment method that couples statistical methods, machine learning models, and clustering algorithms. The effectiveness of the proposed method on improving the accuracy of landslide susceptibility mapping in Ningnan County is investigated. Firstly, the landslide influencing factors are selected from five aspects: geological environment, topography and geomorphology, meteorology and hydrology, vegetation and soil, and human engineering activities in the study area. Indicators are initially selected based on correlation analysis using the Pearson correlation coefficient method, and highly correlated factors are eliminated to establish the landslide susceptibility mapping index system. Next, the Information Value (IV), Certainty Factor (CF), and Frequency Ratio (FR) methods are combined with Random Forest (RF) model respectively to obtain three integrated models (IV-RF, CF-RF, and FR-RF). Then, the ISO clustering algorithm, Natural Breaks clustering, and Kmeans clustering algorithms are introduced to classify the results of the three integrated models, obtaining nine coupled assessment models (IV-RF-ISO, CF-RF-ISO, FR-RF-ISO, IV-RF-NBC, CF-RF-NBC, FR-RF-NBC, IV-RF-Kmeans, CF-RF-Kmeans, and FR-RF-Kmeans). Lastly, Area Under the Curve value (AUC), accuracy, F1 score, and Seed Cell Area Indexes (SCAI) are used to evaluate the accuracy of the models. The results demonstrate that all the integrated models outperform single models. The accuracy and F1 score of all integrated models both exceed 0.85, and their AUC values exceed 0.9. The integrated models effectively address the misclassification of non-landslide samples, which is especially prominent in single IV and CF models. Among the integrated models, the FR-RF model performs the best. The accuracy (0.911), F1 score (0.912), and AUC value (0.965) of FR-RF model improves by 0.095, 0.096, and 0.074, respectively, compared to the FR model. Compared with the natural break and Kmeans clustering methods, the coupled FR-RF-ISO model exhibits the optimal classification results, and the difference in SCAI values between its high and low susceptibility zones is more significant. The extremely high landslide susceptibility zones are primarily concentrated in the southern, eastern, and central parts of Ningnan County. The study demonstrates the high accuracy of the integrated assessment method that couples statistical methods, machine learning, and clustering algorithms, and provides insights for improving the accuracy of landslide susceptibility mapping. © 2024 Science Press. All rights reserved.
引用
收藏
页码:620 / 637
页数:17
相关论文
共 54 条
  • [11] Zhuang J.Q., Cui P., Ge Y.G., Et al., Risk assessment of collapses and landslides caused by 5.12 Wenchuan earthquake: A case study of Dujiangyan-Wenchuan highway[J], Chinese Journal of Rock Mechanics and Engineering, 29, 2, (2010)
  • [12] Feng H.J., Zhou A.G., Tang X.M., Et al., Susceptibility analysis of factors controlling rainfall-triggered landslides using certainty factor method[J], Journal of Engineering Geology, 25, 2, pp. 436-446, (2017)
  • [13] Zhang Y.X., Lan H.X., Li L.P., Et al., Optimizing the frequency ratio method for landslide susceptibility assessment: A case study of the Caiyuan Basin in the southeast mountainous area of China[J], Journal of Mountain Science, 17, 2, pp. 340-357, (2020)
  • [14] Liu L.Y., Gao H.Y., Landslide susceptibility assessment based on coupling of woe model and logistic regression model[J], Journal of Engineering Geology, 31, 1, pp. 165-175, (2023)
  • [15] Zhang Z.Y., Deng M.G., Xu S.G., Et al., Comparison of landslide susceptibility assessment models in Zhenkang County, Yunnan Province, China[J], Chinese Journal of Rock Mechanics and Engineering, 41, 1, pp. 157-171, (2022)
  • [16] Wu R.Z., Hu X.D., Mei H.B., Et al., Spatial susceptibility assessment of landslides based on random forest: A case study from Hubei section in the Three Gorges Reservoir area[J], Earth Science, 2021, 1, pp. 321-330
  • [17] Althuwaynee O.F., Pradhan B., Park H.J., Et al., A novel ensemble decision tree-based CHi-squared Automatic Interaction Detection (CHAID) and multivariate logistic regression models in landslide susceptibility mapping[J], Landslides, 11, 6, pp. 1063-1078, (2014)
  • [18] Liu J.P., Liang E.J., Xu S.H., Et al., Multi-kernel support vector machine considering sample optimization selection for analysis and evaluation of landslide disaster susceptibility[J], Acta Geodaetica et Cartographica Sinica, 51, 10, pp. 2034-2045, (2022)
  • [19] Huang W.B., Ding M.T., Wang D., Et al., Evaluation of landslide susceptibility based on layer adaptive weighted convolutional neural network model along Sichuan-Tibet traffic corridor[J], Earth Science, 47, 6, pp. 2015-2030, (2022)
  • [20] Huang F., Cao Z., Guo J., Et al., Comparisons of heuristic, general statistical and machine learning models for landslide susceptibility prediction and mapping[J], CATENA, 191, (2020)