Comparing the performance of global, geographically weighted and ecologically weighted species distribution models for Scottish wildcats using GLM and Random Forest predictive modeling

Cited by: 8
Authors
Cushman, S. A. [1]
Kilshaw, K. [1]
Campbell, R. D. [2]
Kaszta, Z. [1]
Gaywood, M. [3]
Macdonald, D. W. [1]
Affiliations
[1] Univ Oxford, Recanati Kaplan Ctr, Dept Biol, Wildlife Conservat Res Unit WildCRU, Tubney House,Abingdon Rd, Oxford OX13 5QL, England
[2] NatureScot, Perth PH1 3EW, Scotland
[3] NatureScot, Fodderty Way,Dingwall Business Pk, Dingwall IV15 9XB, Scotland
Keywords
Scottish wildcat; Limiting factors; Nonstationary; Species distribution modeling; Habitat modeling; MULTISCALE HABITAT SELECTION; LANDSCAPE GENETICS; EUROPEAN WILDCAT; SCALE; ECOLOGY; MARTEN; CONNECTIVITY; CONSERVATION; REPLICATION; SPACE
DOI
10.1016/j.ecolmodel.2024.110691
Chinese Library Classification number
Q14 [Ecology (Bioecology)]
Discipline classification code
071012; 0713
Abstract
Species distribution modeling has emerged as a foundational method to predict occurrence and suitability of species in relation to environmental variables to advance ecological understanding and guide conservation planning. Recent research, however, has shown that species-environment relationships and habitat model predictions are often nonstationary in space, time and ecological context. This calls into question modeling approaches that assume a global, stationary ecological realized niche and use predictive modeling to describe it. This paper explores this issue by comparing the performance of predictive models for wildcat hybrid occurrence based on (1) global pooled data across individuals, (2) geographically weighted aggregation of individual models, (3) ecologically weighted aggregation of individual models, and (4) combinations of global, geographical and ecological weighting. Our study system included GPS telemetry data from 14 individual wildcat hybrids across Scotland. We developed predictive models using both Generalized Linear Models (GLM) and Random Forest machine learning to compare the performance of these differing algorithms in stationary and nonstationary analyses. We validated the models in four different ways. First, we used independent hold-out data from the 14 collared wildcat hybrids. Second, we used data from 8 additional GPS collared wildcat hybrids from a previous study that were not included in the training sample. Third, we used sightings data sent in by the public and researchers and validated by expert opinion. Fourth, we used data collected by camera trap surveys between 2012 and 2021 from various sources to produce a combined camera trap dataset showing where wildcats and wildcat hybrids had been detected.
Our results show that validation using hold-out data from the individuals used to train the model provides a highly biased assessment of true model performance in other locations, with Random Forest in particular appearing to perform exceptionally (and inaccurately) well when validated by data from the same individuals used to train the models. Very different results were obtained when the models were validated using independent data from the three other sources. Each of these three independent validation datasets identified a different best overall model. The average of independent validation across these three datasets suggested that the best overall model for potential wildcat occurrence and habitat suitability was an ensemble average of the global GLM and Random Forest models with the ecologically weighted GLM and Random Forest models. This suggests that the debate over whether GLM or machine learning approaches are superior, or whether global or aggregated nonstationary modeling is superior, may present a false choice. The results presented here show that the best prediction applies a combination of all of these approaches in an ensemble modeling framework.
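The weighted aggregation of individual models and the final ensemble average described in the abstract can be illustrated with a minimal Python sketch. The abstract does not state the weighting function, so the Gaussian distance-decay kernel, the `bandwidth` parameter, the function names and all numeric values below are assumptions made for illustration only, not the authors' method.

```python
import math


def gaussian_weight(distance, bandwidth):
    """Assumed kernel: weight decays smoothly as distance grows."""
    return math.exp(-0.5 * (distance / bandwidth) ** 2)


def weighted_aggregate(predictions, distances, bandwidth):
    """Combine per-individual model predictions at a single location.

    predictions: suitability (0..1) predicted by each individual's model.
    distances:   distance from the location to each individual's data,
                 e.g. kilometres for geographic weighting or a distance
                 in covariate space for ecological weighting.
    """
    weights = [gaussian_weight(d, bandwidth) for d in distances]
    total = sum(weights)
    if total == 0:
        # Every individual is effectively out of range:
        # fall back to an unweighted mean.
        return sum(predictions) / len(predictions)
    return sum(w * p for w, p in zip(weights, predictions)) / total


def ensemble_mean(*model_predictions):
    """Unweighted average of predictions from several models."""
    return sum(model_predictions) / len(model_predictions)


# Hypothetical predictions at one map cell from three individual models.
eco_glm = weighted_aggregate([0.7, 0.4, 0.2], [1.0, 3.0, 8.0], bandwidth=2.0)
eco_rf = weighted_aggregate([0.9, 0.5, 0.1], [1.0, 3.0, 8.0], bandwidth=2.0)

# Hypothetical global GLM and Random Forest predictions for the same cell.
global_glm, global_rf = 0.55, 0.65

# Ensemble of global and ecologically weighted models, as in the abstract.
suitability = ensemble_mean(global_glm, global_rf, eco_glm, eco_rf)
```

In this sketch the same `weighted_aggregate` function serves both geographic and ecological weighting; only the meaning of the distances changes.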
Pages: 15