Evaluating predictive models of species' distributions: criteria for selecting optimal models

被引:902
|
作者
Anderson, RP
Lew, D
Peterson, AT
机构
[1] Amer Museum Nat Hist, Div Vertebrate Zool Mammal, New York, NY 10024 USA
[2] Univ Kansas, Museum Nat Hist, Lawrence, KS 66045 USA
[3] Univ Kansas, Biodivers Res Ctr, Lawrence, KS 66045 USA
[4] Univ Kansas, Dept Ecol & Evolutionary Biol, Lawrence, KS 66045 USA
[5] Museo Hist Nat La Salle, Fundac La Salle, Caracas 1010A, Venezuela
关键词
asymmetrical errors; commission; confusion matrix; GARP; genetic algorithms; omission; range;
D O I
10.1016/S0304-3800(02)00349-6
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
The Genetic Algorithm for Rule-Set Prediction (GARP) is one of several current approaches to modeling species' distributions using occurrence records and environmental data. Because of stochastic elements in the algorithm and underdetermination of the system (multiple solutions with the same value for the optimization criterion), no unique solution is produced. Furthermore, current implementations of GARP utilize only presence data-rather than both presence and absence, the more general case. Hence, variability among GARP models, which is typical of genetic algorithms, and complications in interpreting results based on asymmetrical (presence-only) input data make model selection critical. Generally, some locality records are randomly selected to build a distributional model, with others set aside to evaluate it. Here, we use intrinsic and extrinsic measures of model performance to determine whether optimal models can be identified based on objective intrinsic criteria, without resorting to an independent test data set. We modeled potential distributions of two rodents (Heteromys anomalus and Microryzomys minutus) and one passerine bird (Carpodacus mexicanus), creating 20 models for each species. For each model, we calculated intrinsic and extrinsic measures of omission and commission error, as well as composite indices of overall error. Although intrinsic and extrinsic composite measures of overall model performance were sometimes loosely related to each other, none was consistently associated with expert-judged model quality. In contrast, intrinsic and extrinsic measures were highly correlated for both omission and commission in the two widespread species (H. anomalus and C mexicanus). Furthermore, a clear inverse relationship existed between omission and commission there, and the best models were consistently found at low levels of omission and moderate-to-high commission values. In contrast, all models for M. minutus showed low values of both omission and commission. Because models are based only on presence data (and not all areas are adequately sampled), the corm-nission index reflects not only true commission error but also a component that results from undersampled areas that the species actually inhabits. We here propose an operational procedure for determining an optimal region of the omission/commission relationship and thus selecting high-quality GARP models. Our implementation of this technique for H. anomalus gave a much more reasonable estimation of the species' potential distribution than did the original suite of models. These findings are relevant to evaluation of other distributional-modeling techniques based on presence-only data and should also be considered with other machine-learning applications modified for use with asymmetrical input data. (C) 2002 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:211 / 232
页数:22
相关论文
共 50 条
  • [1] Toward selecting optimal predictive multiscale models
    Tan, Jingye
    Liang, Baoshan
    Singh, Pratyush Kumar
    Farrell-Maupin, Kathryn A.
    Faghihi, Danial
    COMPUTER METHODS IN APPLIED MECHANICS AND ENGINEERING, 2022, 402
  • [2] Criteria for Selecting Software Development Models
    Ben-Zahia, Mohamed A.
    Jaluta, Ibrahim
    2014 GLOBAL SUMMIT ON COMPUTER & INFORMATION TECHNOLOGY (GSCIT), 2014,
  • [3] Evaluating language models within a predictive framework:: An analysis of ranking distributions
    Alain, Pierre
    Boeffard, Olivier
    Barbot, Nelly
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2006, 4188 : 319 - 326
  • [4] SELECTING SOCIOECONOMIC ASSESSMENT MODELS - A DISCUSSION OF CRITERIA AND SELECTED MODELS
    MURDOCK, SH
    LEISTRITZ, FL
    JOURNAL OF ENVIRONMENTAL MANAGEMENT, 1980, 10 (03) : 241 - 252
  • [5] A framework for evaluating predictive models
    Tan, Yee-Leng
    Saffari, Seyed Ehsan
    Tan, Nigel Choon Kiat
    JOURNAL OF CLINICAL EPIDEMIOLOGY, 2022, 150 : 188 - 190
  • [7] OPTIMAL SIZE OF PREDICTIVE MODELS
    HAKANSON, L
    ECOLOGICAL MODELLING, 1995, 78 (03) : 195 - 204
  • [8] On criteria for evaluating models of absolute risks
    Gail, MH
    Pfeiffer, RM
    BIOSTATISTICS, 2005, 6 (02) : 227 - 239
  • [9] Predictive models of fish species distributions: A note on proper validation and chance predictions
    Olden, JD
    Jackson, DA
    Peres-Neto, PR
    TRANSACTIONS OF THE AMERICAN FISHERIES SOCIETY, 2002, 131 (02) : 329 - 336
  • [10] MODELS FOR SELECTING OPTIMAL TOBACCO MARKET LOCATIONS
    WENSINK, RB
    SOWELL, RS
    TRANSACTIONS OF THE ASAE, 1977, 20 (06): : 1194 - 1200