Evaluating predictive models of species' distributions: criteria for selecting optimal models

Cited by: 902

Authors:

Anderson, RP

Lew, D

Peterson, AT

Affiliations:

[1] Amer Museum Nat Hist, Div Vertebrate Zool Mammal, New York, NY 10024 USA

[2] Univ Kansas, Museum Nat Hist, Lawrence, KS 66045 USA

[3] Univ Kansas, Biodivers Res Ctr, Lawrence, KS 66045 USA

[4] Univ Kansas, Dept Ecol & Evolutionary Biol, Lawrence, KS 66045 USA

[5] Museo Hist Nat La Salle, Fundac La Salle, Caracas 1010A, Venezuela

Source:

ECOLOGICAL MODELLING | 2003 / Vol. 162 / Issue 03

Keywords:

asymmetrical errors; commission; confusion matrix; GARP; genetic algorithms; omission; range;

DOI:

10.1016/S0304-3800(02)00349-6

Chinese Library Classification number:

Q14 [Ecology (bioecology)];

Discipline classification code:

071012 ; 0713 ;

Abstract:

The Genetic Algorithm for Rule-Set Prediction (GARP) is one of several current approaches to modeling species' distributions using occurrence records and environmental data. Because of stochastic elements in the algorithm and underdetermination of the system (multiple solutions with the same value for the optimization criterion), no unique solution is produced. Furthermore, current implementations of GARP utilize only presence data, rather than both presence and absence, the more general case. Hence, variability among GARP models, which is typical of genetic algorithms, and complications in interpreting results based on asymmetrical (presence-only) input data make model selection critical. Generally, some locality records are randomly selected to build a distributional model, with others set aside to evaluate it. Here, we use intrinsic and extrinsic measures of model performance to determine whether optimal models can be identified based on objective intrinsic criteria, without resorting to an independent test data set. We modeled potential distributions of two rodents (Heteromys anomalus and Microryzomys minutus) and one passerine bird (Carpodacus mexicanus), creating 20 models for each species. For each model, we calculated intrinsic and extrinsic measures of omission and commission error, as well as composite indices of overall error. Although intrinsic and extrinsic composite measures of overall model performance were sometimes loosely related to each other, none was consistently associated with expert-judged model quality. In contrast, intrinsic and extrinsic measures were highly correlated for both omission and commission in the two widespread species (H. anomalus and C. mexicanus). Furthermore, a clear inverse relationship existed between omission and commission there, and the best models were consistently found at low levels of omission and moderate-to-high commission values. In contrast, all models for M. minutus showed low values of both omission and commission. Because models are based only on presence data (and not all areas are adequately sampled), the commission index reflects not only true commission error but also a component that results from undersampled areas that the species actually inhabits. We here propose an operational procedure for determining an optimal region of the omission/commission relationship and thus selecting high-quality GARP models. Our implementation of this technique for H. anomalus gave a much more reasonable estimation of the species' potential distribution than did the original suite of models. These findings are relevant to evaluation of other distributional-modeling techniques based on presence-only data and should also be considered with other machine-learning applications modified for use with asymmetrical input data. © 2002 Elsevier Science B.V. All rights reserved.
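
The abstract does not spell out the operational procedure, so the following is only a rough sketch of the general idea of filtering on omission and then looking at commission. It assumes that omission is measured as the fraction of evaluation presence records falling outside the predicted area, and that the commission index is approximated by the fraction of study-area cells predicted present. The function names, the `max_omission` threshold, and the "closest to the median commission" rule are all hypothetical choices made for illustration, not the authors' method.

```python
from statistics import median


def omission_rate(predicted_present, presence_cells):
    """Fraction of known presence cells that the model predicts as absent."""
    missed = sum(1 for cell in presence_cells if cell not in predicted_present)
    return missed / len(presence_cells)


def commission_index(predicted_present, n_cells):
    """Fraction of all study-area cells predicted present (assumed proxy)."""
    return len(predicted_present) / n_cells


def select_models(models, presence_cells, n_cells, max_omission=0.05, keep=2):
    """Hypothetical selection rule: keep low-omission models, then prefer
    those whose commission index is closest to the median of that subset.

    models: dict mapping a model name to the set of cell ids predicted present.
    """
    scored = [
        (name,
         omission_rate(pred, presence_cells),
         commission_index(pred, n_cells))
        for name, pred in models.items()
    ]
    low_omission = [s for s in scored if s[1] <= max_omission]
    if not low_omission:
        return []
    centre = median(s[2] for s in low_omission)
    # sorted() is stable, so ties keep their original order
    ranked = sorted(low_omission, key=lambda s: abs(s[2] - centre))
    return [name for name, _, _ in ranked[:keep]]
```

With 20 replicate models per species, a call such as `select_models(models, test_cells, n_cells)` would return the names of the retained models; the omission cutoff and the number kept would need to be set for the data at hand.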


Pages: 211-232

Number of pages: 22

