Improving ecological niche models by data mining large environmental datasets for surrogate models

被引:31
|
作者
Stockwell, DRB [1 ]
机构
[1] Univ Calif San Diego, San Diego Supercomp Ctr, La Jolla, CA 92093 USA
基金
美国国家科学基金会;
关键词
WhyWhere; ecological niche modeling; surrogate models; data mining; remote sensing;
D O I
10.1016/j.ecolmodel.2005.05.029
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
WhyWhere is a new ecological niche modeling (ENM) algorithm for mapping and explaining the distribution of species. The algorithm uses image processing methods to efficiently sift through large amounts of data to find the few variables that best predict species occurrence. The purpose of this paper is to describe and justify the main parameterizations and to show preliminary success at rapidly providing accurate, scalable, and simple ENMs. Preliminary results for six species of plants and animals in different regions indicate a significant (p < 0.01) 14% increase in accuracy over the GARP algorithm using models with few, typically two, variables. The increase is attributed to access to additional data, particularly remotely sensed monthly versus annual climate averages. WhyWhere is also six times faster than GARP on large datasets. A data mining based approach with transparent access to remote data archives is a new paradigm for ENM, particularly suited to finding correlates in large databases of fine resolution surfaces. Software for WhyWhere is freely available, both as a service and in a desktop downloadable form from the web site http://biodi.sdsc.edu/ww-home.html. (c) 2005 Elsevier B.V. All rights reserved.
引用
收藏
页码:188 / 196
页数:9
相关论文
共 50 条
  • [31] Regional Variation Exaggerates Ecological Divergence in Niche Models
    Godsoe, William
    SYSTEMATIC BIOLOGY, 2010, 59 (03) : 298 - 306
  • [32] Large language models as surrogate models in evolutionary algorithms: A preliminary study
    Hao, Hao
    Zhang, Xiaoqun
    Zhou, Aimin
    SWARM AND EVOLUTIONARY COMPUTATION, 2024, 91
  • [33] Constructing large scale surrogate models from big data and artificial intelligence
    Edwards, Richard E.
    New, Joshua
    Parker, Lynne E.
    Cui, Borui
    Dong, Jin
    APPLIED ENERGY, 2017, 202 : 685 - 699
  • [34] Mixture of Regression Models for Large Spatial Datasets
    Kazor, Karen
    Hering, Amanda S.
    TECHNOMETRICS, 2019, 61 (04) : 507 - 523
  • [35] High accuracy polypharmacology models for large datasets
    Swamidass, S. Joshua
    Calhoun, Bradley
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2012, 243
  • [36] A survey of datasets in medicine for large language models
    Zhang, Deshiwei
    Xue, Xiaojuan
    Gao, Peng
    Jin, Zhijuan
    Hu, Menghan
    Wu, Yue
    Ying, Xiayang
    INTELLIGENCE & ROBOTICS, 2024, 4 (04): : 457 - 478
  • [37] A decomposition algorithm for large scale surrogate models
    Koroglu, Serdar A.
    Ergin, Ahmet
    SUSTAINABLE MARITIME TRANSPORTATION AND EXPLOITATION OF SEA RESOURCES, VOL 1, 2012, : 305 - 309
  • [38] Improving the representation of stream water sources in surrogate nutrient models with water isotope data
    Ross, C. A.
    Phillips, A. K.
    Gospodyn, L.
    Oswald, C. J.
    Wellen, C. C.
    Sorichetti, R. J.
    SCIENCE OF THE TOTAL ENVIRONMENT, 2023, 892
  • [39] Measurement Uncertainty in Ecological and Environmental Models
    Damgaard, Christian
    TRENDS IN ECOLOGY & EVOLUTION, 2020, 35 (10) : 871 - 873
  • [40] Data mining for selective visualization of large spatial datasets
    Shekhar, S
    Lu, CT
    Zhang, PS
    Liu, RL
    14TH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2002, : 41 - 48