A method for measuring the relative information content of data from different monitoring protocols

被引:52
|
作者
Munson, M. Arthur [1 ]
Caruana, Rich [2 ]
Fink, Daniel [3 ]
Hochachka, Wesley M. [3 ]
Iliff, Marshall [3 ]
Rosenberg, Kenneth V. [3 ]
Sheldon, Daniel [1 ]
Sullivan, Brian L. [3 ]
Wood, Christopher [3 ]
Kelling, Steve [3 ]
机构
[1] Cornell Univ, Dept Comp Sci, Ithaca, NY 14853 USA
[2] Microsoft Corp, Redmond, WA 98052 USA
[3] Cornell Lab Ornithol, Ithaca, NY USA
来源
METHODS IN ECOLOGY AND EVOLUTION | 2010年 / 1卷 / 03期
基金
美国国家科学基金会;
关键词
cross-data validation; data efficiency ratio; data quality; eBird; North American Breeding Bird Survey; species distribution model; BIRD; PROBABILITY; PREDICTION;
D O I
10.1111/j.2041-210X.2010.00035.x
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
1. Species monitoring is an essential component of assessing conservation status, predicting effects of habitat change and establishing management and conservation priorities. The pervasive access to the Internet has led to the development of several extensive monitoring projects that engage massive networks of volunteers who provide observations following relatively unstructured protocols. However, the value of these data is largely unknown. 2. We develop a novel cross-data validation method for measuring the value of survey data from one source (e. g. an Internet checklist program) relative to a second, benchmark data source. The method fits a model to the data of interest and validates the model using benchmark data, allowing us to isolate the training data's information content from its biases. We also define a data efficiency ratio to quantify the relative efficiency of the data sources. 3. We apply our cross-data validation method to quantify the value of data collected in eBird - a western hemisphere, year-round citizen science bird checklist project - relative to data from the highly standardized North American Breeding Bird Survey (BBS). The results show that eBird data contain information similar in quality to that in BBS data, while the information per BBS datum is higher. 4. We suggest that these methods have more general use in evaluating the suitability of sources of data for addressing specific questions for taxa of interest.
引用
收藏
页码:263 / 273
页数:11
相关论文
共 50 条