A method for measuring the relative information content of data from different monitoring protocols

被引:52
|
作者
Munson, M. Arthur [1 ]
Caruana, Rich [2 ]
Fink, Daniel [3 ]
Hochachka, Wesley M. [3 ]
Iliff, Marshall [3 ]
Rosenberg, Kenneth V. [3 ]
Sheldon, Daniel [1 ]
Sullivan, Brian L. [3 ]
Wood, Christopher [3 ]
Kelling, Steve [3 ]
机构
[1] Cornell Univ, Dept Comp Sci, Ithaca, NY 14853 USA
[2] Microsoft Corp, Redmond, WA 98052 USA
[3] Cornell Lab Ornithol, Ithaca, NY USA
来源
METHODS IN ECOLOGY AND EVOLUTION | 2010年 / 1卷 / 03期
基金
美国国家科学基金会;
关键词
cross-data validation; data efficiency ratio; data quality; eBird; North American Breeding Bird Survey; species distribution model; BIRD; PROBABILITY; PREDICTION;
D O I
10.1111/j.2041-210X.2010.00035.x
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
1. Species monitoring is an essential component of assessing conservation status, predicting effects of habitat change and establishing management and conservation priorities. The pervasive access to the Internet has led to the development of several extensive monitoring projects that engage massive networks of volunteers who provide observations following relatively unstructured protocols. However, the value of these data is largely unknown. 2. We develop a novel cross-data validation method for measuring the value of survey data from one source (e. g. an Internet checklist program) relative to a second, benchmark data source. The method fits a model to the data of interest and validates the model using benchmark data, allowing us to isolate the training data's information content from its biases. We also define a data efficiency ratio to quantify the relative efficiency of the data sources. 3. We apply our cross-data validation method to quantify the value of data collected in eBird - a western hemisphere, year-round citizen science bird checklist project - relative to data from the highly standardized North American Breeding Bird Survey (BBS). The results show that eBird data contain information similar in quality to that in BBS data, while the information per BBS datum is higher. 4. We suggest that these methods have more general use in evaluating the suitability of sources of data for addressing specific questions for taxa of interest.
引用
收藏
页码:263 / 273
页数:11
相关论文
共 50 条
  • [21] From monitoring data to experiment information - Monitoring of grid scientific workflows
    Balis, Bartosz
    Bubak, Marian
    Pelczar, Michal
    E-SCIENCE 2007: THIRD IEEE INTERNATIONAL CONFERENCE ON E-SCIENCE AND GRID COMPUTING, PROCEEDINGS, 2007, : 77 - +
  • [22] Measuring marker information content by the ambiguity of block boundaries observed in dense SNP data
    Gu, C. Charles
    Yu, Kai
    Boerwinkle, Eric
    ANNALS OF HUMAN GENETICS, 2007, 71 : 127 - 140
  • [23] A relative method for measuring nitric oxide (NO) fluxes from forest soils
    Wang, Jiaqi
    Zhang, Xiaoshan
    Wang, Zhangwei
    Kang, Ronghua
    SCIENCE OF THE TOTAL ENVIRONMENT, 2017, 574 : 544 - 552
  • [24] A Content Analysis of State Data Collection: Protocols for Measuring Post-School Outcomes for Students With Learning Disabilities
    Gerber, Paul J.
    De Arment, Serra T.
    Batalo, Cecilia G.
    LEARNING DISABILITIES-A MULTIDISCIPLINARY JOURNAL, 2014, 20 (03) : 133 - 142
  • [25] Capacitance technique for measuring moisture content using dielectric data - An immersion method
    Tripathi, RK
    Gupta, M
    Shukla, JP
    ICDL 1996 - 12TH INTERNATIONAL CONFERENCE ON CONDUCTION AND BREAKDOWN IN DIELECTRIC LIQUIDS, 1996, : 440 - 441
  • [26] The method of different-type data fusion for nuclear monitoring
    Shen Qiang
    Huang Li
    Li Shi-yi
    IMECS 2008: INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2008, : 521 - 523
  • [27] EVALUATION METHOD FOR INFORMATION CONTENT OF RASTER DATA USING FRACTAL DIMENSION
    Osaragi, Toshihiro
    XXIV ISPRS CONGRESS IMAGING TODAY, FORESEEING TOMORROW, COMMISSION IV, 2022, 5-4 : 75 - 81
  • [28] INS aided relative positioning method of data link with unsynchronized source information
    Yan F.
    Fu J.
    Zhang C.
    Pang Z.
    Zhongguo Guanxing Jishu Xuebao/Journal of Chinese Inertial Technology, 2020, 28 (03): : 360 - 364
  • [29] A method for measuring distance from a training data set
    Juutilainen, Ilmari
    Roning, Juha
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2007, 36 (13-16) : 2625 - 2639
  • [30] Evaluation of different methods for extracting relative spectral emissivity information from simulated thermal infrared multispectral scanner data
    Li, ZL
    Becker, F
    Stoll, MP
    Wan, ZM
    REMOTE SENSING OF ENVIRONMENT, 1999, 69 (02) : 122 - 138