An ontology for describing and synthesizing ecological observation data

被引:169
作者
Madin, Joshua [1 ]
Bowers, Shawn
Schildhauer, Mark
Krivov, Sergeui
Pennington, Deana
Villa, Ferdinando
机构
[1] Univ Calif Santa Barbara, Natl Ctr Ecol Anal & Synth, Santa Barbara, CA 93101 USA
[2] Univ Calif Davis, Genome Ctr, Davis, CA 95616 USA
[3] Univ Vermont, Gund Inst Ecol Econ, Burlington, VT 05405 USA
[4] Univ New Mexico, Albuquerque, NM 87131 USA
[5] Macquarie Univ, Dept Biol Sci, N Ryde, NSW 2109, Australia
关键词
ecology; observation; measurement; ontology; data discovery; data integration;
D O I
10.1016/j.ecoinf.2007.05.004
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
Research in ecology increasingly. relies on the integration of small, focused studies, to produce larger datasets that allow for more powerful, synthetic analyses. The results of these synthetic analyses are critical in guiding decisions about how to sustainably manage our natural environment, so it is important for researchers to effectively discover relevant data, and appropriately integrate these within their analyses. However, ecological data encompasses an extremely broad range of data types, structures, and semantic concepts. Moreover, ecological data is widely distributed, with few well-established repositories or standard protocols for their archiving and retrieval. These factors make the discovery and integration of ecological data sets a highly labor-intensive task. Metadata standards such as the Ecological Metadata Language and Darwin Core are important steps for improving our ability to discover and access ecological data, but are limited to describing only a few, relatively specific aspects of data content (e.g., data owner and contact information, variable "names", keyword descriptions, etc.). A more flexible and powerful way to capture the semantic subtleties of complex ecological data, its structure and contents, and the interrelationships among data variables is needed. We present a formal ontology for capturing the semantics of generic scientific observation and measurement. The ontology provides a convenient basis for adding detailed semantic annotations to scientific data, which crystallize the inherent "meaning" of observational data. The ontology can be used to characterize the context of an observation (e.g., space and time), and clarify inter-observational relationships such as dependency hierarchies (e.g., nested experimental observations) and meaningful dimensions within the data (e.g., axes for cross-classified categorical summarization). It also enables the robust description of measurement units (e.g., grams of carbon per liter of seawater), and can facilitate automatic unit conversions (e.g., pounds to kilograms). The ontology can be easily extended with specialized domain vocabularies, making it both broadly applicable and highly custornizable. Finally, we describe the utility of the ontology for enriching the capabilities of data discovery and integration processes. Published by Elsevier B.V.
引用
收藏
页码:279 / 296
页数:18
相关论文
共 53 条
[1]  
[Anonymous], 1977, TREATISE BASIC PHILO
[2]  
[Anonymous], 2004, OWL WEB ONTOLOGY LAN
[3]  
[Anonymous], ENTITY RELATIONSHIP
[4]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[5]  
Baader F., 2003, DESCRIPTION LOGIC HD
[6]   Ontologies in biology: Design, applications and future challenges [J].
Bard, JBL ;
Rhee, SY .
NATURE REVIEWS GENETICS, 2004, 5 (03) :213-222
[7]  
Batini C., 1992, CONCEPTUAL DATABASE
[8]  
BERKLEY C, 2001, P 13 I C SCI STAT DA
[9]  
BEZKLEY C, 2005, P 17 INT C SCI STAT
[10]  
BOWERS S, 2005, LECT NOTES COMPUT SC, V3372