The Gene Ontology Annotation (GOA) Database: sharing knowledge in Uniprot with Gene Ontology

被引:669
作者
Camon, E [1 ]
Magrane, M [1 ]
Barrell, D [1 ]
Lee, V [1 ]
Dimmer, E [1 ]
Maslen, J [1 ]
Binns, D [1 ]
Harte, N [1 ]
Lopez, R [1 ]
Apweiler, R [1 ]
机构
[1] EBI, Cambridge CB10 1SD, England
关键词
D O I
10.1093/nar/gkh021
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The Gene Ontology Annotation (GOA) database (http://www.ebi.ac.uk/GOA) aims to provide high-quality electronic and manual annotations to the UniProt Knowledgebase (Swiss-Prot, TrEMBL and PIR-PSD) using the standardized vocabulary of the Gene Ontology (GO). As a supplementary archive of GO annotation, GOA promotes a high level of integration of the knowledge represented in UnIProt with other databases. This is achieved by converting UniProt annotation into a recognized computational format. GOA provides annotated entries for nearly 60 000 species (GOA-SPTr) and is the largest and most comprehensive open-source contributor of annotations to the GO Consortium annotation effort. By integrating GO annotations from other model organism groups, GOA consolidates specialized knowledge and expertise to ensure the data remain a key reference for up-to-date biological information. Furthermore, the GOA database fully endorses the Human Proteomics Initiative by prioritizing the annotation of proteins likely to benefit human health and disease. In addition to a non-redundant set of annotations to the human proteome (GOA-Human) and monthly releases of its GO annotation for all species (GOA-SPTr), a series of GO mapping files and specific cross-references in other databases are also regularly distributed. GOA can be queried through a simple user-friendly web interface or downloaded in a parsable format via the EBI and GO FTP websites. The GOA data set can be used to enhance the annotation of particular model organism or gene expression data sets, although increasingly it has been used to evaluate GO predictions generated from text mining or protein interaction experiments. In 2004, the GOA team will build on its success and will continue to supplement the functional annotation of UniProt and work towards enhancing the ability of scientists to access all available biological information. Researchers wishing to query or contribute to the GOA project are encouraged to email: goa@ebi.ac.uk.
引用
收藏
页码:D262 / D266
页数:5
相关论文
共 29 条
[1]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[2]  
Biswas Margaret, 2002, Brief Bioinform, V3, P285, DOI 10.1093/bib/3.3.285
[3]   The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003 [J].
Boeckmann, B ;
Bairoch, A ;
Apweiler, R ;
Blatter, MC ;
Estreicher, A ;
Gasteiger, E ;
Martin, MJ ;
Michoud, K ;
O'Donovan, C ;
Phan, I ;
Pilbout, S ;
Schneider, M .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :365-370
[4]   ArrayExpress - a public repository for microarray gene expression data at the EBI [J].
Brazma, A ;
Parkinson, H ;
Sarkans, U ;
Shojatalab, M ;
Vilo, J ;
Abeygunawardena, N ;
Holloway, E ;
Kapushesky, M ;
Kemmeren, P ;
Lara, GG ;
Oezcimen, A ;
Rocca-Serra, P ;
Sansone, SA .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :68-71
[5]   NIH pledges cash for global protein database [J].
Butler, D .
NATURE, 2002, 419 (6903) :101-101
[6]   The gene ontology annotation (GOA) project: Implementation of GO in SWISS-PROT, TrEMBL, and InterPro [J].
Camon, E ;
Magrane, M ;
Barrell, D ;
Binns, D ;
Fleischmann, W ;
Kersey, P ;
Mulder, N ;
Oinn, T ;
Maslen, J ;
Cox, A ;
Apweiler, R .
GENOME RESEARCH, 2003, 13 (04) :662-672
[7]   The Gene Ontology Annotation (GOA) project - application of GO in SWISS-PROT, TrEMBL and InterPro [J].
Camon, E ;
Barrell, D ;
Brooksbank, C ;
Magrane, M ;
Apweiler, R .
COMPARATIVE AND FUNCTIONAL GENOMICS, 2003, 4 (01) :71-74
[8]   Ensembl 2002: accommodating comparative genomics [J].
Clamp, M ;
Andrews, D ;
Barker, D ;
Bevan, P ;
Cameron, G ;
Chen, Y ;
Clark, L ;
Cox, T ;
Cuff, J ;
Curwen, V ;
Down, T ;
Durbin, R ;
Eyras, E ;
Gilbert, J ;
Hammond, M ;
Hubbard, T ;
Kasprzyk, A ;
Keefe, D ;
Lehvaslaiho, H ;
Iyer, V ;
Melsopp, C ;
Mongin, E ;
Pettett, R ;
Potter, S ;
Rust, A ;
Schmidt, E ;
Searle, S ;
Slater, G ;
Smith, J ;
Spooner, W ;
Stabenau, A ;
Stalker, J ;
Stupka, E ;
Ureta-Vidal, A ;
Vastrik, I ;
Birney, E .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :38-42
[9]   The Nuclear Protein Database (NPD): sub-nuclear localisation and functional annotation of the nuclear proteome [J].
Dellaire, G ;
Farrall, R ;
Bickmore, WA .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :328-330
[10]   Saccharomyces Genome Database (SGD) provides secondary gene annotation using the Gene Ontology (GO) [J].
Dwight, SS ;
Harris, MA ;
Dolinski, K ;
Ball, CA ;
Binkley, G ;
Christie, KR ;
Fisk, DG ;
Issel-Tarver, L ;
Schroeder, M ;
Sherlock, G ;
Sethuraman, A ;
Weng, S ;
Botstein, D ;
Cherry, JM .
NUCLEIC ACIDS RESEARCH, 2002, 30 (01) :69-72