Evaluating techniques for metagenome annotation using simulated sequence data

被引:46
|
作者
Randle-Boggis, Richard J. [1 ]
Helgason, Thorunn [1 ]
Sapp, Melanie [2 ]
Ashton, Peter D. [1 ]
机构
[1] Univ York, Dept Biol, York YO10 5DD, N Yorkshire, England
[2] Fera Sci Ltd, York YO41 1LZ, N Yorkshire, England
关键词
DNA sequencing; metagenomics; metagenome analysis; microbial ecology; sequence annotation; MICROBIAL DIVERSITY; PROTEIN; IDENTIFICATION; SERVER; TOOL;
D O I
10.1093/femsec/fiw095
中图分类号
Q93 [微生物学];
学科分类号
071005 ; 100705 ;
摘要
The advent of next-generation sequencing has allowed huge amounts of DNA sequence data to be produced, advancing the capabilities of microbial ecosystem studies. The current challenge is to identify from which microorganisms and genes the DNA originated. Several tools and databases are available for annotating DNA sequences. The tools, databases and parameters used can have a significant impact on the results: naive choice of these factors can result in a false representation of community composition and function. We use a simulated metagenome to show how different parameters affect annotation accuracy by evaluating the sequence annotation performances of MEGAN, MG-RAST, One Codex and Megablast. This simulated metagenome allowed the recovery of known organism and function abundances to be quantitatively evaluated, which is not possible for environmental metagenomes. The performance of each program and database varied, e.g. One Codex correctly annotated many sequences at the genus level, whereas MG-RAST RefSeq produced many false positive annotations. This effect decreased as the taxonomic level investigated increased. Selecting more stringent parameters decreases the annotation sensitivity, but increases precision. Ultimately, there is a trade-off between taxonomic resolution and annotation accuracy. These results should be considered when annotating metagenomes and interpreting results from previous studies.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] MULTIPLE TECHNIQUES FOR LUNAR SURFACE MINERALS MAPPING USING SIMULATED DATA
    He, Haixia
    Zhang, Bing
    Chen, Zhengchao
    Li, Ru
    2009 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, VOLS 1-5, 2009, : 2003 - +
  • [22] Evaluating translational correspondence using annotation projection
    Hwa, R
    Resnik, P
    Weinberg, A
    Kolak, O
    40TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 2002, : 392 - 399
  • [23] Evaluating image browsers using structured annotation
    Müller, W
    Marchand-Maillet, S
    Müller, H
    Squire, DM
    Pun, T
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2001, 52 (11): : 961 - 968
  • [24] TECHNIQUES FOR EVALUATING DISCREPANT DATA
    RAJPUT, MU
    MACMAHON, TD
    NUCLEAR INSTRUMENTS & METHODS IN PHYSICS RESEARCH SECTION A-ACCELERATORS SPECTROMETERS DETECTORS AND ASSOCIATED EQUIPMENT, 1992, 312 (1-2): : 289 - 295
  • [25] Evaluating intraspecific "Network" construction methods using simulated sequence data: Do existing algorithms outperform the global maximum parsimony approach?
    Cassens, I
    Mardulyn, P
    Milinkovitch, MC
    SYSTEMATIC BIOLOGY, 2005, 54 (03) : 363 - 372
  • [26] Automated Gene Ontology annotation for anonymous sequence data
    Hennig, S
    Groth, D
    Lehrach, H
    NUCLEIC ACIDS RESEARCH, 2003, 31 (13) : 3712 - 3715
  • [27] Influenza sequence validation and annotation using VADR
    Calhoun, Vincent C.
    Hatcher, Eneida L.
    Yankie, Linda
    Nawrocki, Eric P.
    DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2024, 2024
  • [28] An annotation-free method for evaluating privacy protection techniques in videos
    Nawaz, Tahir
    Ferryman, James
    2015 12TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS), 2015,
  • [29] Invited Commentary: Evaluating Vaccination Programs Using Genetic Sequence Data
    Halloran, M. Elizabeth
    Holmes, Edward C.
    AMERICAN JOURNAL OF EPIDEMIOLOGY, 2009, 170 (12) : 1464 - 1466
  • [30] Enzyme Reaction Annotation Using Cloud Techniques
    Huang, Chuan-Ching
    Lin, Chun-Yuan
    Chang, Cheng-Wen
    Tang, Chuan Yi
    BIOMED RESEARCH INTERNATIONAL, 2013, 2013