Accuracy and quality assessment of 454 GS-FLX Titanium pyrosequencing

被引:269
|
作者
Gilles, Andre [2 ]
Meglecz, Emese [2 ]
Pech, Nicolas [2 ]
Ferreira, Stephanie [3 ]
Malausa, Thibaut [4 ]
Martin, Jean-Francois [1 ]
机构
[1] INRA IRD Cirad Montpellier SupAgro, CBGP, UMR, Campus Int Baillarguet,CS 30016, F-34988 Montferrier Sur Lez, France
[2] Aix Marseille Univ, CNRS, IRD,Ctr St Charles, UMR IMEP 6116,Equipe Evolut Genome Environm, F-13331 Marseille 3, France
[3] Genoscreen, Genom Platform & R&D, F-59000 Lille, France
[4] INRA, UMR 1301, Equipe BPI, F-06903 Sophia Antipolis, France
来源
BMC GENOMICS | 2011年 / 12卷
关键词
RARE BIOSPHERE; NEW-GENERATION; DISCOVERY; DIVERSITY; WRINKLES; ERRORS; RATES;
D O I
10.1186/1471-2164-12-245
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: The rapid evolution of 454 GS-FLX sequencing technology has not been accompanied by a reassessment of the quality and accuracy of the sequences obtained. Current strategies for decision-making and error-correction are based on an initial analysis by Huse et al. in 2007, for the older GS20 system based on experimental sequences. We analyze here the quality of 454 sequencing data and identify factors playing a role in sequencing error, through the use of an extensive dataset for Roche control DNA fragments. Results: We obtained a mean error rate for 454 sequences of 1.07%. More importantly, the error rate is not randomly distributed; it occasionally rose to more than 50% in certain positions, and its distribution was linked to several experimental variables. The main factors related to error are the presence of homopolymers, position in the sequence, size of the sequence and spatial localization in PT plates for insertion and deletion errors. These factors can be described by considering seven variables. No single variable can account for the error rate distribution, but most of the variation is explained by the combination of all seven variables. Conclusions: The pattern identified here calls for the use of internal controls and error-correcting base callers, to correct for errors, when available (e. g. when sequencing amplicons). For shotgun libraries, the use of both sequencing primers and deep coverage, combined with the use of random sequencing primer sites should partly compensate for even high error rates, although it may prove more difficult than previous thought to distinguish between low-frequency alleles and errors.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Accuracy and quality assessment of 454 GS-FLX Titanium pyrosequencing
    André Gilles
    Emese Meglécz
    Nicolas Pech
    Stéphanie Ferreira
    Thibaut Malausa
    Jean-François Martin
    BMC Genomics, 12
  • [2] Representativeness of microsatellite distributions in genomes, as revealed by 454 GS-FLX Titanium pyrosequencing
    Martin, Jean-Francois
    Pech, Nicolas
    Meglecz, Emese
    Ferreira, Stephanie
    Costedoat, Caroline
    Dubut, Vincent
    Malausa, Thibaut
    Gilles, Andre
    BMC GENOMICS, 2010, 11
  • [3] Representativeness of microsatellite distributions in genomes, as revealed by 454 GS-FLX Titanium pyrosequencing
    Jean-Francois Martin
    Nicolas Pech
    Emese Meglécz
    Stéphanie Ferreira
    Caroline Costedoat
    Vincent Dubut
    Thibaut Malausa
    André Gilles
    BMC Genomics, 11
  • [4] High-throughput microsatellite isolation through 454 GS-FLX Titanium pyrosequencing of enriched DNA libraries
    Malausa, Thibaut
    Gilles, Andre
    Meglecz, Emese
    Blanquart, Helene
    Duthoy, Stephanie
    Costedoat, Caroline
    Dubut, Vincent
    Pech, Nicolas
    Castagnone-Sereno, Philippe
    Delye, Christophe
    Feau, Nicolas
    Frey, Pascal
    Gauthier, Philippe
    Guillemaud, Thomas
    Hazard, Laurent
    Le Corre, Valerie
    Lung-Escarmant, Brigitte
    Male, Pierre-Jean G.
    Ferreira, Stephanie
    Martin, Jean-Francois
    MOLECULAR ECOLOGY RESOURCES, 2011, 11 (04) : 638 - 644
  • [5] Exploring Bacterial Diversity in Hospital Environments by GS-FLX Titanium Pyrosequencing
    Poza, Margarita
    Gayoso, Carmen
    Gomez, Manuel J.
    Rumbo-Feal, Soraya
    Tomas, Maria
    Aranda, Jesus
    Fernandez, Ana
    Bou, German
    PLOS ONE, 2012, 7 (08):
  • [6] Exploring salivary microbiota in AIDS patients with different periodontal statuses using 454 GS-FLX Titanium pyrosequencing
    Zhang, Fang
    He, Shenghua
    Jin, Jieqi
    Dong, Guangyan
    Wu, Hongkun
    FRONTIERS IN CELLULAR AND INFECTION MICROBIOLOGY, 2015, 5
  • [7] Characterization of Oncidium 'Gower Ramsey' Transcriptomes using 454 GS-FLX Pyrosequencing and Their Application to the Identification of Genes Associated with Flowering Time
    Chang, Yu-Yun
    Chu, Yen-Wei
    Chen, Chi-Wei
    Leu, Wei-Ming
    Hsu, Hsing-Fun
    Yang, Chang-Hsien
    PLANT AND CELL PHYSIOLOGY, 2011, 52 (09) : 1532 - 1545
  • [8] Application of full mitochondrial genome sequencing using 454 GS FLX pyrosequencing
    Mikkelsen, Martin
    Rockenbauer, Eszter
    Wachter, Andrea
    Fendt, Liane
    Zimmermann, Bettina
    Parson, Walther
    Nielsen, Sandra Abel
    Gilbert, Tom
    Willerslev, Eske
    Morling, Niels
    FORENSIC SCIENCE INTERNATIONAL GENETICS SUPPLEMENT SERIES, 2009, 2 (01) : 518 - 519
  • [9] De novo Sequence Assembly and Characterization of Lycoris aurea Transcriptome Using GS FLX Titanium Platform of 454 Pyrosequencing
    Wang, Ren
    Xu, Sheng
    Jiang, Yumei
    Jiang, Jingwei
    Li, Xiaodan
    Liang, Lijian
    He, Jia
    Peng, Feng
    Xia, Bing
    PLOS ONE, 2013, 8 (04):
  • [10] CANGS: A user-friendly utility for processing and analyzing 454 GS-FLX data in biodiversity studies
    Pandey R.V.
    Nolte V.
    Schlötterer C.
    BMC Research Notes, 3 (1)