Accuracy and quality assessment of 454 GS-FLX Titanium pyrosequencing

被引:269
|
作者
Gilles, Andre [2 ]
Meglecz, Emese [2 ]
Pech, Nicolas [2 ]
Ferreira, Stephanie [3 ]
Malausa, Thibaut [4 ]
Martin, Jean-Francois [1 ]
机构
[1] INRA IRD Cirad Montpellier SupAgro, CBGP, UMR, Campus Int Baillarguet,CS 30016, F-34988 Montferrier Sur Lez, France
[2] Aix Marseille Univ, CNRS, IRD,Ctr St Charles, UMR IMEP 6116,Equipe Evolut Genome Environm, F-13331 Marseille 3, France
[3] Genoscreen, Genom Platform & R&D, F-59000 Lille, France
[4] INRA, UMR 1301, Equipe BPI, F-06903 Sophia Antipolis, France
来源
BMC GENOMICS | 2011年 / 12卷
关键词
RARE BIOSPHERE; NEW-GENERATION; DISCOVERY; DIVERSITY; WRINKLES; ERRORS; RATES;
D O I
10.1186/1471-2164-12-245
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: The rapid evolution of 454 GS-FLX sequencing technology has not been accompanied by a reassessment of the quality and accuracy of the sequences obtained. Current strategies for decision-making and error-correction are based on an initial analysis by Huse et al. in 2007, for the older GS20 system based on experimental sequences. We analyze here the quality of 454 sequencing data and identify factors playing a role in sequencing error, through the use of an extensive dataset for Roche control DNA fragments. Results: We obtained a mean error rate for 454 sequences of 1.07%. More importantly, the error rate is not randomly distributed; it occasionally rose to more than 50% in certain positions, and its distribution was linked to several experimental variables. The main factors related to error are the presence of homopolymers, position in the sequence, size of the sequence and spatial localization in PT plates for insertion and deletion errors. These factors can be described by considering seven variables. No single variable can account for the error rate distribution, but most of the variation is explained by the combination of all seven variables. Conclusions: The pattern identified here calls for the use of internal controls and error-correcting base callers, to correct for errors, when available (e. g. when sequencing amplicons). For shotgun libraries, the use of both sequencing primers and deep coverage, combined with the use of random sequencing primer sites should partly compensate for even high error rates, although it may prove more difficult than previous thought to distinguish between low-frequency alleles and errors.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] SINGLE PASS VERY HIGH RESOLUTION HLA GENOTYPING BY NEXT GENERATION SEQUENCING WITH THE 454 LIFE SCIENCES GS FLX AND GS
    Hoglund, Bryan N.
    Holcomb, Cherie L.
    Moonsamy, Priscilla V.
    Goodridge, Damian
    Erlich, Henry A.
    HUMAN IMMUNOLOGY, 2011, 72 : S136 - S136
  • [42] Assessment of Microbial Richness in Pelagic Sediment of Andaman Sea by Bacterial Tag Encoded FLX Titanium Amplicon Pyrosequencing (bTEFAP)
    Sundarakrishnan, Balakrishnan
    Pushpanathan, Muthuirulan
    Jayashree, Sathyanarayanan
    Rajendhran, Jeyaprakash
    Sakthivel, Natarajan
    Jayachandran, Seetharaman
    Gunasekaran, Paramasamy
    INDIAN JOURNAL OF MICROBIOLOGY, 2012, 52 (04) : 544 - 550
  • [43] Assessment of Microbial Richness in Pelagic Sediment of Andaman Sea by Bacterial Tag Encoded FLX Titanium Amplicon Pyrosequencing (bTEFAP)
    Balakrishnan Sundarakrishnan
    Muthuirulan Pushpanathan
    Sathyanarayanan Jayashree
    Jeyaprakash Rajendhran
    Natarajan Sakthivel
    Seetharaman Jayachandran
    Paramasamy Gunasekaran
    Indian Journal of Microbiology, 2012, 52 : 544 - 550
  • [44] Transcriptome Sequencing and De Novo Analysis for Yesso Scallop (Patinopecten yessoensis) Using 454 GS FLX
    Hou, Rui
    Bao, Zhenmin
    Wang, Shan
    Su, Hailin
    Li, Yan
    Du, Huixia
    Hu, Jingjie
    Wang, Shi
    Hu, Xiaoli
    PLOS ONE, 2011, 6 (06):
  • [45] Single pass very high resolution HLA genotyping by next generation sequencing with the 454 Life Sciences GS FLX and GS Junior
    Hoglund, B.
    Holcomb, C. L.
    Moonsamy, P. V.
    Goodridge, D.
    Erlich, H. A.
    TISSUE ANTIGENS, 2011, 77 (05): : 465 - 465
  • [46] Expressed sequence tag analysis of the emu (Dromaius novaehollandiae) pituitary by 454 GS Junior pyrosequencing
    Kim, Ji Eun
    Leung, Frederick C.
    Jiang, Jingwei
    Kwok, Amy H. Y.
    Bennett, Darin C.
    Cheng, Kimberly M.
    POULTRY SCIENCE, 2013, 92 (01) : 90 - 96
  • [47] Metagenomic analysis using long 16S amplicons and the Roche 454 GS FLX+ platform
    Ruecker, O.
    Dangel, A.
    Kotschote, S.
    INTERNATIONAL JOURNAL OF MEDICAL MICROBIOLOGY, 2013, 303 : 80 - 80
  • [48] The analysis of oral microbial communities of wild-type and toll-like receptor 2-deficient mice using a 454 GS FLX Titanium pyrosequencer
    Jongsik Chun
    Kap Y Kim
    Jae-Hak Lee
    Youngnim Choi
    BMC Microbiology, 10
  • [49] The analysis of oral microbial communities of wild-type and toll-like receptor 2-deficient mice using a 454 GS FLX Titanium pyrosequencer
    Chun, Jongsik
    Kim, Kap Y.
    Lee, Jae-Hak
    Choi, Youngnim
    BMC MICROBIOLOGY, 2010, 10 : 101
  • [50] Bacterial tag encoded FLX titanium amplicon pyrosequencing (bTEFAP) based assessment of prokaryotic diversity in metagenome of Lonar soda lake, India
    Dudhagara, Pravin
    Ghelani, Anjana
    Patel, Rajesh
    Chaudhari, Rajesh
    Bhatt, Shreyas
    GENOMICS DATA, 2015, 4 : 8 - 11