Automated sequence preprocessing in a large-scale sequencing environment

Cited by: 29
Authors
Wendl, MC [1]
Dear, S
Hodgson, D
Hillier, L
Affiliations
[1] Washington Univ, Genome Sequencing Ctr, St Louis, MO 63108 USA
[2] Sanger Ctr, Cambridge CB10 1SA, England
Source
GENOME RESEARCH | 1998 / Volume 8 / Issue 9
Funding
Wellcome Trust (UK)
DOI
10.1101/gr.8.9.975
Chinese Library Classification (CLC)
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010; 081704
Abstract
A software system for transforming fragments from four-color fluorescence-based gel electrophoresis experiments into assembled sequence is described. It has been developed for large-scale processing of all trace data, including shotgun and finishing reads, regardless of clone origin. Design considerations are discussed in detail, as are programming implementation and graphic tools. The importance of input validation, record tracking, and use of base quality values is emphasized. Several quality analysis metrics are proposed and applied to sample results from recently sequenced clones. Such quantities prove to be a valuable aid in evaluating modifications of sequencing protocol. The system is in full production use at both the Genome Sequencing Center and the Sanger Centre, for which combined production is approximately 100,000 sequencing reads per week.
Pages: 975-984
Page count: 10