Automated sequence preprocessing in a large-scale sequencing environment

Cited by: 29
Authors
Wendl, MC [1]
Dear, S
Hodgson, D
Hillier, L
Affiliations
[1] Washington Univ, Genome Sequencing Ctr, St Louis, MO 63108 USA
[2] Sanger Ctr, Cambridge CB10 1SA, England
Source
GENOME RESEARCH | 1998 / Volume 8 / Issue 09
Funding
Wellcome Trust (UK);
DOI
10.1101/gr.8.9.975
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
A software system for transforming fragments from four-color fluorescence-based gel electrophoresis experiments into assembled sequence is described. It has been developed for large-scale processing of all trace data, including shotgun and finishing reads, regardless of clone origin. Design considerations are discussed in detail, as are programming implementation and graphic tools. The importance of input validation, record tracking, and use of base quality values is emphasized. Several quality analysis metrics are proposed and applied to sample results from recently sequenced clones. Such quantities prove to be a valuable aid in evaluating modifications of sequencing protocol. The system is in full production use at both the Genome Sequencing Center and the Sanger Centre, where combined production is approximately 100,000 sequencing reads per week.
Pages: 975 - 984
Page count: 10
Related papers
50 in total
  • [1] ASAP: An environment for automated preprocessing of sequencing data
    Torstenson E.S.
    Li B.
    Li C.
    BMC Research Notes, 6 (1)
  • [2] Deciphering genomes through automated large-scale sequencing
    Rowen, L
    Lasky, S
    Hood, L
    METHODS IN MICROBIOLOGY, VOL 28: AUTOMATION: GENOMIC AND FUNCTIONAL ANALYSES, 1999, 28 : 155 - 191
  • [3] LARGE-SCALE AND AUTOMATED DNA-SEQUENCE DETERMINATION
    HUNKAPILLER, T
    KAISER, RJ
    KOOP, BF
    HOOD, L
    SCIENCE, 1991, 254 (5028) : 59 - 67
  • [4] QuaSR: A large-scale automated, distributed testing environment
    Grady, S
    Madhusudan, GS
    Sugiyama, M
    PROCEEDINGS OF THE FOURTH ANNUAL TCL/TK WORKSHOP, 1996 : 61 - 68
  • [5] Automated service provisioning in heterogeneous large-scale environment
    Khalil, A
    Braun, T
    NOMS 2002: IEEE/IFIP NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM: MANAGEMENT SOLUTIONS FOR THE NEW COMMUNICATIONS WORLD, 2002 : 575 - 588
  • [6] EFFICIENT AUTOMATED LARGE-SCALE SEQUENCING OF UNPURIFIED PCR PRODUCT
    TSUI, SKW
    WAYE, MMY
    LEE, CY
    BIOTECHNIQUES, 1995, 19 (04) : 577 - 578
  • [7] An automated sample preparation system for large-scale DNA sequencing
    Marziali, A
    Willis, TD
    Federspiel, NA
    Davis, RW
    GENOME RESEARCH, 1999, 9 (05) : 457 - 462
  • [8] LARGE-SCALE DNA SEQUENCING
    HUNKAPILLER, T
    KAISER, RJ
    KOOP, BF
    HOOD, L
    CURRENT OPINION IN BIOTECHNOLOGY, 1991, 2 (01) : 92 - 101
  • [9] LARGE-SCALE DNA SEQUENCING
    MIDDENDORF, LR
    BRUMBAUGH, JA
    GRONE, DL
    MORGAN, CA
    RUTH, JL
    AMERICAN BIOTECHNOLOGY LABORATORY, 1988, 6 (06) : 14 - 22
  • [10] DNA-SEQUENCE DETERMINATION BY HYBRIDIZATION - A STRATEGY FOR EFFICIENT LARGE-SCALE SEQUENCING
    DRMANAC, R
    DRMANAC, S
    STREZOSKA, Z
    PAUNESKU, T
    LABAT, I
    ZEREMSKI, M
    SNODDY, J
    FUNKHOUSER, WK
    KOOP, B
    HOOD, L
    CRKVENJAKOV, R
    SCIENCE, 1993, 260 (5114) : 1649 - 1653