Limitations of next-generation genome sequence assembly

被引:0
|
作者
Alkan C. [1 ]
Sajjadian S. [1 ]
Eichler E.E. [1 ]
机构
[1] Department of Genome Sciences, University of Washington School of Medicine, Howard Hughes Medical Institute, Seattle, WA
基金
美国国家卫生研究院;
关键词
D O I
10.1038/nmeth.1527
中图分类号
学科分类号
摘要
High-throughput sequencing technologies promise to transform the fields of genetics and comparative biology by delivering tens of thousands of genomes in the near future. Although it is feasible to construct de novo genome assemblies in a few months, there has been relatively little attention to what is lost by sole application of short sequence reads. We compared the recent de novo assemblies using the short oligonucleotide analysis package (SOAP), generated from the genomes of a Han Chinese individual and a Yoruban individual, to experimentally validated genomic features. We found that de novo assemblies were 16.2% shorter than the reference genome and that 420.2 megabase pairs of common repeats and 99.1% of validated duplicated sequences were missing from the genome. Consequently, over 2,377 coding exons were completely missing. We conclude that high-quality sequencing approaches must be considered in conjunction with high-throughput sequencing for comparative genomics analyses and studies of genome evolution. © 2011 Nature America, Inc. All rights reserved.
引用
收藏
页码:61 / 65
页数:4
相关论文
共 50 条
  • [1] Limitations of next-generation genome sequence assembly
    Alkan, Can
    Sajjadian, Saba
    Eichler, Evan E.
    NATURE METHODS, 2011, 8 (01) : 61 - 65
  • [2] A next-generation human genome sequence
    Church, Deanna M.
    SCIENCE, 2022, 376 (6588) : 34 - 35
  • [3] A GENOME ASSEMBLY PLATFORM FOR NEXT-GENERATION SEQUENCING TECHNOLOGY
    Lu Wenwen
    Lu Zhiyuan
    Wang Yaxu
    Sun Xiao
    IFPT'6: PROGRESS ON POST-GENOME TECHNOLOGIES, PROCEEDINGS, 2009, : 166 - 167
  • [4] The Genome Assembly Model for Next-Generation Sequencing Data
    Wang, Yirong
    Wei, Chengdong
    Zhang, Xiaodong
    Cen, Tailin
    PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON APPLIED MATHEMATICS, MODELLING AND STATISTICS APPLICATION (AMMSA 2017), 2017, 141 : 97 - 101
  • [5] Next-generation genome
    Nature Methods, 2008, 5 : 989 - 989
  • [6] Next-generation genome
    不详
    NATURE METHODS, 2008, 5 (12) : 989 - 989
  • [7] EagleView: A genome assembly viewer for next-generation sequencing technologies
    Huang, Weichun
    Marth, Gabor
    GENOME RESEARCH, 2008, 18 (09) : 1538 - 1543
  • [8] NEXT-GENERATION DNA SEQUENCING FOR DE NOVO GENOME ASSEMBLY
    Hiatt, J.
    Turner, E.
    Patwardhan, R.
    Lee, C.
    Shendure, J.
    JOURNAL OF INVESTIGATIVE MEDICINE, 2009, 57 (01) : 114 - 114
  • [9] PASQUAL: Parallel Techniques for Next Generation Genome Sequence Assembly
    Liu, Xing
    Pande, Pushkar R.
    Meyerhenke, Henning
    Bader, David A.
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2013, 24 (05) : 977 - 986
  • [10] Next-generation transcriptome assembly
    Martin, Jeffrey A.
    Wang, Zhong
    NATURE REVIEWS GENETICS, 2011, 12 (10) : 671 - 682