Limitations of next-generation genome sequence assembly

Authors
Alkan C. [1]
Sajjadian S. [1]
Eichler E.E. [1]
Affiliations
[1] Department of Genome Sciences, University of Washington School of Medicine, Howard Hughes Medical Institute, Seattle, WA
Funding
National Institutes of Health (NIH), USA
DOI
10.1038/nmeth.1527
Abstract
High-throughput sequencing technologies promise to transform the fields of genetics and comparative biology by delivering tens of thousands of genomes in the near future. Although it is feasible to construct de novo genome assemblies in a few months, there has been relatively little attention to what is lost by sole application of short sequence reads. We compared the recent de novo assemblies using the short oligonucleotide analysis package (SOAP), generated from the genomes of a Han Chinese individual and a Yoruban individual, to experimentally validated genomic features. We found that de novo assemblies were 16.2% shorter than the reference genome and that 420.2 megabase pairs of common repeats and 99.1% of validated duplicated sequences were missing from the genome. Consequently, over 2,377 coding exons were completely missing. We conclude that high-quality sequencing approaches must be considered in conjunction with high-throughput sequencing for comparative genomics analyses and studies of genome evolution. © 2011 Nature America, Inc. All rights reserved.
Pages: 61-65
Page count: 4