Bazam: a rapid method for read extraction and realignment of high-throughput sequencing data

被引:0
|
作者
Simon P. Sadedin
Alicia Oshlack
机构
[1] Royal Children’s Hospital,Bioinformatics, Murdoch Children’s Research Institute
[2] Royal Children’s Hospital,Victorian Clinical Genetics Services
[3] University of Melbourne,Department of BioScience
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
The vast quantities of short-read sequencing data being generated are often exchanged and stored as aligned reads. However, aligned data becomes outdated as new reference genomes and alignment methods become available. Here we describe Bazam, a tool that efficiently extracts the original paired FASTQ from alignment files (BAM or CRAM format) in a format that directly allows efficient realignment. Bazam facilitates up to a 90% reduction in the time for realignment compared to standard methods. Bazam can support selective extraction of read pairs from focused genomic regions for applications such as targeted region analyses, quality control, structural variant calling, and alignment comparisons.
引用
收藏
相关论文
共 50 条
  • [31] Sample Preservation, DNA or RNA Extraction and Data Analysis for High-Throughput Phytoplankton Community Sequencing
    Maki, Anita
    Salmi, Pauliina
    Mikkonen, Anu
    Kremp, Anke
    Tiirola, Marja
    FRONTIERS IN MICROBIOLOGY, 2017, 8
  • [32] PLNseq: a multivariate Poisson lognormal distribution for high-throughput matched RNA-sequencing read count data
    Zhang, Hong
    Xu, Jinfeng
    Jiang, Ning
    Hu, Xiaohua
    Luo, Zewei
    STATISTICS IN MEDICINE, 2015, 34 (09) : 1577 - 1589
  • [33] Experimental Design-Based Functional Mining and Characterization of High-Throughput Sequencing Data in the Sequence Read Archive
    Nakazato, Takeru
    Ohta, Tazro
    Bono, Hidemasa
    PLOS ONE, 2013, 8 (10):
  • [34] Read Mapping and Transcript Assembly: A Scalable and High-Throughput Workflow for the Processing and Analysis of Ribonucleic Acid Sequencing Data
    Peri, Sateesh
    Roberts, Sarah
    Kreko, Isabella R.
    McHan, Lauren B.
    Naron, Alexandra
    Ram, Archana
    Murphy, Rebecca L.
    Lyons, Eric
    Gregory, Brian D.
    Devisetty, Upendra K.
    Nelson, Andrew D. L.
    FRONTIERS IN GENETICS, 2020, 10
  • [35] S-leaping: an efficient downsampling method for large high-throughput sequencing data
    Kuwahara, Hiroyuki
    Gao, Xin
    BIOINFORMATICS, 2023, 39 (07)
  • [36] High-Throughput Sequencing Technologies
    Reuter, Jason A.
    Spacek, Damek V.
    Snyder, Michael P.
    MOLECULAR CELL, 2015, 58 (04) : 586 - 597
  • [37] High-Throughput Sequencing and Metagenomics
    Jones, William J.
    ESTUARIES AND COASTS, 2010, 33 (04) : 944 - 952
  • [38] High-throughput protein sequencing
    Pham, V
    Tropea, J
    Wong, S
    Quach, J
    Henzel, WJ
    ANALYTICAL CHEMISTRY, 2003, 75 (04) : 875 - 882
  • [39] High-Throughput Sequencing and Metagenomics
    William J. Jones
    Estuaries and Coasts, 2010, 33 : 944 - 952
  • [40] High-throughput DNA extraction method suitable for PCR
    Xin, ZG
    Velten, JP
    Oliver, MJ
    Burke, JJ
    BIOTECHNIQUES, 2003, 34 (04) : 820 - +