The vast quantities of short-read sequencing data being generated are often exchanged and stored as aligned reads. However, aligned data becomes outdated as new reference genomes and alignment methods become available. Here we describe Bazam, a tool that efficiently extracts the original paired FASTQ from alignment files (BAM or CRAM format) in a format that directly allows efficient realignment. Bazam facilitates up to a 90% reduction in the time for realignment compared to standard methods. Bazam can support selective extraction of read pairs from focused genomic regions for applications such as targeted region analyses, quality control, structural variant calling, and alignment comparisons.
机构:
Nord Univ, Fac Biosci & Aquaculture, Genom Grp, POB 1490, N-8049 Bodo, NorwayNord Univ, Fac Biosci & Aquaculture, Genom Grp, POB 1490, N-8049 Bodo, Norway
Mohideen, Asan M. S. H.
Johansen, Steinar D.
论文数: 0引用数: 0
h-index: 0
机构:
Nord Univ, Fac Biosci & Aquaculture, Genom Grp, POB 1490, N-8049 Bodo, NorwayNord Univ, Fac Biosci & Aquaculture, Genom Grp, POB 1490, N-8049 Bodo, Norway
Johansen, Steinar D.
Babiak, Igor
论文数: 0引用数: 0
h-index: 0
机构:
Nord Univ, Fac Biosci & Aquaculture, Genom Grp, POB 1490, N-8049 Bodo, NorwayNord Univ, Fac Biosci & Aquaculture, Genom Grp, POB 1490, N-8049 Bodo, Norway
机构:
Boston Univ, Sch Med, Div Computat Biomed, Boston, MA 02215 USA
Nationwide Childrens Hosp, Cytogenet Mol Genet Lab, Columbus, OH 43205 USABoston Univ, Sch Med, Div Computat Biomed, Boston, MA 02215 USA
Hong, Changjin
Manimaran, Solaiappan
论文数: 0引用数: 0
h-index: 0
机构:
Boston Univ, Sch Med, Div Computat Biomed, Boston, MA 02215 USABoston Univ, Sch Med, Div Computat Biomed, Boston, MA 02215 USA
Manimaran, Solaiappan
Johnson, William
论文数: 0引用数: 0
h-index: 0
机构:
Boston Univ, Sch Med, Div Computat Biomed, Boston, MA 02215 USABoston Univ, Sch Med, Div Computat Biomed, Boston, MA 02215 USA