The vast quantities of short-read sequencing data being generated are often exchanged and stored as aligned reads. However, aligned data becomes outdated as new reference genomes and alignment methods become available. Here we describe Bazam, a tool that efficiently extracts the original paired FASTQ from alignment files (BAM or CRAM format) in a format that directly allows efficient realignment. Bazam facilitates up to a 90% reduction in the time for realignment compared to standard methods. Bazam can support selective extraction of read pairs from focused genomic regions for applications such as targeted region analyses, quality control, structural variant calling, and alignment comparisons.
机构:
Fudan Univ, Sch Life Sci, Dept Biostat & Computat Biol, Shanghai 200433, Peoples R ChinaFudan Univ, Sch Life Sci, Dept Biostat & Computat Biol, Shanghai 200433, Peoples R China
Zhang, Hong
Xu, Jinfeng
论文数: 0引用数: 0
h-index: 0
机构:
NYU, Sch Med, Dept Populat Hlth, Div Biostat, New York, NY 10003 USAFudan Univ, Sch Life Sci, Dept Biostat & Computat Biol, Shanghai 200433, Peoples R China
Xu, Jinfeng
Jiang, Ning
论文数: 0引用数: 0
h-index: 0
机构:
Fudan Univ, Sch Life Sci, Dept Biostat & Computat Biol, Shanghai 200433, Peoples R ChinaFudan Univ, Sch Life Sci, Dept Biostat & Computat Biol, Shanghai 200433, Peoples R China
Jiang, Ning
Hu, Xiaohua
论文数: 0引用数: 0
h-index: 0
机构:
Fudan Univ, Sch Life Sci, Dept Biostat & Computat Biol, Shanghai 200433, Peoples R ChinaFudan Univ, Sch Life Sci, Dept Biostat & Computat Biol, Shanghai 200433, Peoples R China
Hu, Xiaohua
Luo, Zewei
论文数: 0引用数: 0
h-index: 0
机构:
Fudan Univ, Sch Life Sci, Dept Biostat & Computat Biol, Shanghai 200433, Peoples R ChinaFudan Univ, Sch Life Sci, Dept Biostat & Computat Biol, Shanghai 200433, Peoples R China
机构:
King Abdullah Univ Sci & Technol KAUST, Computat Biosci Res Ctr CBRC, Comp Elect & Math Sci & Engn Div CEMSE, Thuwal 239556900, Saudi ArabiaKing Abdullah Univ Sci & Technol KAUST, Computat Biosci Res Ctr CBRC, Comp Elect & Math Sci & Engn Div CEMSE, Thuwal 239556900, Saudi Arabia
Kuwahara, Hiroyuki
Gao, Xin
论文数: 0引用数: 0
h-index: 0
机构:
King Abdullah Univ Sci & Technol KAUST, Computat Biosci Res Ctr CBRC, Comp Elect & Math Sci & Engn Div CEMSE, Thuwal 239556900, Saudi ArabiaKing Abdullah Univ Sci & Technol KAUST, Computat Biosci Res Ctr CBRC, Comp Elect & Math Sci & Engn Div CEMSE, Thuwal 239556900, Saudi Arabia