Bazam: a rapid method for read extraction and realignment of high-throughput sequencing data

Cited by: 0

Authors:

Simon P. Sadedin

Alicia Oshlack

Affiliations:

[1] Royal Children’s Hospital, Bioinformatics, Murdoch Children’s Research Institute

[2] Royal Children’s Hospital, Victorian Clinical Genetics Services

[3] University of Melbourne, Department of BioScience

Source:

Genome Biology, 2019, 20 (1)

Abstract:

The vast quantities of short-read sequencing data being generated are often exchanged and stored as aligned reads. However, aligned data becomes outdated as new reference genomes and alignment methods become available. Here we describe Bazam, a tool that efficiently extracts the original paired FASTQ from alignment files (BAM or CRAM format) in a format that directly allows efficient realignment. Bazam facilitates up to a 90% reduction in the time for realignment compared to standard methods. Bazam can support selective extraction of read pairs from focused genomic regions for applications such as targeted region analyses, quality control, structural variant calling, and alignment comparisons.
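
To make the idea of recovering paired FASTQ from aligned reads concrete, the following is a minimal Python sketch under stated assumptions. It is not Bazam's implementation. The `Read` type and the functions `original_orientation`, `fastq_record`, and `interleaved_pairs` are hypothetical names introduced here for illustration, and the input is assumed to be an already-parsed stream of primary alignments (in practice these would come from a BAM or CRAM parser). The sketch buffers each read until its mate appears, restores reverse-strand reads to their original orientation (alignment files store them reverse-complemented), and emits each pair as two adjacent FASTQ records.

```python
from typing import Dict, Iterable, Iterator, NamedTuple


class Read(NamedTuple):
    """Hypothetical, simplified view of one primary alignment record."""
    name: str         # read (fragment) name shared by both mates
    is_read1: bool    # True for the first mate, False for the second
    is_reverse: bool  # True if the read aligned to the reverse strand
    seq: str          # sequence as stored in the alignment file
    qual: str         # quality string as stored in the alignment file


_COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


def original_orientation(read: Read) -> tuple:
    """Undo the reverse-complementing applied to reverse-strand reads."""
    if read.is_reverse:
        return read.seq.translate(_COMPLEMENT)[::-1], read.qual[::-1]
    return read.seq, read.qual


def fastq_record(read: Read, mate_number: int) -> str:
    """Format one read as a four-line FASTQ record."""
    seq, qual = original_orientation(read)
    return f"@{read.name}/{mate_number}\n{seq}\n+\n{qual}\n"


def interleaved_pairs(reads: Iterable[Read]) -> Iterator[str]:
    """Yield interleaved FASTQ text, one complete pair at a time.

    Reads may arrive in any order (for example, sorted by coordinate).
    A read is held in memory only until its mate is seen.
    """
    pending: Dict[str, Read] = {}
    for read in reads:
        mate = pending.pop(read.name, None)
        if mate is None:
            pending[read.name] = read  # wait for the mate
            continue
        first, second = (read, mate) if read.is_read1 else (mate, read)
        yield fastq_record(first, 1) + fastq_record(second, 2)
```

Because each pair is emitted as soon as its second mate is seen, the output can be streamed onward without first sorting the whole file by read name. Reads whose mates never appear remain in `pending` and are not emitted in this sketch.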
