SOAP3-dp: Fast, Accurate and Sensitive GPU-Based Short Read Aligner

Cited by: 81
Authors
Luo, Ruibang [1]
Wong, Thomas [1]
Zhu, Jianqiao [1,5]
Liu, Chi-Man [1]
Zhu, Xiaoqian [2]
Wu, Edward [1]
Lee, Lap-Kei [1]
Lin, Haoxiang [3]
Zhu, Wenjuan [3]
Cheung, David W. [1]
Ting, Hing-Fung [1]
Yiu, Siu-Ming [1]
Peng, Shaoliang [2]
Yu, Chang [3]
Li, Yingrui [3]
Li, Ruiqiang [4]
Lam, Tak-Wah [1]
Affiliations
[1] Univ Hong Kong, HKU BGI Bioinformat Algorithms & Core Technol Res, Dept Comp Sci, Hong Kong, Hong Kong, Peoples R China
[2] Natl Univ Def Technol, Sch Comp Sci, Changsha, Hunan, Peoples R China
[3] BGI Shenzhen, Shenzhen, Guangdong, Peoples R China
[4] Peking Univ, Peking Tsinghua Ctr Life Sci, Biodynam Opt Imaging Ctr, Sch Life Sci, Beijing 100871, Peoples R China
[5] Univ Wisconsin, Dept Comp Sci, Madison, WI 53706 USA
Source
PLOS ONE | 2013 / Vol. 8 / Issue 05
Keywords
ALIGNMENT; SEQUENCE; FRAMEWORK; EFFICIENT; ULTRAFAST; TOOL;
DOI
10.1371/journal.pone.0065632
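Chinese Library Classification (CLC) number
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences];
Discipline classification code
07; 0710; 09
Abstract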
To tackle the exponentially increasing throughput of Next-Generation Sequencing (NGS), most existing short-read aligners can be configured to favor speed at the expense of accuracy and sensitivity. SOAP3-dp, by leveraging the computational power of both CPU and GPU with optimized algorithms, delivers high speed and sensitivity simultaneously. Compared with widely adopted aligners including BWA, Bowtie2, SeqAlto, CUSHAW2 and GEM, and the GPU-based aligners BarraCUDA and CUSHAW, SOAP3-dp was found to be two to tens of times faster, while maintaining the highest sensitivity and lowest false discovery rate (FDR) on Illumina reads of different lengths. Unlike its predecessor SOAP3, which does not allow gapped alignment, SOAP3-dp by default tolerates alignment similarity as low as 60%. Evaluation on real human genome data demonstrates that SOAP3-dp enables more authentic variants and longer indels to be discovered. Fosmid sequencing shows a 9.1% FDR on newly discovered deletions. SOAP3-dp natively supports the BAM file format and provides the same scoring scheme as BWA, which enables it to be integrated into existing analysis pipelines. SOAP3-dp has been deployed on Amazon-EC2, NIH-Biowulf and Tianhe-1A.
Pages: 11