SOAP3-dp: Fast, Accurate and Sensitive GPU-Based Short Read Aligner

被引:81
|
作者
Luo, Ruibang [1 ]
Wong, Thomas [1 ]
Zhu, Jianqiao [1 ,5 ]
Liu, Chi-Man [1 ]
Zhu, Xiaoqian [2 ]
Wu, Edward [1 ]
Lee, Lap-Kei [1 ]
Lin, Haoxiang [3 ]
Zhu, Wenjuan [3 ]
Cheung, David W. [1 ]
Ting, Hing-Fung [1 ]
Yiu, Siu-Ming [1 ]
Peng, Shaoliang [2 ]
Yu, Chang [3 ]
Li, Yingrui [3 ]
Li, Ruiqiang [4 ]
Lam, Tak-Wah [1 ]
机构
[1] Univ Hong Kong, HKU BGI Bioinformat Algorithms & Core Technol Res, Dept Comp Sci, Hong Kong, Hong Kong, Peoples R China
[2] Natl Univ Def Technol, Sch Comp Sci, Changsha, Hunan, Peoples R China
[3] BGI Shenzhen, Shenzhen, Guangdong, Peoples R China
[4] Peking Univ, Peking Tsinghua Ctr Life Sci, Biodynam Opt Imaging Ctr, Sch Life Sci, Beijing 100871, Peoples R China
[5] Univ Wisconsin, Dept Comp Sci, Madison, WI 53706 USA
来源
PLOS ONE | 2013年 / 8卷 / 05期
关键词
ALIGNMENT; SEQUENCE; FRAMEWORK; EFFICIENT; ULTRAFAST; TOOL;
D O I
10.1371/journal.pone.0065632
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
To tackle the exponentially increasing throughput of Next-Generation Sequencing (NGS), most of the existing short-read aligners can be configured to favor speed in trade of accuracy and sensitivity. SOAP3-dp, through leveraging the computational power of both CPU and GPU with optimized algorithms, delivers high speed and sensitivity simultaneously. Compared with widely adopted aligners including BWA, Bowtie2, SeqAlto, CUSHAW2, GEM and GPU-based aligners BarraCUDA and CUSHAW, SOAP3-dp was found to be two to tens of times faster, while maintaining the highest sensitivity and lowest false discovery rate (FDR) on Illumina reads with different lengths. Transcending its predecessor SOAP3, which does not allow gapped alignment, SOAP3-dp by default tolerates alignment similarity as low as 60%. Real data evaluation using human genome demonstrates SOAP3-dp's power to enable more authentic variants and longer Indels to be discovered. Fosmid sequencing shows a 9.1% FDR on newly discovered deletions. SOAP3-dp natively supports BAM file format and provides the same scoring scheme as BWA, which enables it to be integrated into existing analysis pipelines. SOAP3-dp has been deployed on Amazon-EC2, NIH-Biowulf and Tianhe-1A.
引用
收藏
页数:11
相关论文
共 30 条
  • [11] A Novel GPU-Based Fast Monte Carlo Photon Dose Calculating Method for Accurate Radiotherapy Treatment Planning
    Tavakoli, M.
    Karbalaee, M.
    Shahbazi, D.
    MEDICAL PHYSICS, 2017, 44 (06) : 2910 - 2910
  • [12] A Fast and Accurate GPU-Based Proton Transport Monte Carlo Simulation for Validating Proton Therapy Treatment Plans
    Tseung, H. Wan Chan
    Ma, J.
    Beltran, C.
    MEDICAL PHYSICS, 2014, 41 (06) : 410 - 410
  • [13] GPU-based 3D iceball modeling for fast cryoablation simulation and planning
    Golkar, Ehsan
    Rao, Pramod P.
    Joskowicz, Leo
    Gangi, Afshin
    Essert, Caroline
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2019, 14 (09) : 1577 - 1588
  • [14] GPU-based 3D iceball modeling for fast cryoablation simulation and planning
    Ehsan Golkar
    Pramod P. Rao
    Leo Joskowicz
    Afshin Gangi
    Caroline Essert
    International Journal of Computer Assisted Radiology and Surgery, 2019, 14 : 1577 - 1588
  • [15] A Fast Runtime Visualization of a GPU-Based 3D-FDTD Electromagnetic Simulation
    Aoki, Kota
    Dohi, Keisuke
    Shibata, Yuichiro
    Oguri, Kiyoshi
    Fujimoto, Takafumi
    2013 FIRST INTERNATIONAL SYMPOSIUM ON COMPUTING AND NETWORKING (CANDAR), 2013, : 30 - 37
  • [16] Fast 2D-3D registration using GPU-based preprocessing
    Kim, K
    Park, S
    Hong, H
    Shin, YG
    Healthcom 2005: 7th International Workshop on Enterprise Networking and Computing in Healthcare Industry, Proceedings, 2005, : 139 - 143
  • [17] A fast and accurate approach for bankruptcy forecasting using squared logistics loss with GPU-based extreme gradient boosting
    Tuong Le
    Bay Vo
    Fujita, Hamido
    Ngoc-Thanh Nguyen
    Baik, Sung Wook
    INFORMATION SCIENCES, 2019, 494 : 294 - 310
  • [18] GPU-based Online Track Reconstruction for the ALICE TPC in Run 3 with Continuous Read-Out
    Rohr, David
    Gorbunov, Sergey
    Marten, Schmidt Ole
    Shahoyan, Ruben
    23RD INTERNATIONAL CONFERENCE ON COMPUTING IN HIGH ENERGY AND NUCLEAR PHYSICS (CHEP 2018), 2019, 214
  • [19] A Fast and Accurate FPGA System for Short Read Mapping Based on Parallel Comparison on Hash Table
    Sogabe, Yoko
    Maruyama, Tsutomu
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2017, E100D (05): : 1016 - 1025
  • [20] GPU-based fast rendering of large-scale forest scenes for 3D games
    Wang, Huadong
    Xing, Linfen
    Journal of Computational Information Systems, 2015, 11 (02): : 743 - 749