GPU-accelerated and pipelined methylation calling

被引:1
|
作者
Feng, Yilin [1 ]
Akbulut, Gulsum Gudukbay [1 ]
Tang, Xulong [2 ]
Gunasekaran, Jashwant Raj [3 ]
Rahman, Amatur [1 ]
Medvedev, Paul [1 ,4 ,5 ]
Kandemir, Mahmut [1 ]
机构
[1] Penn State Univ, Dept Comp Sci & Engn, University Pk, PA 16802 USA
[2] Univ Pittsburgh, Dept Comp Sci, Pittsburgh, PA 15260 USA
[3] Adobe, Adobe Res, San Jose, CA 95110 USA
[4] Penn State Univ, Dept Biochem & Mol Biol, University Pk, PA 16802 USA
[5] Penn State Univ, HuckInstitutes Life Sci, University Pk, PA 16802 USA
来源
BIOINFORMATICS ADVANCES | 2022年 / 2卷 / 01期
基金
美国国家科学基金会;
关键词
D O I
10.1093/bioadv/vbac088
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Motivation The third-generation DNA sequencing technologies, such as Nanopore Sequencing, can operate at very high speeds and produce longer reads, which in turn results in a challenge for the computational analysis of such massive data. Nanopolish is a software package for signal-level analysis of Oxford Nanopore sequencing data. Call-methylation module of Nanopolish can detect methylation based on Hidden Markov Model (HMM). However, Nanopolish is limited by the long running time of some serial and computationally expensive processes. Among these, Adaptive Banded Event Alignment (ABEA) is the most time-consuming step, and the prior work, f5c, has already parallelized and optimized ABEA on GPU. As a result, the remaining methylation score calculation part, which uses HMM to identify if a given base is methylated or not, has become the new performance bottleneck.Results This article focuses on the call-methylation module that resides in the Nanopolish package. We propose Galaxy-methyl, which parallelizes and optimizes the methylation score calculation step on GPU and then pipelines the four steps of the call-methylation module. Galaxy-methyl increases the execution concurrency across CPUs and GPUs as well as hardware resource utilization for both. The experimental results collected indicate that Galaxy-methyl can achieve 3x-5x speedup compared with Nanopolish, and reduce the total execution time by 35% compared with f5c, on average.Availability and implementation The source code of Galaxy-methyl is available at https://github.com/fengyilin118/.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] GPU-accelerated connectome discovery at scale
    Varsha Sreenivasan
    Sawan Kumar
    Franco Pestilli
    Partha Talukdar
    Devarajan Sridharan
    Nature Computational Science, 2022, 2 : 298 - 306
  • [42] GPU-Accelerated BFS for Dynamic Networks
    Ziche, Filippo
    Bombieri, Nicola
    Busato, Federico
    Giugno, Rosalba
    EURO-PAR 2024: PARALLEL PROCESSING, PT III, EURO-PAR 2024, 2024, 14803 : 74 - 87
  • [43] GaccO - A GPU-accelerated OLTP DBMS
    Boeschen, Nils
    Binnig, Carsten
    PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA (SIGMOD '22), 2022, : 1003 - 1016
  • [44] GPU-Accelerated Coupled Ptychographic Tomography
    Achilles, Silvio
    Ehrig, Simeon
    Hoffmann, Nico
    Kahnt, Maik
    Becher, Johannes
    Fam, Yakub
    Sheppard, Thomas
    Brueckner, Dennis
    Schropp, Andreas
    Schroer, Christian G.
    DEVELOPMENTS IN X-RAY TOMOGRAPHY XIV, 2022, 12242
  • [45] GPUNFV: a GPU-Accelerated NFV System
    Yi, Xiaodong
    Duan, Jingpu
    Wu, Chuan
    PROCEEDINGS OF THE 2017 ASIA-PACIFIC WORKSHOP ON NETWORKING (APNET '17), 2017, : 85 - 91
  • [46] GPU-Accelerated LOD Generation for Point Clouds
    Schuetz, Markus
    Kerbl, Bernhard
    Klaus, Philip
    Wimmer, Michael
    COMPUTER GRAPHICS FORUM, 2023, 42 (08)
  • [47] GPU-accelerated level-set segmentation
    Julián Lamas-Rodríguez
    Dora B. Heras
    Francisco Argüello
    Dagmar Kainmueller
    Stefan Zachow
    Montserrat Bóo
    Journal of Real-Time Image Processing, 2016, 12 : 15 - 29
  • [48] GPU-accelerated string matching for database applications
    Evangelia A. Sitaridi
    Kenneth A. Ross
    The VLDB Journal, 2016, 25 : 719 - 740
  • [49] A Performance Model for GPU-Accelerated FDTD Applications
    Baumeister, Paul F.
    Hater, Thorsten
    Kraus, Jiri
    Pleiter, Dirk
    Wahl, Pierre
    2015 IEEE 22ND INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING (HIPC), 2015, : 185 - 193
  • [50] GPApriori: GPU-Accelerated Frequent Itemset Mining
    Zhang, Fan
    Zhang, Yan
    Bakos, Jason
    2011 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2011, : 590 - 594