GPU-accelerated and pipelined methylation calling

Cited by: 1

Authors:

Feng, Yilin [1]

Akbulut, Gulsum Gudukbay [1]

Tang, Xulong [2]

Gunasekaran, Jashwant Raj [3]

Rahman, Amatur [1]

Medvedev, Paul [1,4,5]

Kandemir, Mahmut [1]

Affiliations:

[1] Penn State Univ, Dept Comp Sci & Engn, University Pk, PA 16802 USA

[2] Univ Pittsburgh, Dept Comp Sci, Pittsburgh, PA 15260 USA

[3] Adobe, Adobe Res, San Jose, CA 95110 USA

[4] Penn State Univ, Dept Biochem & Mol Biol, University Pk, PA 16802 USA

[5] Penn State Univ, Huck Institutes Life Sci, University Pk, PA 16802 USA

Source:

BIOINFORMATICS ADVANCES | 2022 / Volume 2 / Issue 01

Funding:

National Science Foundation (USA);

Keywords:

DOI:

10.1093/bioadv/vbac088

Chinese Library Classification (CLC) number:

Q [Biological Sciences];

Discipline classification code:

07 ; 0710 ; 09 ;

Abstract:

Motivation: Third-generation DNA sequencing technologies, such as Nanopore sequencing, operate at very high speeds and produce longer reads, which makes computational analysis of the resulting massive data challenging. Nanopolish is a software package for signal-level analysis of Oxford Nanopore sequencing data. Its call-methylation module detects methylation using a Hidden Markov Model (HMM). However, Nanopolish is limited by the long running time of some serial and computationally expensive processes. Among these, Adaptive Banded Event Alignment (ABEA) is the most time-consuming step, and prior work, f5c, has already parallelized and optimized ABEA on GPU. As a result, the remaining methylation score calculation, which uses an HMM to determine whether a given base is methylated, has become the new performance bottleneck.

Results: This article focuses on the call-methylation module of the Nanopolish package. We propose Galaxy-methyl, which parallelizes and optimizes the methylation score calculation step on GPU and then pipelines the four steps of the call-methylation module. Galaxy-methyl increases execution concurrency across CPUs and GPUs as well as hardware resource utilization for both. The experimental results indicate that Galaxy-methyl achieves a 3x-5x speedup compared with Nanopolish and reduces the total execution time by 35% compared with f5c, on average.

Availability and implementation: The source code of Galaxy-methyl is available at https://github.com/fengyilin118/.

Pages: 8

