共 21 条
CUDA-enabled hierarchical ward clustering of protein structures based on the nearest neighbour chain algorithm
被引:5
|作者:
Dang, Hoang-Vu
[1
]
Schmidt, Bertil
[1
]
Hildebrandt, Andreas
[1
]
Tran, Tuan Tu
[1
]
Hildebrandt, Anna Katharina
[2
]
机构:
[1] Johannes Gutenberg Univ Mainz, Inst Informat, Mainz, Germany
[2] Max Planck Inst Informat, Saarbrucken, Germany
来源:
INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS
|
2016年
/
30卷
/
02期
关键词:
CUDA;
protein structures;
clustering;
bioinformatics;
protein docking;
TOOL;
D O I:
10.1177/1094342015597988
中图分类号:
TP3 [计算技术、计算机技术];
学科分类号:
0812 ;
摘要:
Clustering of molecular systems according to their three-dimensional structure is an important step in many bioinformatics workflows. In applications such as docking or structure prediction, many algorithms initially generate large numbers of candidate poses (or decoys), which are then clustered to allow for subsequent computationally expensive evaluations of reasonable representatives. Since the number of such candidates can easily range from thousands to millions, performing the clustering on standard central processing units (CPUs) is highly time consuming. In this paper, we analyse and evaluate different approaches to parallelize the nearest neighbour chain algorithm to perform hierarchical Ward clustering of protein structures, using both atom-based root mean square deviation (RMSD) and rigid-body RMSD molecular distances on a graphics processing unit (GPU). This leads to a speedup of around one order of magnitude of our CUDA implementation on a GeForce Titan GPU compared to a multi-threaded CPU implementation on a Core-i7 2700. Furthermore, the runtimes compare favourably with ClusCo, another state-of-the-art CUDA-enabled protein structure clustering method, while achieving similar accuracy on the iTasser benchmark dataset. Our implementation has also been incorporated into the Biochemical Algorithms library to allow easy integration into biologists' workflows.
引用
收藏
页码:200 / 211
页数:12
相关论文