CUDA-enabled hierarchical ward clustering of protein structures based on the nearest neighbour chain algorithm

Cited by: 5
Authors
Dang, Hoang-Vu [1]
Schmidt, Bertil [1]
Hildebrandt, Andreas [1]
Tran, Tuan Tu [1]
Hildebrandt, Anna Katharina [2]
Affiliations
[1] Johannes Gutenberg Univ Mainz, Inst Informat, Mainz, Germany
[2] Max Planck Inst Informat, Saarbrucken, Germany
Source
INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS | 2016 / Vol. 30 / Issue 2
Keywords
CUDA; protein structures; clustering; bioinformatics; protein docking; TOOL;
DOI
10.1177/1094342015597988
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Clustering of molecular systems according to their three-dimensional structure is an important step in many bioinformatics workflows. In applications such as docking or structure prediction, many algorithms initially generate large numbers of candidate poses (or decoys), which are then clustered to allow for subsequent computationally expensive evaluations of reasonable representatives. Since the number of such candidates can easily range from thousands to millions, performing the clustering on standard central processing units (CPUs) is highly time consuming. In this paper, we analyse and evaluate different approaches to parallelize the nearest neighbour chain algorithm to perform hierarchical Ward clustering of protein structures, using both atom-based root mean square deviation (RMSD) and rigid-body RMSD molecular distances on a graphics processing unit (GPU). This leads to a speedup of around one order of magnitude of our CUDA implementation on a GeForce Titan GPU compared to a multi-threaded CPU implementation on a Core-i7 2700. Furthermore, the runtimes compare favourably with ClusCo, another state-of-the-art CUDA-enabled protein structure clustering method, while achieving similar accuracy on the iTasser benchmark dataset. Our implementation has also been incorporated into the Biochemical Algorithms library to allow easy integration into biologists' workflows.
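For readers unfamiliar with the nearest neighbour chain algorithm named in the abstract, the following is a minimal sequential sketch in Python. It is not the authors' CUDA implementation and does not reflect the Biochemical Algorithms library API. It assumes a precomputed symmetric matrix of pairwise distances (for example RMSD values) and applies the standard Lance-Williams update for Ward linkage on squared distances. The function name `ward_nn_chain` and the output layout are hypothetical choices made for illustration.

```python
def ward_nn_chain(dist):
    """Ward clustering via the nearest neighbour chain algorithm.

    dist: symmetric n x n list of lists of pairwise distances.
    Returns a list of merges (id_a, id_b, height, new_size); new clusters
    get ids n, n+1, ... in the order they are created. Merges are NOT
    sorted by height; sort them afterwards if a dendrogram is needed.
    """
    n = len(dist)
    # Squared distances, the form the Lance-Williams Ward update expects.
    d = {i: {j: dist[i][j] ** 2 for j in range(n) if j != i} for i in range(n)}
    size = {i: 1 for i in range(n)}
    merges, chain, next_id = [], [], n

    while len(size) > 1:
        if not chain:
            chain.append(min(size))  # arbitrary but deterministic start
        # Grow the chain until two clusters are mutual nearest neighbours.
        while True:
            a = chain[-1]
            prev = chain[-2] if len(chain) > 1 else None
            # Prefer the predecessor on ties so the chain always terminates.
            b = min(d[a], key=lambda c: (d[a][c], c != prev, c))
            if b == prev:
                break
            chain.append(b)

        x, y = chain.pop(), chain.pop()  # mutual nearest neighbours
        dxy, nx, ny = d[x][y], size[x], size[y]
        new, next_id = next_id, next_id + 1

        # Lance-Williams update for Ward linkage.
        d[new] = {}
        for k in size:
            if k == x or k == y:
                continue
            nk = size[k]
            dk = ((nx + nk) * d[k][x] + (ny + nk) * d[k][y] - nk * dxy) / (nx + ny + nk)
            d[new][k] = d[k][new] = dk
            del d[k][x], d[k][y]
        del d[x], d[y], size[x], size[y]
        size[new] = nx + ny
        merges.append((x, y, max(dxy, 0.0) ** 0.5, nx + ny))
    return merges


if __name__ == "__main__":
    pts = [0.0, 1.0, 10.0, 11.0]  # toy 1-D data standing in for structures
    matrix = [[abs(p - q) for q in pts] for p in pts]
    for merge in ward_nn_chain(matrix):
        print(merge)
```

The remainder of the chain stays valid after each merge because Ward linkage is reducible, so the chain is only reset when it runs empty. A GPU version would typically parallelize the nearest neighbour search and the distance update, which this sketch performs serially.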
Pages: 200-211
Page count: 12