Improving vertex-frontier based GPU breadth-first search

被引:0
|
作者
杨博 [1 ,2 ]
卢凯 [1 ,2 ]
高颖慧 [3 ]
徐凯 [1 ,2 ]
王小平 [1 ,2 ]
程志权 [4 ]
机构
[1] Science and Technology on Parallel and Distributed Processing Laboratory,National University of Defense Technology
[2] College of Computer, National University of Defense Technology
[3] Department of Electronic Science and Engineering, National University of Defense Technology
[4] Avatar Science Company
基金
中国国家自然科学基金; 国家高技术研究发展计划(863计划);
关键词
breadth-first search; GPU; graph traversal; vertex frontier;
D O I
暂无
中图分类号
TP391.41 [];
学科分类号
080203 ;
摘要
Breadth-first search(BFS) is an important kernel for graph traversal and has been used by many graph processing applications. Extensive studies have been devoted in boosting the performance of BFS. As the most effective solution, GPU-acceleration achieves the state-of-the-art result of 3.3×109 traversed edges per second on a NVIDIA Tesla C2050 GPU. A novel vertex frontier based GPU BFS algorithm is proposed, and its main features are three-fold. Firstly, to obtain a better workload balance for irregular graphs, a virtual-queue task decomposition and mapping strategy is introduced for vertex frontier expanding. Secondly, a global deduplicate detection scheme is proposed to remove reduplicative vertices from vertex frontier effectively. Finally, a GPU-based bottom-up BFS approach is employed to process large frontier. The experimental results demonstrate that the algorithm can achieve 10% improvement over the state-of-the-art method on diverse graphs. Especially, it exhibits 2-3 times speedup on low-diameter and scale-free graphs over the state-of-the-art on a NVIDIA Tesla K20 c GPU, reaching a peak traversal rate of 11.2×109 edges/s.
引用
收藏
页码:3828 / 3836
页数:9
相关论文
共 50 条
  • [21] Direction-Optimizing Breadth-First Search
    Beamer, Scott
    Asanovic, Krste
    Patterson, David
    2012 INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SC), 2012,
  • [22] iBFS: Concurrent Breadth-First Search on GPUs
    Liu, Hang
    Huang, H. Howie
    Hu, Yang
    SIGMOD'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2016, : 403 - 416
  • [23] Efficient distributed breadth-first search algorithm
    Makki, SAM
    COMPUTER COMMUNICATIONS, 1996, 19 (08) : 628 - 636
  • [24] A parallel algorithm for the stack breadth-first search
    Nakashima, T
    Fujiwara, A
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2002, E85D (12) : 1955 - 1958
  • [25] Ring perception using breadth-first search
    Figueras, J
    JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1996, 36 (05): : 986 - 991
  • [26] Parallelizability of the stack breadth-first search problem
    Nakashima, T
    Fujiwara, A
    PDPTA'2001: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, 2001, : 722 - 727
  • [27] Efficient Breadth-First Search on a Heterogeneous Processor
    Daga, Mayank
    Nutter, Mark
    Meswani, Mitesh
    2014 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2014, : 373 - 382
  • [28] BREADTH-FIRST SEARCH - SOME SURPRISING RESULTS
    SIKLOSSY, L
    RICH, A
    MARINOV, V
    ARTIFICIAL INTELLIGENCE, 1973, 4 (01) : 1 - 27
  • [29] Maximum Flows by Incremental Breadth-First Search
    Goldberg, Andrew V.
    Hed, Sagi
    Kaplan, Haim
    Tarjan, Robert E.
    Werneck, Renato F.
    ALGORITHMS - ESA 2011, 2011, 6942 : 457 - 468
  • [30] Efficient breadth-first search on the Cell/BE processor
    Scarpazza, Daniele Paolo
    Villa, Oreste
    Petrini, Fabrizio
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2008, 19 (10) : 1381 - 1395