SIMD-X: Programming and Processing of Graph Algorithms on GPUs

Cited by: 0
Authors
Liu, Hang [1]
Huang, H. Howie [2]
Affiliations
[1] Univ Massachusetts, Lowell, MA 01854 USA
[2] George Washington Univ, Washington, DC 20052 USA
Funding
National Science Foundation (USA)
Keywords
DOI
Not available
Chinese Library Classification
TP31 [Computer Software]
Discipline classification codes
081202; 0835
Abstract
With high computation power and memory bandwidth, graphics processing units (GPUs) lend themselves to accelerating data-intensive analytics, especially when such applications fit the single instruction multiple data (SIMD) model. However, graph algorithms such as breadth-first search and k-core often fail to take full advantage of GPUs because of irregular memory access and control flow. To address this challenge, we have developed SIMD-X, for programming and processing of single instruction multiple, complex, data on GPUs. Specifically, the new Active-Compute-Combine (ACC) model not only makes programming easier, but more importantly creates opportunities for system-level optimizations. To this end, SIMD-X utilizes just-in-time task management, which filters out inactive vertices at runtime and intelligently maps various tasks to different numbers of GPU cores in pursuit of workload balancing. In addition, SIMD-X leverages push-pull based kernel fusion that, with the help of a new deadlock-free global barrier, reduces a large number of computation kernels to very few. Using SIMD-X, a user can program a graph algorithm in tens of lines of code while achieving speedups of 3x, 6x, 24x, and 3x over Gunrock, Galois, CuSha, and Ligra, respectively.
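The abstract names the three stages of the Active-Compute-Combine (ACC) model but does not give their signatures. As a rough illustration only, the sequential Python sketch below shows one plausible way a user could express breadth-first search with three small callbacks. The driver `acc_run`, the callback signatures, and the helper `bfs_levels` are all hypothetical names introduced here; they are not the SIMD-X API, and the sketch performs no GPU execution, task mapping, or kernel fusion.

```python
from typing import Callable, Dict, List

Graph = Dict[int, List[int]]  # adjacency list: vertex -> out-neighbors
INF = float("inf")


def acc_run(
    graph: Graph,
    state: Dict[int, float],
    frontier: List[int],
    compute: Callable[[int, int, Dict[int, float]], float],
    combine: Callable[[float, float], float],
    active: Callable[[float, float], bool],
) -> Dict[int, float]:
    """Hypothetical sequential driver for an ACC-style loop (an assumption,
    not the paper's implementation)."""
    while frontier:
        updates: Dict[int, float] = {}
        # Compute: each active vertex proposes a value along each out-edge.
        for src in frontier:
            for dst in graph[src]:
                candidate = compute(src, dst, state)
                # Combine: merge all proposals aimed at the same vertex.
                if dst in updates:
                    updates[dst] = combine(updates[dst], candidate)
                else:
                    updates[dst] = candidate
        # Active: keep only vertices whose state actually changes, so
        # inactive vertices are filtered out before the next iteration.
        frontier = []
        for vertex, new_value in updates.items():
            if active(state[vertex], new_value):
                state[vertex] = new_value
                frontier.append(vertex)
    return state


def bfs_levels(graph: Graph, source: int) -> Dict[int, float]:
    """BFS expressed with the three hypothetical callbacks."""
    state = {v: INF for v in graph}
    state[source] = 0
    return acc_run(
        graph,
        state,
        frontier=[source],
        compute=lambda src, dst, st: st[src] + 1,  # level of a neighbor
        combine=min,                               # keep the smallest level
        active=lambda old, new: new < old,         # changed => stays active
    )


if __name__ == "__main__":
    example = {0: [1, 2], 1: [3], 2: [3], 3: [4], 4: []}
    print(bfs_levels(example, 0))
```

In this sketch the user-written part is only the three lambdas in `bfs_levels`, which is loosely in the spirit of the abstract's claim that an algorithm fits in tens of lines, while the loop in `acc_run` stands in for the runtime that a system such as SIMD-X would optimize.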
Pages: 411-427
Page count: 17
Related papers
50 in total
  • [21] Constraint programming and graph algorithms
    Mehlhorn, K
    AUTOMATA LANGUAGES AND PROGRAMMING, 2000, 1853 : 571 - 575
  • [22] Design of a processing element of a SIMD computer for genetic algorithms
    Inoue, T
    Sano, M
    Takahashi, Y
    HIGH PERFORMANCE COMPUTING ON THE INFORMATION SUPERHIGHWAY - HPC ASIA '97, PROCEEDINGS, 1997, : 688 - 691
  • [23] Design of a processing element of a SIMD computer for genetic algorithms
    Univ of Tokushima, Tokushima, Japan
    Proc Conf High Perform Comput Inf Superhighway HPC Asia, (688-691):
  • [25] Reduce, Reuse, and Adapt: Accelerating Graph Processing on GPUs
    Ullas, A.
    Nasre, Rupesh
    Govindarajan, R.
    2023 IEEE 30TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING, DATA, AND ANALYTICS, HIPC 2023, 2023, : 335 - 346
  • [26] A Pattern Based Algorithmic Autotuner for Graph Processing on GPUs
    Meng, Ke
    Li, Jiajia
    Tan, Guangming
    Sun, Ninghui
    PROCEEDINGS OF THE 24TH SYMPOSIUM ON PRINCIPLES AND PRACTICE OF PARALLEL PROGRAMMING (PPOPP '19), 2019, : 201 - 213
  • [27] Puffin: Graph Processing System on Multi-GPUs
    Zhao, Peng
    Luo, Xuan
    Xiao, Jiang
    Shi, Xuanhua
    Jin, Hai
    2017 IEEE 10TH CONFERENCE ON SERVICE-ORIENTED COMPUTING AND APPLICATIONS (SOCA), 2017, : 50 - 57
  • [28] Solving Classification Problems Using Genetic Programming Algorithms on GPUs
    Cano, Alberto
    Zafra, Amelia
    Ventura, Sebastian
    HYBRID ARTIFICIAL INTELLIGENCE SYSTEMS, PT 2, 2010, 6077 : 17 - 26
  • [29] Design and Performance Evaluation of Image Processing Algorithms on GPUs
    Park, In Kyu
    Singhal, Nitin
    Lee, Man Hee
    Cho, Sungdae
    Kim, Chris W.
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2011, 22 (01) : 91 - 104
  • [30] Speeding Up Set Intersections in Graph Algorithms using SIMD Instructions
    Han, Shuo
    Zou, Lei
    Yu, Jeffery Xu
    SIGMOD'18: PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2018, : 1587 - 1602