SIMD-X: Programming and Processing of Graph Algorithms on GPUs

被引:0
|
作者
Liu, Hang [1 ]
Huang, H. Howie [2 ]
机构
[1] Univ Massachusetts, Lowell, MA 01854 USA
[2] George Washington Univ, Washington, DC 20052 USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
With high computation power and memory bandwidth, graphics processing units (GPUs) lend themselves to accelerate data-intensive analytics, especially when such applications fit the single instruction multiple data (SIMD) model. However, graph algorithms such as breadth-first search and k-core, often fail to take full advantage of GPUs, due to irregularity in memory access and control flow. To address this challenge, we have developed SIMD-X, for programming and processing of single instruction multiple, complex, data on GPUs. Specifically, the new Active-Compute-Combine (ACC) model not only provides ease of programming to programmers, but more importantly creates opportunities for system-level optimizations. To this end, SIMD-X utilizes just-in-time task management which filters out inactive vertices at runtime and intelligently maps various tasks to different amount of GPU cores in pursuit of workload balancing. In addition, SIMD-X leverages push-pull based kernel fusion that, with the help of a new deadlock-free global barrier, reduces a large number of computation kernels to very few. Using SIMD-X, a user can program a graph algorithm in tens of lines of code, while achieving 3x, 6x, 24x, 3x speedup over Gunrock, Galois, CuSha, and Ligra, respectively.
引用
收藏
页码:411 / 427
页数:17
相关论文
共 50 条
  • [1] Scalable SIMD-Efficient Graph Processing on GPUs
    Khorasani, Farzad
    Gupta, Rajiv
    Bhuyan, Laxmi N.
    2015 INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURE AND COMPILATION (PACT), 2015, : 39 - 50
  • [2] Optimizing Graph Processing on GPUs
    Zhong, Wenyong
    Sun, Jianhua
    Chen, Hao
    Xiao, Jun
    Chen, Zhiwen
    Cheng, Chang
    Shi, Xuanhua
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2017, 28 (04) : 1149 - 1162
  • [3] Graph Processing on GPUs: A Survey
    Shi, Xuanhua
    Zheng, Zhigao
    Zhou, Yongluan
    Jin, Hai
    He, Ligang
    Liu, Bo
    Hua, Qiang-Sheng
    ACM COMPUTING SURVEYS, 2018, 50 (06)
  • [4] C-for-Metal: High Performance SIMD Programming on Intel GPUs
    Lueh, Guei-Yuan
    Chen, Kaiyu
    Chen, Gang
    Fuentes, Joel
    Chen, Wei-Yu
    Fu, Fangwen
    Jiang, Hong
    Li, Hongzheng
    Rhee, Daniel
    CGO '21: PROCEEDINGS OF THE 2021 IEEE/ACM INTERNATIONAL SYMPOSIUM ON CODE GENERATION AND OPTIMIZATION (CGO), 2021, : 289 - 300
  • [5] MultiGraph: Efficient Graph Processing on GPUs
    Hong, Changwan
    Sukumaran-Rajam, Aravind
    Kim, Jinsung
    Sadayappan, P.
    2017 26TH INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES (PACT), 2017, : 27 - 40
  • [6] Graph Processing on GPUs: Where are the Bottlenecks?
    Xu, Qiumin
    Jeon, Hyeran
    Annavaram, Murali
    2014 IEEE INTERNATIONAL SYMPOSIUM ON WORKLOAD CHARACTERIZATION (IISWC), 2014, : 140 - 149
  • [7] Medusa: Simplified Graph Processing on GPUs
    Zhong, Jianlong
    He, Bingsheng
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2014, 25 (06) : 1543 - 1552
  • [8] GraphPEG: Accelerating Graph Processing on GPUs
    Lu, Yashuai
    Guo, Hui
    Huang, Libo
    Yu, Qi
    Shen, Li
    Xiao, Nong
    Wang, Zhiying
    ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2021, 18 (03)
  • [9] Deploying Graph Algorithms on GPUs: an Adaptive Solution
    Li, Da
    Becchi, Michela
    IEEE 27TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2013), 2013, : 1013 - 1024
  • [10] A Compiler for Throughput Optimization of Graph Algorithms on GPUs
    Pai, Sreepathi
    Pingali, Keshav
    ACM SIGPLAN NOTICES, 2016, 51 (10) : 1 - 19