SIMD-X: Programming and Processing of Graph Algorithms on GPUs

Cited by: 0

Authors:

Liu, Hang [1]

Huang, H. Howie [2]

Affiliations:

[1] Univ Massachusetts, Lowell, MA 01854 USA

[2] George Washington Univ, Washington, DC 20052 USA

Source:

PROCEEDINGS OF THE 2019 USENIX ANNUAL TECHNICAL CONFERENCE | 2019

Funding:

U.S. National Science Foundation

DOI:

Not available

Chinese Library Classification (CLC) number:

TP31 [Computer software]

Subject classification codes:

081202 ; 0835 ;

Abstract:

With high computation power and memory bandwidth, graphics processing units (GPUs) lend themselves to accelerating data-intensive analytics, especially when such applications fit the single instruction multiple data (SIMD) model. However, graph algorithms such as breadth-first search and k-core often fail to take full advantage of GPUs because of irregularity in memory access and control flow. To address this challenge, we have developed SIMD-X, for programming and processing of single instruction multiple, complex, data on GPUs. Specifically, the new Active-Compute-Combine (ACC) model not only makes programming easier, but more importantly creates opportunities for system-level optimizations. To this end, SIMD-X utilizes just-in-time task management, which filters out inactive vertices at runtime and intelligently maps various tasks to different numbers of GPU cores in pursuit of workload balancing. In addition, SIMD-X leverages push-pull based kernel fusion that, with the help of a new deadlock-free global barrier, reduces a large number of computation kernels to very few. Using SIMD-X, a user can program a graph algorithm in tens of lines of code while achieving speedups of 3x, 6x, 24x, and 3x over Gunrock, Galois, CuSha, and Ligra, respectively.
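The abstract names the three ACC steps but does not show code, so the following is only an illustrative CPU-side sketch of how breadth-first search might decompose into "active", "compute", and "combine" functions. It is not SIMD-X's API: the function `acc_bfs`, the adjacency-dictionary input format, and the three inner helpers are all hypothetical names chosen for this example, and the sketch models neither the GPU task mapping nor the kernel fusion described above.

```python
from collections import defaultdict

INF = float("inf")


def acc_bfs(adj, source):
    """Level-synchronous BFS phrased as active / compute / combine steps.

    adj: dict mapping a vertex to a list of its out-neighbors
         (hypothetical input format, assumed for this sketch).
    Returns a dict mapping each vertex to its hop distance from source
    (INF if the vertex is unreachable).
    """
    # Collect every vertex, including those that only appear as neighbors.
    vertices = set(adj) | {v for nbrs in adj.values() for v in nbrs}
    vertices.add(source)
    dist = {v: INF for v in vertices}
    dist[source] = 0
    level = 0

    def active(v):
        # Active: only vertices reached in the previous round do work.
        return dist[v] == level

    def compute(u, v):
        # Compute: candidate value sent along edge (u, v).
        return dist[u] + 1

    def combine(candidates):
        # Combine: merge all candidates that arrive at one vertex.
        return min(candidates)

    while True:
        # Filter out inactive vertices before doing any edge work.
        frontier = [v for v in vertices if active(v)]
        if not frontier:
            break
        inbox = defaultdict(list)
        for u in frontier:
            for v in adj.get(u, ()):
                inbox[v].append(compute(u, v))
        for v, candidates in inbox.items():
            best = combine(candidates)
            if best < dist[v]:
                dist[v] = best
        level += 1
    return dist
```

Calling `acc_bfs({0: [1, 2], 1: [3], 2: [3], 3: []}, 0)` returns hop distances keyed by vertex. The point of the decomposition is that the loop around the three helpers is generic, which is where a runtime could apply frontier filtering and load balancing without changes to user code.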

Pages: 411-427

Page count: 17
