Rgs-SpMM: Accelerate Sparse Matrix-Matrix Multiplication by Row Group Splitting Strategy on the GPU

被引：0

作者：

Guo, Mingfeng ^{[1
]}

Wang, Yaobin ^{[1
]}

Huang, Jun ^{[1
]}

Wang, Qingfeng ^{[1
]}

Zhang, Yaqing ^{[1
]}

Xu, Mu ^{[2
]}

Lu, Fang ^{[2
]}

机构：

[1] Southwest Univ Sci & Technol, Sch Comp Sci & Technol, Minist Educ, Key Lab Testing Technol Mfg Proc, Mianyang 621010, Sichuan, Peoples R China

[2] Alibaba Grp, Hangzhou, Peoples R China

来源：

NETWORK AND PARALLEL COMPUTING, NPC 2022 | 2022年 / 13615卷

基金：

中国国家自然科学基金;

关键词：

Sparse Matrix-Matrix Multiplication; GPU; Row group splitting;

D O I：

10.1007/978-3-031-21395-3_6

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

The Sparse Matrix-Matrix Multiplication (SpMM) operation is widely used in different fields, especially the recently popular GNN framework. Researchers have designed many kernels on the GPU to accelerate the SpMM operation. Existing methods mostly adopt a row splitting strategy to obtain better parallelism and memory access efficiency. However, due to irregularities of sparse matrices such as short rows with few non-zero elements, current methods suffer from the under-utilization of thread resources in GPU. In this paper, We rearrange the distribution of non-zero elements in the sparse matrix and design the SpMM kernel based on the row group splitting strategy. In contrast to previous methods which assign a "row" task unit to a warp for processing, we combine short rows in a sparse matrix into "row groups" as a task unit, which allocate more appropriate non-zero elements tasks to the GPU resources. This method reduces the thread divergence in a warp and improves load balancing among warps. Our experimental data comes from the SNAP Matrix Collection. The results show that our kernel is faster than cuSPARSE and GE-SpMM, with an average speedup of 1.61 and 1.42 respectively.

引用

页码：61 / 66

页数：6

共 50 条

[21] Scaling sparse matrix-matrix multiplication in the accumulo database
Gunduz Vehbi Demirci
Cevdet Aykanat
Distributed and Parallel Databases, 2020, 38 : 31 - 62
[22] MEMORY-EFFICIENT SPARSE MATRIX-MATRIX MULTIPLICATION BY ROW MERGING ON MANY-CORE ARCHITECTURES
Gremse, Felix
Kuepper, Kerstin
Naumann, Uwe
SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2018, 40 (04): : C429 - C449
[23] Parallel Efficient Sparse Matrix-Matrix Multiplication on Multicore Platforms
Patwary, Md. Mostofa Ali
Satish, Nadathur Rajagopalan
Sundaram, Narayanan
Park, Jongsoo
Anderson, Michael J.
Vadlamudi, Satya Gautam
Das, Dipankar
Pudov, Sergey G.
Pirogov, Vadim O.
Dubey, Pradeep
HIGH PERFORMANCE COMPUTING, ISC HIGH PERFORMANCE 2015, 2015, 9137 : 48 - 57
[24] DeltaSPARSE: High-Performance Sparse General Matrix-Matrix Multiplication on Multi-GPU Systems
Yang, Shuai
Zhang, Changyou
Ma, Ji
2023 IEEE 30TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING, DATA, AND ANALYTICS, HIPC 2023, 2023, : 194 - 202
[25] Design space exploration for sparse matrix-matrix multiplication on FPGAs
Lin, Colin Yu
Wong, Ngai
So, Hayden Kwok-Hay
INTERNATIONAL JOURNAL OF CIRCUIT THEORY AND APPLICATIONS, 2013, 41 (02) : 205 - 219
[26] EXPLOITING MULTIPLE LEVELS OF PARALLELISM IN SPARSE MATRIX-MATRIX MULTIPLICATION
Azad, Ariful
Ballard, Grey
Buluc, Aydin
Demmel, James
Grigori, Laura
Schwartz, Oded
Toledo, Sivan
Williams, Samuel
SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2016, 38 (06): : C624 - C651
[27] PARALLEL SPARSE MATRIX-MATRIX MULTIPLICATION AND INDEXING: IMPLEMENTATION AND EXPERIMENTS
Buluc, Aydin
Gilbert, John R.
SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2012, 34 (04): : C170 - C191
[28] Partitioning Models for Scaling Parallel Sparse Matrix-Matrix Multiplication
Akbudak, Kadir
Selvitopi, Oguz
Aykanat, Cevdet
ACM TRANSACTIONS ON PARALLEL COMPUTING, 2018, 4 (03)
[29] Sparse approximate matrix-matrix multiplication for density matrix purification with error control
Artemov, Anton G.
Rubensson, Emanuel H.
JOURNAL OF COMPUTATIONAL PHYSICS, 2021, 438
[30] MergePath-SpMM: Parallel Sparse Matrix-Matrix Algorithm for Graph Neural Network Acceleration
Shan, Mohsin
Gurevin, Deniz
Nye, Jared
Ding, Caiwen
Khan, Omer
2023 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE, ISPASS, 2023, : 145 - 156

← 1 2 3 4 5 →