Accelerating sparse matrix-matrix multiplication with GPU Tensor Cores

Cited by: 38
Authors
Zachariadis, Orestis [1]
Satpute, Nitin [1]
Gomez-Luna, Juan [2]
Olivares, Joaquin [1]
Affiliations
[1] Univ Cordoba, Dept Elect & Comp Engn, Cordoba, Spain
[2] Swiss Fed Inst Technol, Dept Comp Sci, Zurich, Switzerland
Funding
European Union Horizon 2020
Keywords
Sparse matrix multiplication; GPU; Tensor Cores; Parallel computing; SpGEMM; MANY-CORE;
DOI
10.1016/j.compeleceng.2020.106848
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Sparse general matrix-matrix multiplication (spGEMM) is an essential component in many scientific and data analytics applications. However, the sparsity pattern of the input matrices and the interaction of their patterns make spGEMM challenging. Modern GPUs include Tensor Core Units (TCUs), which specialize in dense matrix multiplication. Our aim is to re-purpose TCUs for sparse matrices. The key idea of our spGEMM algorithm, tSparse, is to multiply sparse rectangular blocks using the mixed precision mode of TCUs. tSparse partitions the input matrices into tiles and operates only on tiles that contain one or more elements. It creates a task list of the tiles and multiplies these tiles using TCUs. To the best of our knowledge, this is the first time that TCUs have been used in the context of spGEMM. We show that spGEMM, with our tiling approach, benefits from TCUs. Our approach significantly improves the performance of spGEMM in comparison to cuSPARSE, CUSP, RMerge2, Nsparse, AC-SpGEMM and spECK.
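The partition, task-list, and multiply steps named in the abstract can be illustrated with a minimal CPU sketch. This is not the tSparse implementation: the tile size `TILE`, the dictionary-based storage, and every function name below are assumptions made for illustration, and a plain dense loop stands in for the Tensor Core call, so the mixed precision mode is not modelled.

```python
# Illustrative sketch only: a CPU stand-in for tile-based spGEMM.
# TILE, the storage layout and all function names are hypothetical.
TILE = 4  # assumed tile edge length


def to_tiles(coo, tile=TILE):
    """Group nonzeros {(row, col): value} into dense tile x tile blocks.

    Only tiles that hold at least one nonzero are created.
    """
    tiles = {}
    for (r, c), v in coo.items():
        if v == 0:
            continue
        key = (r // tile, c // tile)
        block = tiles.setdefault(key, [[0.0] * tile for _ in range(tile)])
        block[r % tile][c % tile] = v
    return tiles


def build_tasks(a_tiles, b_tiles):
    """List (i, k, j) triples where A tile (i, k) and B tile (k, j) both exist."""
    b_cols_by_row = {}
    for (k, j) in b_tiles:
        b_cols_by_row.setdefault(k, []).append(j)
    return [(i, k, j) for (i, k) in a_tiles for j in b_cols_by_row.get(k, ())]


def dense_tile_matmul(x, y, acc):
    """Accumulate x @ y into acc; stands in for the dense Tensor Core multiply."""
    n = len(x)
    for r in range(n):
        for c in range(n):
            acc[r][c] += sum(x[r][m] * y[m][c] for m in range(n))


def tiled_spgemm(a, b, tile=TILE):
    """Multiply two sparse matrices given as {(row, col): value} dictionaries."""
    a_tiles, b_tiles = to_tiles(a, tile), to_tiles(b, tile)
    out_tiles = {}
    for i, k, j in build_tasks(a_tiles, b_tiles):
        acc = out_tiles.setdefault((i, j), [[0.0] * tile for _ in range(tile)])
        dense_tile_matmul(a_tiles[(i, k)], b_tiles[(k, j)], acc)
    # Convert the output tiles back to coordinate form, dropping zeros.
    result = {}
    for (i, j), block in out_tiles.items():
        for r, row in enumerate(block):
            for c, v in enumerate(row):
                if v != 0:
                    result[(i * tile + r, j * tile + c)] = v
    return result


a = {(0, 0): 2.0, (5, 1): 3.0}
b = {(0, 6): 4.0, (1, 1): 5.0}
c = tiled_spgemm(a, b)
```

Here `a` and `b` each occupy two non-empty tiles, so the task list holds four tile pairs, and only those pairs are multiplied. Tile pairs whose product is all zeros still cost a dense multiply in this sketch.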
Pages: 16