Fast algorithm for parallel solving inversion of large scale small matrices based on GPU

被引:0
|
作者
Jin Xuebin
Chen Yewang
Fan Wentao
Zhang Yong
Du Jixiang
机构
[1] Huaqiao University,The College of Computer Science and Technology
[2] Huaqiao University,Fujian Key Laboratory of Big Data Intelligence and Security
[3] Huaqiao University,Xiamen Key Laboratory of Computer Vision and Pattern Recognition
[4] Huaqiao University,College of Mechanical Engineering and Automation
来源
The Journal of Supercomputing | 2023年 / 79卷
关键词
GPU acceleration; Matrix inversion; A large number of small matrices; High performance computing; CUDA;
D O I
暂无
中图分类号
学科分类号
摘要
Inverting a matrix is time-consuming, and many works focus on accelerating the inversion of a single large matrix by GPU. However, the problem of parallelizing the inversion of a large number of small matrices has received little attention. These problems are widely applied in computer science, including accelerating cryptographic algorithms and image processing algorithms. In this paper, we propose a Revised In-Place Inversion algorithm for inverting a large number of small matrices on the CUDA platform, which adopts a more refined parallelization scheme and outperforms other algorithms, achieving a speedup of up to 20.9572 times over the batch matrix inverse kernel in CUBLAS. Additionally, we found that there is an upper bound on the input data size for each GPU device, and the performance will degrade if the input data size is too large. Therefore, we propose the Saturation Size Curve based on this finding to divide matrices into batches and improve the algorithm performance. Experimental results show that this strategy increases the algorithm’s performance by 1.75 times and effectively alleviates the problem of performance degradation.
引用
收藏
页码:18313 / 18339
页数:26
相关论文
共 50 条
  • [11] A FAST ALGORITHM FOR SOLVING LARGE-SCALE MEAN-VARIANCE MODELS BY COMPACT FACTORIZATION OF COVARIANCE MATRICES
    KONNO, H
    SUZUKI, K
    JOURNAL OF THE OPERATIONS RESEARCH SOCIETY OF JAPAN, 1992, 35 (01) : 93 - 104
  • [12] ALGORITHM FOR INVERSION OF LARGE MATRICES BY ITERATION
    WIRGIN, A
    ELECTRONICS LETTERS, 1969, 5 (24) : 627 - &
  • [13] Parallel fast algorithm for large-scale electromagnetic scattering
    Wu, F
    Zhang, YJ
    Oo, ZZ
    Li, EP
    SEVENTH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND GRID IN ASIA PACIFIC REGION, PROCEEDINGS, 2004, : 188 - 194
  • [14] GPU Based Parallel Matrix Exponential Algorithm for Large Scale Power System Electromagnetic Transient Simulation
    Zhao, Jinli
    Liu, Juntao
    Li, Peng
    Fu, Xiaopeng
    Song, Guanyu
    Wang, Chengshan
    2016 IEEE INNOVATIVE SMART GRID TECHNOLOGIES - ASIA (ISGT-ASIA), 2016, : 110 - 114
  • [15] A fast algorithm for the inversion of block Toeplitz matrices
    Davis, G
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1995, 43 (12) : 3022 - 3025
  • [16] A fast algorithm for the inversion of general Toeplitz matrices
    Martinsson, PG
    Rokhlin, V
    Tygert, M
    COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2005, 50 (5-6) : 741 - 752
  • [17] A new parallel algorithm for solving large-scale Markov chains
    Touzene, Abderezak
    JOURNAL OF SUPERCOMPUTING, 2014, 67 (01): : 239 - 253
  • [18] A new parallel algorithm for solving large-scale Markov chains
    Abderezak Touzene
    The Journal of Supercomputing, 2014, 67 : 239 - 253
  • [19] A GaBP-GPU algorithm of solving large-scale sparse linear systems
    Zheng, Hanyuan
    Song, Anping
    Liu, Zhixiang
    Xu, Lei
    Zhang, Wu
    Journal of Information and Computational Science, 2014, 11 (03): : 911 - 921
  • [20] A fast calculation method for large-scale shell structure based on multigird method and GPU parallel computing
    State Key Laboratory of Advanced Design and Manufacturing for Vehicle Body, Hunan University, Changsha 410082, China
    Gongcheng Lixue, 5 (20-26):