Efficient implementation of Jacobi iterative method for large sparse linear systems on graphic processing units

被引:12
作者
Ahamed, Abal-Kassim Cheik [1 ]
Magoules, Frederic [1 ]
机构
[1] Univ Paris Saclay, Cent Supelec, Grande Voie Vignes, F-92295 Chatenay Malabry, France
关键词
Jacobi method; GPU; Sparse matrices; CSR format; Finite element method; GPU;
D O I
10.1007/s11227-016-1701-3
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, an original Jacobi implementation is considered for the solution of sparse linear systems of equations. The proposed algorithm helps to optimize the parallel implementation on GPU. The performance analysis of GPU-based (using CUDA) algorithm of the implementation of this algorithm is compared to the corresponding serial CPU-based algorithm. Numerical experiments performed on a set of matrices arising from the finite element discretization of various equations (3D Laplace equation, 3D gravitational potential equation, 3D Heat equation) with different meshes, illustrate the performance, robustness and efficiency of our algorithm, with a speed up to 23 in double-precision arithmetics.
引用
收藏
页码:3411 / 3432
页数:22
相关论文
共 38 条
[1]   Energy Consumption Analysis on Graphics Processing Units [J].
Ahamed, Abal-Kassim Cheik ;
Magoules, Frederic .
PROCEEDINGS OF THIRTEENTH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND APPLICATIONS TO BUSINESS, ENGINEERING AND SCIENCE, (DCABES 2014), 2014, :46-50
[2]   Iterative Krylov Methods for Acoustic Problems on Graphics Processing Unit [J].
Ahamed, Abal-Kassim Cheik ;
Magoules, Frederic .
PROCEEDINGS OF THIRTEENTH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND APPLICATIONS TO BUSINESS, ENGINEERING AND SCIENCE, (DCABES 2014), 2014, :19-23
[3]   Parallel Sub-Structuring Methods for solving Sparse Linear Systems on a cluster of GPU [J].
Ahamed, Abal-Kassim Cheik ;
Magoules, Frederic .
2014 IEEE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2014 IEEE 6TH INTL SYMP ON CYBERSPACE SAFETY AND SECURITY, 2014 IEEE 11TH INTL CONF ON EMBEDDED SOFTWARE AND SYST (HPCC,CSS,ICESS), 2014, :121-128
[4]   A Stochastic-Based Optimized Schwarz Method for the Gravimetry Equations on GPU Clusters [J].
Ahamed, Abal-Kassim Cheik ;
Magoules, Frederic .
DOMAIN DECOMPOSITION METHODS IN SCIENCE AND ENGINEERING XXI, 2014, 98 :687-695
[5]   Iterative Krylov Methods for Gravity problems on Graphics Processing Unit [J].
Ahamed, Abal-Kassim Cheik ;
Magoules, Frederic .
2013 12TH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND APPLICATIONS TO BUSINESS, ENGINEERING & SCIENCE (DCABES), 2013, :16-20
[6]   Fast Sparse Matrix-Vector Multiplication on Graphics Processing Unit for Finite Element Analysis [J].
Ahamed, Abal-Kassim Cheik ;
Magoules, Frederic .
2012 IEEE 14TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2012 IEEE 9TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (HPCC-ICESS), 2012, :1307-1314
[7]   Iterative Methods for Sparse Linear Systems on Graphics Processing Unit [J].
Ahamed, Abal-Kassim Cheik ;
Magoules, Frederic .
2012 IEEE 14TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2012 IEEE 9TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (HPCC-ICESS), 2012, :836-842
[8]  
[Anonymous], P 17 ENC PORT COMP G
[9]  
[Anonymous], 2012, ARXIV12106412
[10]  
[Anonymous], 2008, NVIDIA Technical Report NVR-2008-004