A hybrid CPU/GPU approach for the parallel algebraic recursive multilevel solver pARMS

被引:0
|
作者
Jamal, Aygul [1 ]
Baboulin, Marc [1 ]
Khabou, Amal [1 ]
Sosonkina, Masha [2 ]
机构
[1] Univ Paris Sud, F-91405 Orsay, France
[2] Old Dominion Univ, Norfolk, VA 23529 USA
基金
美国国家科学基金会;
关键词
SCHEMES; GPU;
D O I
10.1109/SYNASC.2016.63
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We illustrate how the distributed parallel Algebraic Recursive Multilevel Solver based on MPI can be adapted for heterogeneous CPU/GPU architectures. The tasks performed on the GPU are related to the preconditioning of each part of the distributed matrix (local preconditioning) which is handled in the distributed version by each MPI process. The solving step remains on the CPU. In our implementation, the local preconditioning can be based either on the randomization of the last Schur complement system in the multilevel recursive process, or on an Incomplete LU factorization from the MAGMA library. Numerical experiments show that a promising performance improvement can be obtained using either randomized multilevel recursive preconditioning or Incomplete LU preconditioning for large enough matrices. Each preconditioning method ensures a good performance for a given set of matrices.
引用
收藏
页码:411 / 416
页数:6
相关论文
共 50 条
  • [21] CPU-GPU hybrid parallel strategy for cosmological simulations
    Wang, Yueqing
    Dou, Yong
    Guo, Song
    Lei, Yuanwu
    Zou, Dan
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2014, 26 (03): : 748 - 765
  • [22] Hybrid approach of parallel implementation on CPU-GPU for high-speed ECDSA verification
    Lee, Sokjoon
    Seo, Hwajeong
    Kwon, Hyeokchan
    Yoon, Hyunsoo
    JOURNAL OF SUPERCOMPUTING, 2019, 75 (08): : 4329 - 4349
  • [23] A CPU-GPU Parallel Ant Colony Optimization Solver for the Vehicle Routing Problem
    Rey, Anton
    Prieto, Manuel
    Gomez, J., I
    Tenllado, Christian
    Ignacio Hidalgo, J.
    APPLICATIONS OF EVOLUTIONARY COMPUTATION, EVOAPPLICATIONS 2018, 2018, 10784 : 653 - 667
  • [24] A hybrid CPU/GPU approach for optimizing sorting throughput
    Gowanlock, Michael
    Karsin, Ben
    PARALLEL COMPUTING, 2019, 85 : 45 - 55
  • [25] A comparison of Algebraic Multigrid Bidomain solvers on hybrid CPU-GPU architectures
    Centofanti, Edoardo
    Scacchi, Simone
    COMPUTER METHODS IN APPLIED MECHANICS AND ENGINEERING, 2024, 423
  • [26] A class of new parallel hybrid algebraic multilevel iterations
    Bai, Z.-Z., 2001, Inst. of Computational Mathematics and Sc./Eng. Computing (19):
  • [27] A class of new parallel hybrid algebraic multilevel iterations
    Bai, ZZ
    JOURNAL OF COMPUTATIONAL MATHEMATICS, 2001, 19 (06) : 651 - 672
  • [28] A CLASS OF NEW PARALLEL HYBRID ALGEBRAIC MULTILEVEL ITERATIONS
    Zhong-zhi Bai (LSEC ICMSEC
    JournalofComputationalMathematics, 2001, (06) : 651 - 672
  • [29] Parallel Modulus Operations in RSA Encryption by CPU/GPU Hybrid Computation
    Lin, Chu-Hsing
    Liu, Jung-Chun
    Li, Cheng-Chieh
    Chu, Po-Wei
    2014 NINTH ASIA JOINT CONFERENCE ON INFORMATION SECURITY (ASIA JCIS), 2014, : 71 - 75
  • [30] Parallel Aligning Multiple Metabolic Pathways on Hybrid CPU and GPU Architectures
    Huang, Yiran
    Zhong, Cheng
    Zhang, Jinxiong
    Li, Ye
    Liu, Jun
    PARALLEL ARCHITECTURE, ALGORITHM AND PROGRAMMING, PAAP 2017, 2017, 729 : 483 - 492