A hybrid CPU/GPU approach for the parallel algebraic recursive multilevel solver pARMS

被引:0
|
作者
Jamal, Aygul [1 ]
Baboulin, Marc [1 ]
Khabou, Amal [1 ]
Sosonkina, Masha [2 ]
机构
[1] Univ Paris Sud, F-91405 Orsay, France
[2] Old Dominion Univ, Norfolk, VA 23529 USA
来源
PROCEEDINGS OF 2016 18TH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING (SYNASC) | 2016年
基金
美国国家科学基金会;
关键词
SCHEMES; GPU;
D O I
10.1109/SYNASC.2016.63
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We illustrate how the distributed parallel Algebraic Recursive Multilevel Solver based on MPI can be adapted for heterogeneous CPU/GPU architectures. The tasks performed on the GPU are related to the preconditioning of each part of the distributed matrix (local preconditioning) which is handled in the distributed version by each MPI process. The solving step remains on the CPU. In our implementation, the local preconditioning can be based either on the randomization of the last Schur complement system in the multilevel recursive process, or on an Incomplete LU factorization from the MAGMA library. Numerical experiments show that a promising performance improvement can be obtained using either randomized multilevel recursive preconditioning or Incomplete LU preconditioning for large enough matrices. Each preconditioning method ensures a good performance for a given set of matrices.
引用
收藏
页码:411 / 416
页数:6
相关论文
共 50 条
  • [31] CPU-GPU Hybrid Parallel Binomial American Option Pricing
    Zhang, Nan
    Lim, Eng Gee
    Man, Ka Lok
    Lei, Chi-Un
    INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTIST, IMECS 2012, VOL II, 2012, : 1157 - 1162
  • [32] Parallel Power Grid Analysis Using Preconditioned GMRES Solver on CPU-GPU Platforms
    Liu, Xue-Xin
    Wang, Hai
    Tan, Sheldon X-D
    2013 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN (ICCAD), 2013, : 561 - 568
  • [33] A CPU-GPU hybrid approach for the unsymmetric multifrontal method
    Yu, Chenhan D.
    Wang, Weichung
    Pierce, Dan'l
    PARALLEL COMPUTING, 2011, 37 (12) : 759 - 770
  • [34] Multilevel algebraic approach for performance analysis of parallel algorithms
    D'Amore, Luisa
    Mele, Valeria
    Romano, Diego
    Laccetti, Giuliano
    Computing and Informatics, 2020, 38 (04) : 817 - 850
  • [35] MULTILEVEL ALGEBRAIC APPROACH FOR PERFORMANCE ANALYSIS OF PARALLEL ALGORITHMS
    D'Amore, Luisa
    Mele, Valeria
    Romano, Diego
    Laccetti, Giuliano
    COMPUTING AND INFORMATICS, 2019, 38 (04) : 817 - 850
  • [36] Performance models and workload distribution algorithms for optimizing a hybrid CPU-GPU multifrontal solver
    Yu, Chenhan D.
    Wang, Weichung
    COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2014, 67 (07) : 1421 - 1437
  • [37] New hybrid CPU-GPU solver for CFD-DEM simulation of fluidized beds
    Norouzi, H. R.
    Zarghami, R.
    Mostoufi, N.
    POWDER TECHNOLOGY, 2017, 316 : 233 - 244
  • [38] Simeuro: A Hybrid CPU-GPU Parallel Simulator for Neuromorphic Computing Chips
    Zhang, Huaipeng
    Ho, Nhut-Minh
    Polat, Dogukan Yigit
    Chen, Peng
    Wahib, Mohamed
    Nguyen, Truong Thao
    Meng, Jintao
    Goh, Rick Siow Mong
    Matsuoka, Satoshi
    Luo, Tao
    Wong, Weng-Fai
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2023, 34 (10) : 2767 - 2782
  • [39] Accelerated Parallel Hybrid GPU/CPU Hash Table Queries with String Keys
    Groth, Tobias
    Groppe, Sven
    Pionteck, Thilo
    Valdiek, Franz
    Koppehel, Martin
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2022, PT II, 2022, 13427 : 191 - 203
  • [40] Higher Order Method of Moments With a Parallel Out-of-Core LU Solver on GPU/CPU Platform
    Mu, Xing
    Zhou, Hou-Xing
    Chen, Kang
    Hong, Wei
    IEEE TRANSACTIONS ON ANTENNAS AND PROPAGATION, 2014, 62 (11) : 5634 - 5646