A hybrid CPU/GPU approach for the parallel algebraic recursive multilevel solver pARMS

被引:0
|
作者
Jamal, Aygul [1 ]
Baboulin, Marc [1 ]
Khabou, Amal [1 ]
Sosonkina, Masha [2 ]
机构
[1] Univ Paris Sud, F-91405 Orsay, France
[2] Old Dominion Univ, Norfolk, VA 23529 USA
基金
美国国家科学基金会;
关键词
SCHEMES; GPU;
D O I
10.1109/SYNASC.2016.63
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We illustrate how the distributed parallel Algebraic Recursive Multilevel Solver based on MPI can be adapted for heterogeneous CPU/GPU architectures. The tasks performed on the GPU are related to the preconditioning of each part of the distributed matrix (local preconditioning) which is handled in the distributed version by each MPI process. The solving step remains on the CPU. In our implementation, the local preconditioning can be based either on the randomization of the last Schur complement system in the multilevel recursive process, or on an Incomplete LU factorization from the MAGMA library. Numerical experiments show that a promising performance improvement can be obtained using either randomized multilevel recursive preconditioning or Incomplete LU preconditioning for large enough matrices. Each preconditioning method ensures a good performance for a given set of matrices.
引用
收藏
页码:411 / 416
页数:6
相关论文
共 50 条
  • [41] PARALLEL BINOMIAL AMERICAN OPTION PRICING ON CPU-GPU HYBRID PLATFORM
    Zhang, Nan
    Lei, Chi-Un
    Man, Ka Lok
    IAENG TRANSACTIONS ON ELECTRICAL ENGINEERING, VOL 1, 2012, : 161 - 174
  • [42] A Parallel Method for Impulsive Image Noise Removal on Hybrid CPU/GPU Systems
    Sanchez, M. G.
    Vidal, V.
    Bataller, J.
    Arnal, J.
    2013 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, 2013, 18 : 2504 - 2507
  • [43] Crom - Massively Parallel, CPU/GPU Hybrid Computation Platform for Visual Effects
    Cournia, Nathan
    Smith, Bradley
    Spitzak, Bill
    Vanover, Casey
    Rijpkema, Hans
    Tomlinson, Josh
    Litke, Nathan
    SIGGRAPH '12: SPECIAL INTEREST GROUP ON COMPUTER GRAPHICS AND INTERACTIVE TECHNIQUES CONFERENCE, 2012,
  • [44] New Multithreaded Hybrid CPU/GPU Approach to Hartree-Fock
    Asadchev, Andrey
    Gordon, Mark S.
    JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2012, 8 (11) : 4166 - 4176
  • [45] A second-order distributed Trotter-Suzuki solver with a hybrid CPU-GPU kernel
    Wittek, Peter
    Cucchietti, Fernando M.
    COMPUTER PHYSICS COMMUNICATIONS, 2013, 184 (04) : 1165 - 1171
  • [46] On a Simplified Approach to Achieve Parallel Performance and Portability Across CPU and GPU Architectures
    Morgan, Nathaniel
    Yenusah, Caleb
    Diaz, Adrian
    Dunning, Daniel
    Moore, Jacob
    Heilman, Erin
    Roth, Calvin
    Lieberman, Evan
    Walton, Steven
    Brown, Sarah
    Holladay, Daniel
    Knezevic, Marko
    Whetstone, Gavin
    Baker, Zachary
    Robey, Robert
    INFORMATION, 2024, 15 (11)
  • [47] Fast parallel Newton-Raphson power flow solver for large number of system calculations with CPU and GPU
    Wang, Zhenqi
    Wende-von Berg, Sebastian
    Braun, Martin
    SUSTAINABLE ENERGY GRIDS & NETWORKS, 2021, 27 (27):
  • [48] A Highly Efficient GPU-CPU Hybrid Parallel Implementation of Sparse LU Factorization
    Liu Li
    Liu Li
    Yang Guangwen
    CHINESE JOURNAL OF ELECTRONICS, 2012, 21 (01): : 7 - 12
  • [49] A Hybrid GPU and CPU Parallel Computing Method to Accelerate Millimeter-Wave Imaging
    Ding, Li
    Dong, Zhaomiao
    He, Huagang
    Zheng, Qibin
    ELECTRONICS, 2023, 12 (04)
  • [50] Parallel Preconditioning and Modular Finite Element Solvers on Hybrid CPU-GPU Systems
    Heuveline, V.
    Lukarski, D.
    Subramanian, C.
    Weiss, J. -P.
    PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED, GRID AND CLOUD COMPUTING FOR ENGINEERING, 2011, 95