High performance CPU/GPU multiresolution Poisson solver

被引:2
|
作者
Van Rees, Wim M. [1 ]
Rossinelli, Diego [1 ]
Hadjidoukas, Panagiotis [1 ]
Koumoutsakos, Petros [1 ]
机构
[1] Swiss Fed Inst Technol, Chair Computat Sci, Zurich, Switzerland
关键词
tree-code; multipole method; GPU; vortex method; multiresolution; SIMULATIONS; FLOW;
D O I
10.3233/978-1-61499-381-0-481
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
We present a multipole-based N-body solver for 3D multiresolution, block-structured grids. The solver is designed for a single heterogeneous CPU/GPU compute node, and evaluates the multipole expansions on the CPU while offloading the compute-heavy particle-particle interactions to the GPU. The regular structure of the destination points is exploited for data parallelism on the CPU, to reduce data transfer to the GPU and to minimize memory accesses during evaluation of the direct and indirect interactions. The algorithmic improvements together with HPC techniques lead to 81% and 96% of the upper bound performance for the CPU and GPU parts, respectively.
引用
收藏
页码:481 / 490
页数:10
相关论文
共 50 条
  • [41] Reducing CPU-GPU Interferences to Improve CPU Performance in Heterogeneous Architectures
    Wen H.
    Zhang W.
    Journal of Computing Science and Engineering, 2020, 16 (04) : 131 - 145
  • [42] Symmetric Tridiagonal Eigenvalue Solver Across CPU Graphics Processing Unit (GPU) Nodes
    Hernandez-Rubio, Erika
    Estrella-Cruz, Alberto
    Meneses-Viveros, Amilcar
    Rivera-Rivera, Jorge Alberto
    Barbosa-Santillan, Liliana Ibeth
    Chapa-Vergara, Sergio Victor
    APPLIED SCIENCES-BASEL, 2024, 14 (22):
  • [43] A customized 3D GPU Poisson solver for free boundary conditions
    Dugan, Nazim
    Genovese, Luigi
    Goedecker, Stefan
    COMPUTER PHYSICS COMMUNICATIONS, 2013, 184 (08) : 1815 - 1820
  • [44] GPU friendly Fast Poisson Solver for Structured Power Grid Network Analysis
    Shi, Jin
    Cai, Yici
    Hou, Wenting
    Ma, Liwei
    Tan, Sheldon X. -D.
    Ho, Pei-Hsin
    Wang, Xiaoyi
    DAC: 2009 46TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, VOLS 1 AND 2, 2009, : 178 - +
  • [45] High performance computing of stiff bubble collapse on CPU-GPU heterogeneous platform
    Dubois, Remy
    da Silva, Eric Goncalves
    Parnaudeau, Philippe
    COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2021, 99 : 246 - 256
  • [46] Design of a simulation model for high performance LINPACK in hybrid CPU-GPU systems
    Hu, Yichang
    Lu, Lu
    JOURNAL OF SUPERCOMPUTING, 2021, 77 (12): : 13739 - 13756
  • [47] High performance computing of stiff bubble collapse on CPU-GPU heterogeneous platform
    Dubois, Remy
    Goncalves da Silva, Eric
    Parnaudeau, Philippe
    Computers and Mathematics with Applications, 2021, 99 : 246 - 256
  • [48] Design of a simulation model for high performance LINPACK in hybrid CPU-GPU systems
    Yichang Hu
    Lu Lu
    The Journal of Supercomputing, 2021, 77 : 13739 - 13756
  • [49] A high-performance matrix-matrix multiplication methodology for CPU and GPU architectures
    Kelefouras, Vasilios
    Kritikakou, A.
    Mporas, Iosif
    Kolonias, Vasilios
    JOURNAL OF SUPERCOMPUTING, 2016, 72 (03): : 804 - 844
  • [50] High-performance hybrid CPU and GPU parallel algorithm for digital volume correlation
    Gates, Mark
    Heath, Michael T.
    Lambros, John
    INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2015, 29 (01): : 92 - 106