High performance CPU/GPU multiresolution Poisson solver

被引:2
|
作者
Van Rees, Wim M. [1 ]
Rossinelli, Diego [1 ]
Hadjidoukas, Panagiotis [1 ]
Koumoutsakos, Petros [1 ]
机构
[1] Swiss Fed Inst Technol, Chair Computat Sci, Zurich, Switzerland
关键词
tree-code; multipole method; GPU; vortex method; multiresolution; SIMULATIONS; FLOW;
D O I
10.3233/978-1-61499-381-0-481
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
We present a multipole-based N-body solver for 3D multiresolution, block-structured grids. The solver is designed for a single heterogeneous CPU/GPU compute node, and evaluates the multipole expansions on the CPU while offloading the compute-heavy particle-particle interactions to the GPU. The regular structure of the destination points is exploited for data parallelism on the CPU, to reduce data transfer to the GPU and to minimize memory accesses during evaluation of the direct and indirect interactions. The algorithmic improvements together with HPC techniques lead to 81% and 96% of the upper bound performance for the CPU and GPU parts, respectively.
引用
收藏
页码:481 / 490
页数:10
相关论文
共 50 条
  • [21] Benchmarking of High Performance Computing Clusters with Heterogeneous CPU/GPU Architecture
    Sukharev, Pavel V.
    Vasilyev, Nikolay P.
    Rovnyagin, Mikhail M.
    Durnov, Maxim A.
    PROCEEDINGS OF THE 2017 IEEE RUSSIA SECTION YOUNG RESEARCHERS IN ELECTRICAL AND ELECTRONIC ENGINEERING CONFERENCE (2017 ELCONRUS), 2017, : 574 - 577
  • [22] A High Performance Implementation of Spectral Clustering on CPU-GPU Platforms
    Jin, Yu
    Jaja, Joseph F.
    2016 IEEE 30TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2016, : 825 - 834
  • [23] High performance iterative solver for linear system using multi GPU
    Ikuno S.
    Fujita N.
    Kawaguchi Y.
    Itoh T.
    Nakata S.
    Watanabe K.
    Nakamura H.
    Plasma and Fusion Research, 2011, 6 (1 SPECIAL ISSUE)
  • [24] Performances of Navier-Stokes Solver on a Hybrid CPU/GPU Computing System
    Alfonsi, Giancarlo
    Ciliberti, Stefania A.
    Mancini, Marco
    Primavera, Leonardo
    PARALLEL COMPUTING TECHNOLOGIES, 2011, 6873 : 404 - 416
  • [25] Improving Dense Linear Equation Solver on Hybrid CPU-GPU System
    Cao, Zhichao
    Xu, Shiming
    Xue, Wei
    Chen, Wenguang
    2009 10TH INTERNATIONAL SYMPOSIUM ON PERVASIVE SYSTEMS, ALGORITHMS, AND NETWORKS (ISPAN 2009), 2009, : 556 - +
  • [26] Hybrid CPU-GPU Solver for Gradient Domain Processing of Massive Images
    Philip, Sujin
    Summa, Brian
    Pascucci, Valerio
    Bremer, Peer-Timo
    2011 IEEE 17TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2011, : 244 - 251
  • [27] An incompressible flow solver on a GPU/CPU heterogeneous architecture parallel computing platform
    Qianqian Li
    Rong Li
    Zixuan Yang
    Theoretical & Applied Mechanics Letters, 2023, 13 (05) : 387 - 393
  • [28] An incompressible flow solver on a GPU/CPU heterogeneous architecture parallel computing platform
    Li, Qianqian
    Li, Rong
    Yang, Zixuan
    THEORETICAL AND APPLIED MECHANICS LETTERS, 2023, 13 (05)
  • [29] Developing a CPU-GPU LES Parallel Solver for Canonical Turbulent Flows
    Nafiseh ZendehAli
    Homayoun Emdad
    Omid Abouali
    Iranian Journal of Science and Technology, Transactions of Mechanical Engineering, 2023, 47 : 1535 - 1551
  • [30] Developing a CPU-GPU LES Parallel Solver for Canonical Turbulent Flows
    ZendehAli, Nafiseh
    Emdad, Homayoun
    Abouali, Omid
    IRANIAN JOURNAL OF SCIENCE AND TECHNOLOGY-TRANSACTIONS OF MECHANICAL ENGINEERING, 2023, 47 (04) : 1535 - 1551