A comparison of the shared-memory parallel programming models OpenMP, OpenACC and Kokkos in the context of implicit solvers for high-order FEM

被引:15
|
作者
Eichstadt, Jan [1 ]
Vymazal, Martin [1 ]
Moxey, David [2 ]
Peiro, Joaquim [1 ]
机构
[1] Imperial Coll London, Dept Aeronaut, London, England
[2] Univ Exeter, Coll Engn Math & Phys Sci, Exeter, Devon, England
基金
欧盟地平线“2020”; 英国工程与自然科学研究理事会;
关键词
Shared-memory parallel programming models; OpenMP; OpenACC; Kokkos; Helmholtz equation; FEM; PERFORMANCE; BENCHMARK; FRAMEWORK;
D O I
10.1016/j.cpc.2020.107245
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
We consider the application of three performance-portable programming models in the context of a high-order spectral element, implicit time-stepping solver for the Navier-Stokes equations. We aim to evaluate whether the use of these models allows code developers to deliver high-performance solvers for computational fluid dynamics simulations that are capable of effectively utilising both many-core CPU and GPU architectures. Using the core elliptic solver for the Navier-Stokes equations as a benchmarking guide, we evaluate the performance of these models on a range of unstructured meshes and give guidelines for the translation of existing codebases and their data structures to these models. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:15
相关论文
共 12 条
  • [1] Evaluating performance portability of five shared-memory programming models using a high-order unstructured CFD solver
    Dai, Zhe
    Deng, Liang
    Che, Yonggang
    Li, Ming
    Zhang, Jian
    Wang, Yueqing
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2024, 187
  • [2] A Comparative Study and Evaluation of Parallel Programming Models for Shared-Memory Parallel Architectures
    Luis Miguel Sanchez
    Javier Fernandez
    Rafael Sotomayor
    Soledad Escolar
    J. Daniel. Garcia
    New Generation Computing, 2013, 31 : 139 - 161
  • [3] A Comparative Study and Evaluation of Parallel Programming Models for Shared-Memory Parallel Architectures
    Miguel Sanchez, Luis
    Fernandez, Javier
    Sotomayor, Rafael
    Escolar, Soledad
    Daniel Garcia, J.
    NEW GENERATION COMPUTING, 2013, 31 (03) : 139 - 161
  • [4] COMPARING DISTRIBUTED-MEMORY AND VIRTUAL SHARED-MEMORY PARALLEL PROGRAMMING-MODELS
    KEANE, JA
    GRANT, AJ
    XU, MQ
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF GRID COMPUTING AND ESCIENCE, 1995, 11 (02): : 233 - 243
  • [5] A Methodology Approach to Compare Performance of Parallel Programming Models for Shared-Memory Architectures
    Utrera, Gladys
    Gil, Marisa
    Martorell, Xavier
    NUMERICAL COMPUTATIONS: THEORY AND ALGORITHMS, PT I, 2020, 11973 : 318 - 325
  • [6] A parallel shared-memory implementation of a high-order accurate solution technique for variable coefficient Helmholtz problems
    Beams, Natalie
    Gillman, Adrianna
    Hewett, Russell J.
    COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2020, 79 (04) : 996 - 1011
  • [7] Benchmarking of a distributed-memory, high-order discontinuous finite element flow solver on a shared-memory parallel architecture
    Ali, Amjad
    Farooq, Hamayun
    Shahzadi, Gullnaz
    Umar, Muhammad
    Syed, Khalid Saifullah
    AIP ADVANCES, 2020, 10 (03)
  • [8] MulticoreBSP for C: A High-Performance Library for Shared-Memory Parallel Programming
    A. N. Yzelman
    R. H. Bisseling
    D. Roose
    K. Meerbergen
    International Journal of Parallel Programming, 2014, 42 : 619 - 642
  • [9] MulticoreBSP for C: A High-Performance Library for Shared-Memory Parallel Programming
    Yzelman, A. N.
    Bisseling, R. H.
    Roose, D.
    Meerbergen, K.
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2014, 42 (04) : 619 - 642
  • [10] Performance comparison of shared-memory OR- and AND-parallel logic programming architectures for a common benchmark
    Tick, Evan
    Journal of information processing, 1990, 13 (01) : 62 - 71