Multiple-GPU accelerated high-order gas-kinetic scheme for direct numerical simulation of compressible turbulence

Cited by: 5
Authors
Wang, Yuhang [1]
Cao, Guiyu [2]
Pan, Liang [1]
Affiliations
[1] Beijing Normal Univ, Sch Math Sci, Lab Math & Complex Syst, Beijing, Peoples R China
[2] Southern Univ Sci & Technol, Acad Adv Interdisciplinary Studies, Shenzhen, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
High-order gas-kinetic scheme; Direct numerical simulation; Compressible turbulence; Multiple-GPU accelerated computation; BOUNDARY-LAYER; CHANNEL FLOW; DISSIPATION; EFFICIENT; EULER;
DOI
10.1016/j.jcp.2022.111899
Chinese Library Classification
TP39 [Computer applications]
Subject classification codes
081203; 0835
Abstract
The high-order gas-kinetic scheme (HGKS) has become a workable tool for the direct numerical simulation (DNS) of turbulence. In this paper, to accelerate the computation, HGKS is implemented on the graphics processing unit (GPU) using the compute unified device architecture (CUDA). Because the memory available on a single GPU is limited, the computational scale of a single-GPU simulation is constrained. For large-scale DNS of turbulence, we develop a multiple-GPU HGKS simulation using the message passing interface (MPI) and CUDA. Benchmark cases for compressible turbulence, including the Taylor-Green vortex and turbulent channel flows, are presented to assess the numerical performance of HGKS on Nvidia TITAN RTX and Tesla V100 GPUs. For single-GPU computation, compared with the parallel central processing unit (CPU) code running on the Intel Core i7-9700 with open multi-processing (OpenMP) directives, a 7x speedup is achieved by the TITAN RTX and a 16x speedup by the Tesla V100. For multiple-GPU computation, the multiple-GPU accelerated HGKS code scales properly with the increasing number of GPUs. The computational time of the parallel CPU code running on 1024 Intel Xeon E5-2692 cores with MPI is approximately 3 times longer than that of the GPU code using 8 Tesla V100 GPUs with MPI and CUDA. Numerical results confirm the excellent performance of multiple-GPU accelerated HGKS for large-scale DNS of turbulence. Besides reducing memory access pressure, we also exploit single-precision floating-point arithmetic to accelerate HGKS on GPUs. As expected, compared with FP64 computation, FP32 improves the efficiency and reduces the memory cost. Meanwhile, differences in accuracy appear in the statistical turbulent quantities. For turbulent channel flows, the difference in long-time statistical turbulent quantities between the FP32 and FP64 solutions is acceptable, whereas an obvious discrepancy can be observed in instantaneous turbulent quantities, which shows that FP32 precision is not safe for DNS of compressible turbulence. The choice of precision should depend on the required accuracy and the available computational resources. (c) 2022 Elsevier Inc. All rights reserved.
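The abstract's contrast between instantaneous and long-time statistical quantities under FP32 can be illustrated with a toy model. The sketch below is not HGKS and is not taken from the paper: it iterates a chaotic logistic map once in FP64 and once with every operation rounded to FP32, then compares the two orbits pointwise and in the mean. The map, the parameter `R`, the starting value, and the helper names `to_fp32`, `orbit`, and `compare` are all assumptions chosen for illustration.

```python
import struct

R = 3.9  # logistic-map parameter in the chaotic regime (illustrative choice)


def to_fp32(x: float) -> float:
    """Round a Python float (FP64) to the nearest FP32 value."""
    return struct.unpack("f", struct.pack("f", x))[0]


def orbit(x0: float, steps: int, fp32: bool) -> list:
    """Iterate x -> R*x*(1-x), rounding every operation to FP32 if requested."""
    rnd = to_fp32 if fp32 else (lambda v: v)
    r, x = rnd(R), rnd(x0)
    values = []
    for _ in range(steps):
        x = rnd(r * rnd(x * rnd(1.0 - x)))
        values.append(x)
    return values


def compare(x0: float = 0.3, steps: int = 5000):
    """Return (largest pointwise difference, difference of the orbit means)."""
    low = orbit(x0, steps, fp32=True)
    high = orbit(x0, steps, fp32=False)
    max_instant_diff = max(abs(a - b) for a, b in zip(low, high))
    mean_diff = abs(sum(low) / steps - sum(high) / steps)
    return max_instant_diff, mean_diff


if __name__ == "__main__":
    max_instant_diff, mean_diff = compare()
    print(f"largest instantaneous difference: {max_instant_diff:.3f}")
    print(f"difference of long-time means:    {mean_diff:.4f}")
```

In a chaotic system the rounding error of FP32 is amplified until the two orbits are unrelated point by point, while the time-averaged value is much less sensitive. This mirrors, only by analogy, the behaviour the abstract reports for turbulent channel flows.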
Pages: 19