High-performance Shallow Water Model for Use on Massively Parallel and Heterogeneous Computing Systems

被引:0
|
作者
Chaplygin A.V. [1 ]
Gusev A.V. [1 ,2 ,3 ]
Diansky N.A. [1 ,3 ,4 ]
机构
[1] Marchuk Institute of Numerical Mathematics of the Russian Academy of Sciences, Moscow
[2] P.P. Shirshov Institute of Oceanology of the Russian Academy of Sciences, Moscow
[3] N.N. Zubov State Oceanographic Institute, Moscow
[4] Lomonosov Moscow State University, Moscow
基金
俄罗斯基础研究基金会;
关键词
Cuda; Heterogeneous computing systems; Mpi; Openmp; Shallow water; Supercomputer modeling;
D O I
10.14529/JSFI210407
中图分类号
学科分类号
摘要
This paper presents the shallow water model, formulated from the ocean general circulation sigma model INMOM (Institute of Numerical Mathematics Ocean Model). The shallow water model is based on software architecture, which separates the physics-related code from parallel implementation features, thereby simplifying the model’s support and development. As an improvement of the two-dimensional domain decomposition method, we present the blocked-based decomposition proposing load-balanced and cache-friendly calculations on CPUs. We propose various hybrid parallel programming patterns in the shallow water model for effective calculation on massively parallel and heterogeneous computing systems and evaluate their scaling performances on the Lomonosov-2 supercomputer. We demonstrate that performance per a single grid point on GPUs dramatically decreases for small grid sizes starting from 219 points per node, while performance on CPUs scales up to 217 well. Although, calculations on GPUs outperform calculations on CPUs by a factor of 4.7 at 30 nodes using 60 GPUs and 360 CPU cores at 6100 × 4460 grid size. We demonstrate that overlapping kernel execution with data transfers on GPUs increases performance by 28%. Furthermore, we demonstrate the advantage of using the load-balancing method in the Azov Sea model on CPUs and GPUs. © The Authors 2021. This paper is published with open access at SuperFri.org
引用
收藏
页码:74 / 93
页数:19
相关论文
共 50 条
  • [41] High-performance parallel computing for stiffness equation of FEM
    Nippon Kikai Gakkai Ronbunshu A Hen, 603 (2468-2473):
  • [42] A high performance parallel DCT with OpenCL on heterogeneous computing environment
    Cheong Ghil Kim
    Yong Soo Choi
    Multimedia Tools and Applications, 2013, 64 : 475 - 489
  • [43] A high performance parallel DCT with OpenCL on heterogeneous computing environment
    Kim, Cheong Ghil
    Choi, Yong Soo
    MULTIMEDIA TOOLS AND APPLICATIONS, 2013, 64 (02) : 475 - 489
  • [44] Fully Parallel Optimization of Coordinated Electricity and Natural Gas Systems on High-Performance Computing
    Gong, Lin
    Peng, Yehong
    Zhang, Chenxu
    Fu, Yong
    IEEE TRANSACTIONS ON SMART GRID, 2023, 14 (05) : 3499 - 3511
  • [45] Survey of Methodologies, Approaches, and Challenges in Parallel Programming Using High-Performance Computing Systems
    Czarnul, Pawel
    Proficz, Jerzy
    Drypczewski, Krzysztof
    SCIENTIFIC PROGRAMMING, 2020, 2020
  • [46] Thoughts on Massively-Parallel Heterogeneous Computing for Solving Large Problems
    Hwu, Wen-mei
    Hidayetoglu, Mert
    Chew, Weng Cho
    Pearson, Carl
    Garcia, Simon
    Huang, Sitao
    Dakkak, Abdul
    2017 COMPUTING AND ELECTROMAGNETICS INTERNATIONAL WORKSHOP (CEM'17), 2017, : 67 - 68
  • [47] High-performance computing in water resources hydrodynamics
    Morales-Hernandez, M.
    Sharif, M. B.
    Gangrade, S.
    Dullo, T. T.
    Kao, S-C
    Kalyanapu, A.
    Ghafoor, S. K.
    Evans, K. J.
    Madadi-Kandjani, E.
    Hodges, B. R.
    JOURNAL OF HYDROINFORMATICS, 2020, 22 (05) : 1217 - 1235
  • [48] Computational performance of heterogeneous ensemble frameworks on high-performance computing platforms
    Wang, Linhua
    Timsina, Prem
    Pandey, Gaurav
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 2843 - 2850
  • [49] Parallel application-level behavioral attributes for performance and energy management of high-performance computing systems
    Jeffrey J. Evans
    Charles E. Lucas
    Cluster Computing, 2013, 16 : 91 - 115
  • [50] Parallel application-level behavioral attributes for performance and energy management of high-performance computing systems
    Evans, Jeffrey J.
    Lucas, Charles E.
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2013, 16 (01): : 91 - 115