Optimizing Load Balance in a Parallel CFD Code for a Large-scale Turbine Simulation on a Vector Supercomputer

被引:0
|
作者
Watanabe O. [1 ]
Komatsu K. [2 ]
Sato M. [2 ]
Kobayashi H. [2 ]
机构
[1] NEC Corporation, Tokyo
[2] Tohoku University, Sendai
关键词
hybrid parallelization; load balance; MPI; OpenMP; turbine simulation code; vector supercomputer;
D O I
10.14529/js210207
中图分类号
学科分类号
摘要
A turbine for power generation is one of the essential infrastructures in our society. A turbine’s failure causes severe social and economic impacts on our everyday life. Therefore, it is necessary to foresee such failures in advance. However, it is not easy to expect these failures from a real turbine. Hence, it is required to simulate various events occurring in the turbine by numerical simulations of the turbine. A multiphysics CFD code, “Numerical Turbine,” has been developed on vector supercomputer systems for large-scale simulations of unsteady wet steam flows inside a turbine. To solve this problem, the Numerical Turbine code is a block structure code using MPI parallelization, and the calculation space consists of grid blocks of different sizes. Therefore, load imbalance occurs when executing the code in MPI parallelization. This paper creates an estimation model that finds the calculation time from each grid block’s calculation amount and calculation performance. It proposes an OpenMP parallelization method for the load balance of MPI applications. This proposed method reduces the load imbalance by considering the vector performance according to the calculation amount based on the model. Moreover, this proposed method recognizes the need to reduce the load imbalance without pre-execution. The performance evaluation shows that the proposed method improves the load balance from 24.4 % to 9.3 %. © 2021. The Authors. All Rights Reserved.
引用
收藏
页码:114 / 130
页数:16
相关论文
共 50 条
  • [21] Parallel cellular automata for large-scale urban simulation using load-balancing techniques
    Li, Xia
    Zhang, Xiaohu
    Yeh, Anthony
    Liu, Xiaoping
    INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE, 2010, 24 (06) : 803 - 820
  • [22] Genesis: A system for large-scale parallel network simulation
    Szymanski, BK
    Saifee, A
    Sastry, A
    Liu, Y
    Madnani, K
    16TH WORKSHOP ON PARALLEL AND DISTRIBUTED SIMULATION, PROCEEDINGS, 2002, : 89 - 96
  • [23] Massively parallel simulation on large-scale carbon nanotubes
    Tejima, S
    Berebr, S
    Minami, K
    Jimbo, N
    Nakamura, H
    Kanada, Y
    Tamanek, D
    NANOTECH 2003, VOL 3, 2003, : 102 - 105
  • [24] Efficient parallel simulation of large-scale PCS networks
    Boukerche, A
    Das, SK
    Fabbri, A
    Yildiz, O
    TRANSACTIONS OF THE SOCIETY FOR COMPUTER SIMULATION INTERNATIONAL, 1999, 16 (03): : 113 - 125
  • [25] A PARALLEL PARTITIONING METHOD FOR LARGE-SCALE CIRCUIT SIMULATION
    ZHANG, XD
    UNIVERSITY PROGRAMS IN COMPUTER-AIDED ENGINEERING, DESIGN, AND MANUFACTURING, 1989, : 134 - 141
  • [26] A parallel reservoir simulator for large-scale reservoir simulation
    Dogru, AH
    Sunaidi, HA
    Fung, LS
    Habiballah, WA
    Al-Zamel, N
    Li, KG
    SPE RESERVOIR EVALUATION & ENGINEERING, 2002, 5 (01) : 11 - 23
  • [27] Parallel Simulation of Large-scale Microscopic Traffic Networks
    Dai, Wei
    Zhang, Jiachen
    Zhang, Dongliang
    2ND IEEE INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER CONTROL (ICACC 2010), VOL. 3, 2010, : 22 - 28
  • [28] Modeling and Simulation Analysis of Large-scale Smelting Load
    Huang, Xiaoming
    Lou, Boliang
    Huang, Hongyang
    Chen, Daxuan
    Yu, Yiping
    Shen, Fu
    Ju, Ping
    2016 IEEE INTERNATIONAL CONFERENCE ON POWER SYSTEM TECHNOLOGY (POWERCON), 2016,
  • [29] Solving large-scale eigenvalue problems on vector parallel processors
    Harrar, DL
    Osborne, MR
    VECTOR AND PARALLEL PROCESSING - VECPAR'98, 1999, 1573 : 100 - 113
  • [30] Characterizing Load and Communication Imbalance in Large-Scale Parallel Applications
    Boehme, David
    Wolf, Felix
    Geimer, Markus
    2012 IEEE 26TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS & PHD FORUM (IPDPSW), 2012, : 2538 - 2541