BATSRUS GPU: Faster-than-real-time Magnetospheric Simulations with a Block-adaptive Grid Code

被引:0
|
作者
An, Yifu [1 ]
Chen, Yuxi [1 ]
Zhou, Hongyang [2 ]
Gaenko, Alexander [1 ]
Toth, Gabor [1 ]
机构
[1] Univ Michigan, Ann Arbor, MI 48109 USA
[2] Boston Univ, Boston, MA USA
来源
ASTROPHYSICAL JOURNAL | 2025年 / 981卷 / 02期
基金
美国国家科学基金会;
关键词
MODEL; MAGNETOHYDRODYNAMICS;
D O I
10.3847/1538-4357/adaf15
中图分类号
P1 [天文学];
学科分类号
0704 ;
摘要
Block-Adaptive-Tree Solar-wind Roe-type Upwind Scheme (BATSRUS), our state-of-the-art extended magnetohydrodynamic code, is the most used and one of the most resource-consuming models in the Space Weather Modeling Framework. It has always been our objective to improve its efficiency and speed with emerging techniques, such as GPU acceleration. To utilize the GPU nodes on modern supercomputers, we port BATSRUS to GPUs with the OpenACC API. Porting the code to a single GPU requires rewriting and optimizing the most used functionalities of the original code into a new solver, which accounts for around 1% of the entire program in length. To port it to multiple GPUs, we implement a new message-passing algorithm to support its unique block-adaptive grid feature. We conduct weak scaling tests on as many as 256 GPUs and find good performance. The program has 50%-60% parallel efficiency on up to 256 GPUs and up to 95% efficiency within a single node (four GPUs). Running large problems on more than one node has reduced efficiency due to hardware bottlenecks. We also demonstrate our ability to run representative magnetospheric simulations on GPUs. The performance for a single A100 GPU is about the same as 270 AMD "Rome" CPU cores (2.1 128-core nodes), and it runs 3.6 times faster than real time. The simulation can run 6.9 times faster than real time on four A100 GPUs.
引用
收藏
页数:15
相关论文
共 41 条