BATSRUS GPU: Faster-than-real-time Magnetospheric Simulations with a Block-adaptive Grid Code

被引:0
|
作者
An, Yifu [1 ]
Chen, Yuxi [1 ]
Zhou, Hongyang [2 ]
Gaenko, Alexander [1 ]
Toth, Gabor [1 ]
机构
[1] Univ Michigan, Ann Arbor, MI 48109 USA
[2] Boston Univ, Boston, MA USA
来源
ASTROPHYSICAL JOURNAL | 2025年 / 981卷 / 02期
基金
美国国家科学基金会;
关键词
MODEL; MAGNETOHYDRODYNAMICS;
D O I
10.3847/1538-4357/adaf15
中图分类号
P1 [天文学];
学科分类号
0704 ;
摘要
Block-Adaptive-Tree Solar-wind Roe-type Upwind Scheme (BATSRUS), our state-of-the-art extended magnetohydrodynamic code, is the most used and one of the most resource-consuming models in the Space Weather Modeling Framework. It has always been our objective to improve its efficiency and speed with emerging techniques, such as GPU acceleration. To utilize the GPU nodes on modern supercomputers, we port BATSRUS to GPUs with the OpenACC API. Porting the code to a single GPU requires rewriting and optimizing the most used functionalities of the original code into a new solver, which accounts for around 1% of the entire program in length. To port it to multiple GPUs, we implement a new message-passing algorithm to support its unique block-adaptive grid feature. We conduct weak scaling tests on as many as 256 GPUs and find good performance. The program has 50%-60% parallel efficiency on up to 256 GPUs and up to 95% efficiency within a single node (four GPUs). Running large problems on more than one node has reduced efficiency due to hardware bottlenecks. We also demonstrate our ability to run representative magnetospheric simulations on GPUs. The performance for a single A100 GPU is about the same as 270 AMD "Rome" CPU cores (2.1 128-core nodes), and it runs 3.6 times faster than real time. The simulation can run 6.9 times faster than real time on four A100 GPUs.
引用
收藏
页数:15
相关论文
共 41 条
  • [31] Real-Time and Faster-Than-Real-Time Simulation of Modular Multilevel Converters using standard multi-core CPU and FPGA Chips
    Wang, Can
    Li, Wei
    Belanger, Jean
    39TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY (IECON 2013), 2013, : 5405 - 5411
  • [32] Faster-Than-Real-Time Linear Lane Detection Implementation Using SoC DSP TMS320C6678
    Fan, Rui
    Prokhorov, Victor
    Dahnoun, Nairn
    2016 IEEE INTERNATIONAL CONFERENCE ON IMAGING SYSTEMS AND TECHNIQUES (IST), 2016, : 306 - 311
  • [33] Detailed Multi-Domain Modeling and Faster-Than-Real-Time Hardware Emulation of Small Modular Reactor for EMT Studies
    Chen, Weiran
    Dinavahi, Venkata
    Lin, Ning
    IEEE TRANSACTIONS ON ENERGY CONVERSION, 2024, 39 (03) : 1644 - 1657
  • [34] Faster-Than-Real-Time Power System Transient Stability Simulation using Parallel General Norton with Multiport Equivalent (PGNME)
    Sullivan, Brian
    Shi, Jian
    Mazzola, Mike
    Saravi, Babak
    2017 IEEE POWER & ENERGY SOCIETY GENERAL MEETING, 2017,
  • [35] Damping of Subsynchronous Control Interactions in Large-Scale PV Installations Through Faster-Than-Real-Time Dynamic Emulation
    Cao, Shiqi
    Lin, Ning
    Dinavahi, Venkata
    IEEE ACCESS, 2021, 9 : 128481 - 128493
  • [36] Faster than real-time machine learning within high fidelity Simulations
    Danahy, EE
    Morrison, SA
    35TH ANNUAL SIMULATION SYMPOSIUM, PROCEEDINGS, 2002, : 300 - 307
  • [37] Faster-than-real-time Simulation of Stator-rotor Decoupling Digital Twin of Doubly-fed Induction Generator Based on FPGA
    Chen H.
    Yang Z.
    Ye H.
    Pei W.
    Kai S.
    Gaodianya Jishu/High Voltage Engineering, 2023, 49 (05): : 1819 - 1833
  • [38] Saliency-Maximized Audio Visualization and Efficient Audio-Visual Browsing for Faster-Than-Real-Time Human Acoustic Event Detection
    Lin, Kai-Hsiang
    Zhuang, Xiaodan
    Goudeseune, Camille
    King, Sarah
    Hasegawa-Johnson, Mark
    Huang, Thomas S.
    ACM TRANSACTIONS ON APPLIED PERCEPTION, 2013, 10 (04)
  • [39] From Concept to Field Tests: Accelerated Development of Multi-AUV Missions Using a High-Fidelity Faster-than-Real-Time Simulator
    Player, Timothy R.
    Chakravarty, Arjo
    Zhang, Mabel M.
    Raanan, Ben Yair
    Kieft, Brian
    Zhang, Yanwu
    Hobson, Brett
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 3102 - 3108
  • [40] Machine Learning Based Transient Stability Emulation and Dynamic System Equivalencing of Large-Scale AC-DC Grids for Faster-Than-Real-Time Digital Twin
    Cao, Shiqi
    Dinavahi, Venkata
    Lin, Ning
    IEEE ACCESS, 2022, 10 (112975-112988) : 112975 - 112988