Multi-GPU parallelization of shallow water modelling on unstructured meshes

被引:0
|
作者
Dong, Boliang [1 ,2 ,3 ,4 ]
Huang, Bensheng [1 ,2 ]
Tan, Chao [1 ,2 ]
Xia, Junqiang [3 ]
Lin, Kairong [4 ]
Gao, Shuailing [1 ]
Hu, Yong [5 ]
机构
[1] Guangdong Res Inst Water Resources & Hydropower, Guangzhou 510635, Peoples R China
[2] Guangdong Key Lab Hydrodynam Res, Guangzhou 510635, Peoples R China
[3] Wuhan Univ, State Key Lab Water Resources Engn & Management, Wuhan 430072, Peoples R China
[4] Sun Yat sen Univ, Ctr Water Resources & Environm, Sch Civil Engn, Guangdong Key Lab Marine Civil Engn, Guangzhou, Peoples R China
[5] Chinese Acad Sci, Inst Earth Environm, State Key Lab Loess & Quaternary Geol, Xian 710061, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Hydrodynamic model; Parallel computation; Shallow water equation; Multi-GPU acceleration; FINITE-VOLUME METHOD; SIMULATION; FLOWS; EFFICIENT; RAINFALL; SOLVER;
D O I
10.1016/j.jhydrol.2025.133105
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Floods are one of the most devastating natural hazards globally, causing significant loss of life and extensive economic damage. Shallow water equation (SWE) models, due to their clear physical mechanism and good accuracy, can provide detailed predictions of flood behaviour, which are essential for flood risk evaluation and mitigation. However, traditional SWE models face significant limitations in supporting large-scale, long-duration, and high-resolution numerical simulations, which are increasingly demanded by modern applications such as flood forecasting and the establishment of warning systems. In response to the increasing demand for rapid and accurate flood modelling, this study presents a multi-GPU accelerated unstructured mesh SWE model. The proposed model employs MPI-OpenACC method to facilitate multi-GPU parallel computing for hydrodynamic simulations and incorporates a novel asynchronous communication strategy aimed at minimizing the overhead associated with parallel communication. Three representative flood cases were employed to assess the accuracy and efficiency of the proposed model. The results indicated that the speedup of the proposed model reached more than 800 when using eight GPUs in parallel, and the model could simulate a 30 h extreme flood in a 1,300 km2 watershed within 0.35 h. Multi-GPU parallel computing holds great promise for applications in rapid flood simulation and real-time risk assessment.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] Modelling Multi-GPU Systems
    Spampinato, Daniele G.
    Elster, Anne C.
    Natvig, Thorvald
    PARALLEL COMPUTING: FROM MULTICORES AND GPU'S TO PETASCALE, 2010, 19 : 562 - 569
  • [2] A multi-GPU shallow-water simulation with transport of contaminants
    Vinas, M.
    Lobeiras, J.
    Fraguela, B. B.
    Arenaz, M.
    Amor, M.
    Garcia, J. A.
    Castro, M. J.
    Doallo, R.
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2013, 25 (08): : 1153 - 1169
  • [3] The Optimization of Model Parallelization Strategies for Multi-GPU Training
    Zhang, Zechao
    Chen, Jianfeng
    Hu, Bing
    2021 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2021,
  • [4] Multi-GPU Parallelization of the NAS Multi-Zone Parallel Benchmarks
    Gonzalez, Marc
    Morancho, Enric
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2021, 32 (01) : 229 - 241
  • [5] Multi-GPU and Multi-CPU Parallelization for Interactive Physics Simulations
    Hermann, Everton
    Raffin, Bruno
    Faure, Francois
    Gautier, Thierry
    Allard, Jeremie
    EURO-PAR 2010 - PARALLEL PROCESSING, PART II, 2010, 6272 : 235 - 246
  • [6] Optimizing Multi-GPU Parallelization Strategies for Deep Learning Training
    Pal, Saptadeep
    Ebrahimi, Eiman
    Zulfiqar, Arslan
    Fu, Yaosheng
    Zhang, Victor
    Migacz, Szymon
    Nellans, David
    Gupta, Puneet
    IEEE MICRO, 2019, 39 (05) : 91 - 101
  • [7] Problems Related to Parallelization of CFD Algorithms on GPU, Multi-GPU and Hybrid Architectures.
    Blazewicz, Marek
    Kurowski, Krzysztof'
    Ludwiczak, Bogdan
    Napierala, Krystyna
    NUMERICAL ANALYSIS AND APPLIED MATHEMATICS, VOLS I-III, 2010, 1281 : 1301 - 1304
  • [8] Automatic Parallelization of Kernels in Shared-Memory Multi-GPU Nodes
    Cabezas, Javier
    Vilanova, Lluis
    Gelado, Isaac
    Jablin, Thomas B.
    Navarro, Nacho
    Hwu, Wen-mei W.
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON SUPERCOMPUTING (ICS'15), 2015, : 3 - 13
  • [9] WORKLOAD-AWARE AUTOMATIC PARALLELIZATION FOR MULTI-GPU DNN TRAINING
    Shin, Sungho
    Jo, Youngmin
    Choi, Jungwook
    Venkataramani, Swagath
    Srinivasan, Vijayalakshmi
    Sung, Wonyong
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 1453 - 1457
  • [10] Hybrid MPI and CUDA Parallelization for CFD Applications on Multi-GPU HPC Clusters
    Lai, Jianqi
    Yu, Hang
    Tian, Zhengyu
    Li, Hua
    SCIENTIFIC PROGRAMMING, 2020, 2020