Multi-GPU parallelization of shallow water modelling on unstructured meshes

被引:0
|
作者
Dong, Boliang [1 ,2 ,3 ,4 ]
Huang, Bensheng [1 ,2 ]
Tan, Chao [1 ,2 ]
Xia, Junqiang [3 ]
Lin, Kairong [4 ]
Gao, Shuailing [1 ]
Hu, Yong [5 ]
机构
[1] Guangdong Res Inst Water Resources & Hydropower, Guangzhou 510635, Peoples R China
[2] Guangdong Key Lab Hydrodynam Res, Guangzhou 510635, Peoples R China
[3] Wuhan Univ, State Key Lab Water Resources Engn & Management, Wuhan 430072, Peoples R China
[4] Sun Yat sen Univ, Ctr Water Resources & Environm, Sch Civil Engn, Guangdong Key Lab Marine Civil Engn, Guangzhou, Peoples R China
[5] Chinese Acad Sci, Inst Earth Environm, State Key Lab Loess & Quaternary Geol, Xian 710061, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Hydrodynamic model; Parallel computation; Shallow water equation; Multi-GPU acceleration; FINITE-VOLUME METHOD; SIMULATION; FLOWS; EFFICIENT; RAINFALL; SOLVER;
D O I
10.1016/j.jhydrol.2025.133105
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Floods are one of the most devastating natural hazards globally, causing significant loss of life and extensive economic damage. Shallow water equation (SWE) models, due to their clear physical mechanism and good accuracy, can provide detailed predictions of flood behaviour, which are essential for flood risk evaluation and mitigation. However, traditional SWE models face significant limitations in supporting large-scale, long-duration, and high-resolution numerical simulations, which are increasingly demanded by modern applications such as flood forecasting and the establishment of warning systems. In response to the increasing demand for rapid and accurate flood modelling, this study presents a multi-GPU accelerated unstructured mesh SWE model. The proposed model employs MPI-OpenACC method to facilitate multi-GPU parallel computing for hydrodynamic simulations and incorporates a novel asynchronous communication strategy aimed at minimizing the overhead associated with parallel communication. Three representative flood cases were employed to assess the accuracy and efficiency of the proposed model. The results indicated that the speedup of the proposed model reached more than 800 when using eight GPUs in parallel, and the model could simulate a 30 h extreme flood in a 1,300 km2 watershed within 0.35 h. Multi-GPU parallel computing holds great promise for applications in rapid flood simulation and real-time risk assessment.
引用
收藏
页数:19
相关论文
共 50 条
  • [21] Hybrid MPI and CUDA paralleled finite volume unstructured CFD simulations on a multi-GPU system
    Zhang, Xi
    Guo, Xiaohu
    Weng, Yue
    Zhang, Xianwei
    Lu, Yutong
    Zhao, Zhong
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2023, 139 : 1 - 16
  • [22] Cell-vertex discretization of shallow water equations on mixed unstructured meshes
    S. Danilov
    A. Androsov
    Ocean Dynamics, 2015, 65 : 33 - 47
  • [23] The FVC Scheme on Unstructured Meshes for the Two-Dimensional Shallow Water Equations
    Ziggaf, Moussa
    Boubekeur, Mohamed
    Kissami, Imad
    Benkhaldoun, Fayssal
    El Mahi, Imad
    FINITE VOLUMES FOR COMPLEX APPLICATIONS IX-METHODS, THEORETICAL ASPECTS, EXAMPLES, FVCA 9, 2020, 323 : 455 - 465
  • [24] On the well-balanced numerical discretization of shallow water equations on unstructured meshes
    Duran, A.
    Liang, Q.
    Marche, F.
    JOURNAL OF COMPUTATIONAL PHYSICS, 2013, 235 : 565 - 586
  • [25] Cell-vertex discretization of shallow water equations on mixed unstructured meshes
    Danilov, S.
    Androsov, A.
    OCEAN DYNAMICS, 2015, 65 (01) : 33 - 47
  • [26] An immersed boundary method for unstructured meshes in depth averaged shallow water models
    Ouro, Pablo
    Cea, Luis
    Ramirez, Luis
    Nogueira, Xesus
    INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN FLUIDS, 2016, 81 (11) : 672 - 688
  • [27] Asymptotic preserving scheme for the shallow water equations with source terms on unstructured meshes
    Duran, A.
    Marche, F.
    Turpault, R.
    Berthon, C.
    JOURNAL OF COMPUTATIONAL PHYSICS, 2015, 287 : 184 - 206
  • [28] Automatic tuning to performance modelling of matrix polynomials on multicore and multi-GPU systems
    Boratto, Murilo
    Alonso, Pedro
    Gimenez, Domingo
    Lastovetsky, Alexey
    JOURNAL OF SUPERCOMPUTING, 2017, 73 (01): : 227 - 239
  • [29] Toward GPU accelerated topology optimization on unstructured meshes
    Zegard, Tomas
    Paulino, Glaucio H.
    STRUCTURAL AND MULTIDISCIPLINARY OPTIMIZATION, 2013, 48 (03) : 473 - 485
  • [30] Automatic tuning to performance modelling of matrix polynomials on multicore and multi-GPU systems
    Murilo Boratto
    Pedro Alonso
    Domingo Giménez
    Alexey Lastovetsky
    The Journal of Supercomputing, 2017, 73 : 227 - 239