EFFICIENT AND SCALABLE ALGORITHMS FOR SMOOTHED PARTICLE HYDRODYNAMICS ON HYBRID SHARED/DISTRIBUTED-MEMORY ARCHITECTURES

被引:11
|
作者
Gonnet, Pedro [1 ,2 ]
机构
[1] Univ Durham, Sch Engn & Comp Sci, Durham, England
[2] Google Switzerland, CH-8002 Zurich, Switzerland
来源
SIAM JOURNAL ON SCIENTIFIC COMPUTING | 2015年 / 37卷 / 01期
关键词
smoothed particle hydrodynamics; simulation; task-based parallelism; multicores; BINARY SEARCH TREES; MOLECULAR-DYNAMICS; VERLET LISTS; SIMULATION; CODE; SPH;
D O I
10.1137/140964266
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
This paper describes a new fast and implicitly parallel approach to neighbor-finding in multiresolution smoothed particle hydrodynamics (SPH) simulations. This new approach is based on hierarchical cell decompositions and sorted interactions, within a task-based formulation. It is shown to be faster than traditional tree-based codes and to scale better than domain decomposition-based approaches on hybrid shared/distributed-memory parallel architectures, e.g., clusters of multicores, achieving a 40x speedup over the Gadget-2 simulation code.
引用
收藏
页码:C95 / C121
页数:27
相关论文
共 50 条
  • [41] Efficient local smoothed particle hydrodynamics with precomputed patches
    Kanetsuki, Yasutomo
    Wells, John C.
    Nakata, Susumu
    INTERNATIONAL JOURNAL OF COMPUTER MATHEMATICS, 2020, 97 (1-2) : 63 - 71
  • [42] PARALLEL RENDERING OF VOLUMETRIC DATA SET ON DISTRIBUTED-MEMORY ARCHITECTURES
    MONTANI, C
    PEREGO, R
    SCOPIGNO, R
    CONCURRENCY-PRACTICE AND EXPERIENCE, 1993, 5 (02): : 153 - 167
  • [43] ADAPTING FORTRAN 90 ARRAY PROGRAMS FOR DISTRIBUTED-MEMORY ARCHITECTURES
    MERLIN, JH
    LECTURE NOTES IN COMPUTER SCIENCE, 1992, 591 : 184 - 200
  • [44] Parallelizing RRT on Large-Scale Distributed-Memory Architectures
    Devaurs, Didier
    Simeon, Thierry
    Cortes, Juan
    IEEE TRANSACTIONS ON ROBOTICS, 2013, 29 (02) : 571 - 579
  • [45] COMMUNICATION OPTIMIZATIONS FOR IRREGULAR SCIENTIFIC COMPUTATIONS ON DISTRIBUTED-MEMORY ARCHITECTURES
    DAS, R
    UYSAL, M
    SALTZ, J
    HWANG, YS
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1994, 22 (03) : 462 - 478
  • [46] Compiling Affine Loop Nests for Distributed-Memory Parallel Architectures
    Bondhugula, Uday
    2013 INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SC), 2013,
  • [47] EFFICIENT IMPLEMENTATION OF JACOBI ALGORITHMS AND JACOBI SETS ON DISTRIBUTED MEMORY ARCHITECTURES
    EBERLEIN, PJ
    PARK, H
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1990, 8 (04) : 358 - 366
  • [48] Allocating data to distributed-memory multiprocessors by genetic algorithms
    Mansour, Nashat, 1600, John Wiley & Sons Ltd, Chichester, United Kingdom (06):
  • [49] A framework for scalable greedy coloring on distributed-memory parallel computers
    Bozdag, Doruk
    Gebremedhin, Assefaw H.
    Manne, Fredrik
    Boman, Erik G.
    Catalyurek, Umit V.
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2008, 68 (04) : 515 - 535
  • [50] HY-DBSCAN: A hybrid parallel DBSCAN clustering algorithm scalable on distributed-memory computers
    Wu, Guoqing
    Cao, Liqiang
    Tian, Hongyun
    Wang, Wei
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2022, 168 : 57 - 69