EFFICIENT AND SCALABLE ALGORITHMS FOR SMOOTHED PARTICLE HYDRODYNAMICS ON HYBRID SHARED/DISTRIBUTED-MEMORY ARCHITECTURES

Cited by: 11
Author
Gonnet, Pedro [1 ,2 ]
Affiliations
[1] Univ Durham, Sch Engn & Comp Sci, Durham, England
[2] Google Switzerland, CH-8002 Zurich, Switzerland
Source
SIAM JOURNAL ON SCIENTIFIC COMPUTING | 2015 / Vol. 37 / No. 1
Keywords
smoothed particle hydrodynamics; simulation; task-based parallelism; multicores; BINARY SEARCH TREES; MOLECULAR-DYNAMICS; VERLET LISTS; SIMULATION; CODE; SPH;
DOI
10.1137/140964266
Chinese Library Classification (CLC) number
O29 [Applied Mathematics]
Discipline classification code
070104
Abstract
This paper describes a new fast and implicitly parallel approach to neighbor-finding in multiresolution smoothed particle hydrodynamics (SPH) simulations. This new approach is based on hierarchical cell decompositions and sorted interactions, within a task-based formulation. It is shown to be faster than traditional tree-based codes and to scale better than domain decomposition-based approaches on hybrid shared/distributed-memory parallel architectures, e.g., clusters of multicores, achieving a 40x speedup over the Gadget-2 simulation code.
Pages: C95-C121
Page count: 27