EFFICIENT AND SCALABLE ALGORITHMS FOR SMOOTHED PARTICLE HYDRODYNAMICS ON HYBRID SHARED/DISTRIBUTED-MEMORY ARCHITECTURES

Cited by: 11
Authors
Gonnet, Pedro [1, 2]
Affiliations
[1] Univ Durham, Sch Engn & Comp Sci, Durham, England
[2] Google Switzerland, CH-8002 Zurich, Switzerland
Source
SIAM JOURNAL ON SCIENTIFIC COMPUTING | 2015 / Vol. 37 / No. 1
Keywords
smoothed particle hydrodynamics; simulation; task-based parallelism; multicores; BINARY SEARCH TREES; MOLECULAR-DYNAMICS; VERLET LISTS; SIMULATION; CODE; SPH;
DOI
10.1137/140964266
Chinese Library Classification (CLC) number
O29 [Applied Mathematics];
Discipline classification code
070104;
Abstract
This paper describes a new fast and implicitly parallel approach to neighbor-finding in multiresolution smoothed particle hydrodynamics (SPH) simulations. This new approach is based on hierarchical cell decompositions and sorted interactions, within a task-based formulation. It is shown to be faster than traditional tree-based codes and to scale better than domain decomposition-based approaches on hybrid shared/distributed-memory parallel architectures, e.g., clusters of multicores, achieving a 40x speedup over the Gadget-2 simulation code.
Pages: C95-C121
Number of pages: 27