EFFICIENT AND SCALABLE ALGORITHMS FOR SMOOTHED PARTICLE HYDRODYNAMICS ON HYBRID SHARED/DISTRIBUTED-MEMORY ARCHITECTURES

Cited by: 11

Authors:

Gonnet, Pedro [1, 2]

Affiliations:

[1] Univ Durham, Sch Engn & Comp Sci, Durham, England

[2] Google Switzerland, CH-8002 Zurich, Switzerland

Source:

SIAM JOURNAL ON SCIENTIFIC COMPUTING | 2015 / Volume 37 / Issue 01

Keywords:

smoothed particle hydrodynamics; simulation; task-based parallelism; multicores; BINARY SEARCH TREES; MOLECULAR-DYNAMICS; VERLET LISTS; SIMULATION; CODE; SPH;

DOI:

10.1137/140964266

Chinese Library Classification number:

O29 [Applied Mathematics];

Discipline classification code:

070104;

Abstract:

This paper describes a new fast and implicitly parallel approach to neighbor-finding in multiresolution smoothed particle hydrodynamics (SPH) simulations. This new approach is based on hierarchical cell decompositions and sorted interactions, within a task-based formulation. It is shown to be faster than traditional tree-based codes and to scale better than domain decomposition-based approaches on hybrid shared/distributed-memory parallel architectures, e.g., clusters of multicores, achieving a 40x speedup over the Gadget-2 simulation code.


Pages: C95-C121

Number of pages: 27
