A detailed performance analysis of the interpolation supplemented lattice Boltzmann method on the Cray T3E and Cray X1

Cited by: 7
Authors
Sunder, C. Shyam [1]
Baskar, G.
Babu, V.
Strenski, David
Affiliations
[1] Indian Inst Technol, Dept Mech Engn, TDCE, Madras 600036, Tamil Nadu, India
[2] Cornell Univ, Sibley Sch Mech & Aerosp Engn, Mat Proc Design & Control Lab, Ithaca, NY 14853 USA
[3] ETH, Inst Energietech, CH-8092 Zurich, Switzerland
[4] Cray Inc, Seattle, WA 98104 USA
Keywords
shared memory; multiprocessors; parallel computing; SHMEM; MPI
DOI
10.1177/1094342006064572
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
A detailed study of the parallel performance of the interpolation supplemented lattice Boltzmann (ISLB) method using SHMEM and MPI on the Cray T3E-900 and Cray X1 architectures is presented. The noteworthy feature of the present implementation of the ISLB method is that it is able to achieve a sustained speed of 4.2 Tflop/s while using 504 processors on a Cray X1. The code is shown to achieve super-linear speedups on the Cray T3E-900. It is shown through detailed profiling that the computation and the communication scale well on the Cray X1, although the overall speedup is adversely affected by the cost of barrier synchronization.
Pages: 557-570
Number of pages: 14
Related papers
50 in total
  • [21] A recursive PVM implementation of an image segmentation algorithm with performance results comparing the HIVE and the Cray T3E
    Tilton, JC
    FRONTIERS '99 - THE SEVENTH SYMPOSIUM ON THE FRONTIERS OF MASSIVELY PARALLEL COMPUTATION, PROCEEDINGS, 1999, : 146 - 153
  • [22] Parallel rendering of 3D AMR data on the SGI/Cray T3E
    Ma, KL
    FRONTIERS '99 - THE SEVENTH SYMPOSIUM ON THE FRONTIERS OF MASSIVELY PARALLEL COMPUTATION, PROCEEDINGS, 1999, : 138 - 145
  • [24] Performance evaluation of the Cray X1 distributed shared memory architecture
    Dunigan, TH
    Vetter, JS
    Worley, PH
    12TH ANNUAL IEEE SYMPOSIUM ON HIGH PERFORMANCE INTERCONNECTS, PROCEEDINGS, 2004, : 20 - 25
  • [25] ParGrad system: Dynamical adaptation of the parallelism degree of programs on Cray T3E
    Werner-Kytölä, O
    HIGH PERFORMANCE COMPUTING IN SCIENCE AND ENGINEERING '99, 2000, : 457 - 468
  • [26] 3-D spectral element-by-element wave modelling on Cray T3E
    Seriani, G
    PHYSICS AND CHEMISTRY OF THE EARTH PART A-SOLID EARTH AND GEODESY, 1999, 24 (03): : 241 - 245
  • [27] Parallel performance of the interpolation supplemented lattice Boltzmann method
    Sunder, CS
    Baskar, G
    Babu, V
    Strenski, D
    HIGH PERFORMANCE COMPUTING - HIPC 2003, 2003, 2913 : 428 - 437
  • [28] Iterative solution of block tridiagonal systems on the Cray T3D and T3E supercomputers
    Pini, G
    Sartoretto, F
    SUPERCOMPUTER, 1997, 13 (3-4): : 67 - 82
  • [29] Porting and performance of the Community Climate System Model (CCSM3) on the Cray X1
    Carr, GR
    Carpenter, IL
    Cordery, MJ
    Drake, JB
    Ham, MW
    Hoffman, FM
    Worley, PH
    Use of High Performance Computing in Meteorology, 2005, : 259 - 271
  • [30] Cray T3E performances of a parallel code for a stochastic dynamic assets and liabilities management model
    Zanghirati, G
    Cocco, F
    Taddei, F
    Paruolo, G
    EURO-PAR'99: PARALLEL PROCESSING, 1999, 1685 : 1176 - 1186