Parallel pivots LU algorithm on the Cray T3E

被引:0
|
作者
Asenjo, R [1 ]
Zapata, EL [1 ]
机构
[1] Univ Malaga, Comp Architecture Dept, E-29071 Malaga, Spain
来源
PARALLEL COMPUTATION | 1999年 / 1557卷
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Solving large nonsymmetric sparse linear systems on distributed memory multiprocessors is an active research area. We present a loop-level parallelized generic LU algorithm which comprises analyse-factorize and solve stages. To further exploit matrix sparsity and parallelism, the analyse step looks for a set of compatible pivots. Sparse techniques are applied until the reduced submatrix reaches a threshold density. At this point, a switch to dense routines takes place in both analyse-factorize and solve stages. The SPMD code follows a sparse cyclic distribution to map the system matrix onto a P x Q processor mesh. Experimental results show a good behavior of our sequential algorithm compared with a standard generic solver: the MA48 routine. Additionally, a parallel version an the Gray T3E exhibits high performance in terms of speed-up and efficiency.
引用
收藏
页码:38 / 47
页数:10
相关论文
共 50 条
  • [21] Iterative solution of block tridiagonal systems on the Cray T3D and T3E supercomputers
    Pini, G
    Sartoretto, F
    SUPERCOMPUTER, 1997, 13 (3-4): : 67 - 82
  • [22] Message passing evaluation and analysis on Cray T3E and SGI origin 2000 systems
    Prieto, M
    Espadas, D
    Llorente, IM
    Tirado, F
    EURO-PAR'99: PARALLEL PROCESSING, 1999, 1685 : 173 - 182
  • [23] A detailed performance analysis of the interpolation supplemented lattice Boltzmann method on the Cray T3E and Cray X1
    Sunder, C. Shyam
    Baskar, G.
    Babu, V.
    Strenski, David
    INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2006, 20 (04): : 557 - 570
  • [25] Scaling performance of MM5 on the Cray T3E and a beowulf-like system
    Kouatchou, J
    INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOL VI, PROCEEDINGS, 1999, : 2947 - 2953
  • [26] Simulation of point defect clustering in Cz-silicon wafers on the Cray T3E scalable parallel computer: Application to oxygen precipitation
    Karoui, FS
    Karoui, A
    Rozgonyi, GA
    2000 INTERNATIONAL CONFERENCE ON MODELING AND SIMULATION OF MICROSYSTEMS, TECHNICAL PROCEEDINGS, 2000, : 98 - 101
  • [27] A scalable molecular-dynamics algorithm suite for materials simulations: design-space diagram on 1024 Cray T3E processors
    Shimojo, F
    Campbell, TJ
    Kalia, RK
    Nakano, A
    Vashishta, P
    Ogata, S
    Tsuruta, K
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2000, 17 (03): : 279 - 291
  • [28] Computations of three-dimensional compressible Rayleigh-Taylor instability on SGI/Cray T3E
    Deane, A
    PARALLEL COMPUTATIONAL FLUID DYNAMICS: TOWARDS TERAFLOPS, OPTIMIZATION, AND NOVEL FORMULATIONS, 2000, : 189 - 198
  • [29] Cluster computing vs. Cray T3E - A case study from numerical field theory
    Arnold, G
    Eicker, N
    Lippert, T
    Schilling, K
    NINTH EUROMICRO WORKSHOP ON PARALLEL AND DISTRIBUTED PROCESSING, PROCEEDINGS, 2001, : 475 - 479