Scalable parallel linear solver for compact banded systems on heterogeneous architectures

Authors
Song, Hang [1]
Matsuno, Kristen V. [1]
West, Jacob R. [1]
Subramaniam, Akshay [2]
Ghate, Aditya S. [2]
Lele, Sanjiva K. [1, 2]
Affiliations
[1] Stanford Univ, Dept Mech Engn, Stanford, CA 94305 USA
[2] Stanford Univ, Dept Aeronaut & Astronaut, Stanford, CA 94305 USA
Funding
National Science Foundation (USA)
Keywords
Compact banded system; Periodic boundary; Parallel cyclic reduction; Distributed memory; Parallel computing; BLOCK TRIDIAGONAL SYSTEMS; LARGE-EDDY SIMULATION; CYCLIC REDUCTION; DIFFERENCE SCHEMES; FLOW;
DOI
10.1016/j.jcp.2022.111443
Chinese Library Classification
TP39 [Computer applications]
Subject classification codes
081203; 0835
Abstract
A scalable algorithm for solving compact banded linear systems on distributed memory architectures is presented. The proposed method factorizes the original system into two levels of memory hierarchies, and solves it using parallel cyclic reduction on both distributed and shared memory. This method has a lower communication footprint across distributed memory partitions compared to conventional algorithms involving data transposes or re-partitioning. The algorithm developed in this work is generalized to cyclic compact banded systems with flexible data decompositions. For cyclic compact banded systems, the method is a direct solver with deterministic operation and communication counts that depend on the matrix size, its bandwidth, and the partition strategy. The implementation and runtime configuration details are discussed for performance optimization. Scalability is demonstrated on the linear solver as well as on a representative fluid mechanics application problem, in which the dominant computational cost is solving the cyclic tridiagonal linear systems of compact numerical schemes on a 3D periodic domain. The algorithm is particularly useful for solving the linear systems arising from the application of compact finite difference operators to a wide range of partial differential equation problems, including, but not limited to, the numerical simulations of compressible turbulent flows, aeroacoustics, elastic-plastic wave propagation, and electromagnetics. It alleviates obstacles to their use on modern high performance computing hardware, where memory and computational power are distributed across nodes with multi-threaded processing units. © 2022 Elsevier Inc. All rights reserved.
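For readers unfamiliar with the building block named in the abstract, the following is a minimal sketch of textbook parallel cyclic reduction (PCR) applied to a single cyclic tridiagonal system. It is not the authors' two-level distributed algorithm, and it does not reproduce their data decomposition or communication pattern. The function name `pcr_cyclic_tridiagonal` is hypothetical, and the sketch assumes the system size is a power of two and that the matrix is well conditioned enough (for example, diagonally dominant) that no pivoting is needed.

```python
def pcr_cyclic_tridiagonal(a, b, c, d):
    """Solve a[i]*x[i-1] + b[i]*x[i] + c[i]*x[i+1] = d[i], indices taken mod n.

    Illustrative textbook PCR (hypothetical helper, not the paper's code).
    Assumes n is a power of two and that no pivoting is required.
    """
    n = len(b)
    if n == 0 or (n & (n - 1)) != 0:
        raise ValueError("this sketch assumes n is a power of two")
    a, b, c, d = list(a), list(b), list(c), list(d)

    stride = 1
    while stride < n:
        na, nb, nc, nd = [0.0] * n, [0.0] * n, [0.0] * n, [0.0] * n
        # Every row is updated independently of the others, which is what
        # makes the reduction step data-parallel.
        for i in range(n):
            lo = (i - stride) % n   # periodic neighbour at distance -stride
            hi = (i + stride) % n   # periodic neighbour at distance +stride
            alpha = -a[i] / b[lo]   # eliminates x[i - stride]
            gamma = -c[i] / b[hi]   # eliminates x[i + stride]
            na[i] = alpha * a[lo]   # new coupling to x[i - 2*stride]
            nc[i] = gamma * c[hi]   # new coupling to x[i + 2*stride]
            nb[i] = b[i] + alpha * c[lo] + gamma * a[hi]
            nd[i] = d[i] + alpha * d[lo] + gamma * d[hi]
        a, b, c, d = na, nb, nc, nd
        stride *= 2

    # After log2(n) steps the stride equals n, so by periodicity each row
    # couples x[i] only to itself: (a[i] + b[i] + c[i]) * x[i] = d[i].
    return [d[i] / (a[i] + b[i] + c[i]) for i in range(n)]


if __name__ == "__main__":
    n = 8
    # Constant off-diagonal 1/3 and unit diagonal, chosen only as an example
    # of a diagonally dominant periodic system.
    a, b, c = [1.0 / 3.0] * n, [1.0] * n, [1.0 / 3.0] * n
    d = [float(i) for i in range(n)]
    x = pcr_cyclic_tridiagonal(a, b, c, d)
    residual = max(
        abs(a[i] * x[i - 1] + b[i] * x[i] + c[i] * x[(i + 1) % n] - d[i])
        for i in range(n)
    )
    print("max residual:", residual)
```

The number of reduction steps is fixed at log2(n) regardless of the right-hand side, which illustrates in a small way why the abstract can describe the method as a direct solver with deterministic operation counts.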
Pages: 16