Scalable parallel linear solver for compact banded systems on heterogeneous architectures

被引:1
|
作者
Song, Hang [1 ]
Matsuno, Kristen V. [1 ]
West, Jacob R. [1 ]
Subramaniam, Akshay [2 ]
Ghate, Aditya S. [2 ]
Lele, Sanjiva K. [1 ,2 ]
机构
[1] Stanford Univ, Dept Mech Engn, Stanford, CA 94305 USA
[2] Stanford Univ, Dept Aeronaut & Astronaut, Stanford, CA 94305 USA
基金
美国国家科学基金会;
关键词
Compact banded system; Periodic boundary; Parallel cyclic reduction; Distributed memory; Parallel computing; BLOCK TRIDIAGONAL SYSTEMS; LARGE-EDDY SIMULATION; CYCLIC REDUCTION; DIFFERENCE SCHEMES; FLOW;
D O I
10.1016/j.jcp.2022.111443
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
A scalable algorithm for solving compact banded linear systems on distributed memory architectures is presented. The proposed method factorizes the original system into two levels of memory hierarchies, and solves it using parallel cyclic reduction on both distributed and shared memory. This method has a lower communication footprint across distributed memory partitions compared to conventional algorithms involving data transposes or re-partitioning. The algorithm developed in this work is generalized to cyclic compact banded systems with flexible data decompositions. For cyclic compact banded systems, the method is a direct solver with a deterministic operation and communication counts depending on the matrix size, its bandwidth, and the partition strategy. The implementation and runtime configuration details are discussed for performance opti-mization. Scalability is demonstrated on the linear solver as well as on a representative fluid mechanics application problem, in which the dominant computational cost is solving the cyclic tridiagonal linear systems of compact numerical schemes on a 3D periodic domain. The algorithm is particularly useful for solving the linear systems arising from the application of compact finite difference operators to a wide range of partial differential equation problems, such as but not limited to the numerical simulations of compressible turbulent flows, aeroacoustics, elastic-plastic wave propagation, and electromagnetics. It alleviates obstacles to their use on modern high performance computing hardware, where memory and computational power are distributed across nodes with multi-threaded processing units. (c) 2022 Elsevier Inc. All rights reserved.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] GPU-Accelerated Scalable Solver for Banded Linear Systems
    Liu, Hang
    Seo, Jung-Hee
    Mital, Rajat
    Huang, H. Howie
    2013 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2013,
  • [3] A MULTILEVEL PARALLEL SOLVER FOR BLOCK TRIDIAGONAL AND BANDED LINEAR-SYSTEMS
    HAJJ, IN
    SKELBOE, S
    PARALLEL COMPUTING, 1990, 15 (1-3) : 21 - 45
  • [4] An asynchronous direct solver for banded linear systems
    Michael A. Jandron
    Anthony A. Ruffa
    James Baglama
    Numerical Algorithms, 2017, 76 : 211 - 235
  • [5] An asynchronous direct solver for banded linear systems
    Jandron, Michael A.
    Ruffa, Anthony A.
    Baglama, James
    NUMERICAL ALGORITHMS, 2017, 76 (01) : 211 - 235
  • [6] Implementation of a non-linear solver on heterogeneous architectures
    Carracciuolo, Luisa
    Lapegna, Marco
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2018, 30 (24):
  • [7] PSPASES: Building a high performance scalable parallel direct solver for sparse linear systems
    Joshi, MV
    Karypis, G
    Kumar, V
    Gupta, A
    Gustavson, F
    PARALLEL NUMERICAL COMPUTATION WITH APPLICATIONS, 1999, 515 : 3 - 18
  • [8] A tearing-based hybrid parallel banded linear system solver
    Naumov, Maxim
    Sameh, Ahmed H.
    JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2009, 226 (02) : 306 - 318
  • [9] Parallel logic programming systems on scalable architectures
    Costa, VS
    Bianchini, R
    Dutra, ID
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2000, 60 (07) : 835 - 852
  • [10] Direct parallel algorithms for banded linear systems
    Arbenz, P
    Gander, W
    ZEITSCHRIFT FUR ANGEWANDTE MATHEMATIK UND MECHANIK, 1996, 76 : 119 - 122