Fully parallel and pipelined sparse direct solver for large symmetric indefinite finite element problems

Cited by: 0
Authors
Wang, Yujie [1]
Wang, Shengquan [1]
Cai, Yong [1]
Wang, Guidong [1]
Li, Guangyao [2]
Affiliations
[1] Hunan Univ, State Key Lab Adv Design & Mfg Technol Vehicle, Changsha 410082, Peoples R China
[2] Beijing Inst Technol, Shenzhen Automot Res Inst, Shenzhen 518118, Guangdong, Peoples R China
Keywords
Sparse direct solver; High performance computing; FEM; Cholesky factorization; Algorithm
DOI
10.1016/j.camwa.2024.10.017
Chinese Library Classification (CLC) number
O29 [Applied mathematics]
Discipline classification code
070104
Abstract
Solving sparse linear systems is a primary computational cost in large-scale finite element analysis, and improving solver performance is a key technical challenge in this field. Real-world engineering problems involve diverse materials, elements, and connectivity relationships, which makes their global stiffness matrices difficult for iterative methods to handle. Direct methods, owing to their robustness, are therefore the preferred choice. This paper introduces a novel block-based supernodal LDL^T numerical factorization method. The computation is decomposed into distinct tasks, and the dependencies between these tasks are expressed as a directed acyclic graph that guides the order of computation. On this basis, a global task pool and a local task stack are established to store task queues, improving data reuse and multicore collaboration efficiency. In addition, a task dispatch and work-stealing mechanism is implemented to prevent performance degradation caused by load imbalance. Numerical experiments on a publicly available matrix test set and on real-world engineering finite element problems compare the parallel performance of PARDISO, MUMPS, and the proposed solver. The results show that the proposed solver performs significantly better than the other two solvers across various types of sparse matrices and diverse multicore processor architectures.
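The scheduling idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it is a sequential, round-robin simulation written only to show how a task DAG, a global pool, local stacks, and stealing might interact. The function name `run_dag`, the use of one local stack per worker, the stealing policy (take the oldest task from the fullest stack), and the task labels are all assumptions made for illustration.

```python
from collections import deque

def run_dag(deps, num_workers=2):
    """Simulate DAG-driven scheduling (hypothetical sketch).

    deps maps each task to the set of tasks it depends on; every task
    must appear as a key. Returns a list of (worker, task) pairs.
    """
    # Count unfinished prerequisites and build the successor map.
    remaining = {t: len(p) for t, p in deps.items()}
    successors = {t: [] for t in deps}
    for task, prereqs in deps.items():
        for p in prereqs:
            successors[p].append(task)

    # The global pool holds the initially ready tasks; each simulated
    # worker owns a local stack (an assumption of this sketch).
    global_pool = deque(sorted(t for t, n in remaining.items() if n == 0))
    local = [deque() for _ in range(num_workers)]
    log = []

    while len(log) < len(deps):
        progressed = False
        for w in range(num_workers):
            if local[w]:
                task = local[w].pop()           # newest local task first
            elif global_pool:
                task = global_pool.popleft()    # fall back to the global pool
            else:
                victim = max(range(num_workers), key=lambda v: len(local[v]))
                if not local[victim]:
                    continue                    # nothing to steal this round
                task = local[victim].popleft()  # steal the oldest task
            log.append((w, task))
            progressed = True
            for s in successors[task]:
                remaining[s] -= 1
                if remaining[s] == 0:
                    local[w].append(s)          # keep follow-up work local
        if not progressed:
            raise ValueError("dependency cycle detected")
    return log

# Hypothetical task graph: two independent blocks A and B update a parent C.
deps = {
    "factor(A)": set(),
    "factor(B)": set(),
    "update(C,A)": {"factor(A)"},
    "update(C,B)": {"factor(B)"},
    "factor(C)": {"update(C,A)", "update(C,B)"},
}
for worker, task in run_dag(deps):
    print(worker, task)
```

Pushing newly ready tasks onto the stack of the worker that just finished their prerequisite is one plausible way to favor data reuse, since that worker most recently touched the relevant block; a real multithreaded solver would also need synchronization, which this sketch omits.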
Pages: 447-469
Page count: 23
Related papers
50 in total
  • [1] A Sparse Symmetric Indefinite Direct Solver for GPU Architectures
    Hogg, Jonathan D.
    Ovtchinnikov, Evgueni
    Scott, Jennifer A.
    ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 2016, 42 (01):
  • [2] Sparse direct solver for large finite element problems based on the minimum degree algorithm
    Parik, Petr
    Plesek, Jiri
    ADVANCES IN ENGINEERING SOFTWARE, 2017, 113 : 2 - 6
  • [3] Fine-grained heterogeneous parallel direct solver for finite element problems
    Wang, Yujie
    Wang, Shengquan
    Zhang, Xuerui
    Li, Guangyao
    Cai, Yong
    COMPUTER PHYSICS COMMUNICATIONS, 2023, 284
  • [4] Task-parallel tiled direct solver for dense symmetric indefinite systems
    Shen, Zhongyu
    Zhang, Jilin
    Suzuki, Tomohiro
    PARALLEL COMPUTING, 2022, 111
  • [5] Direct Finite Element Solver of Linear Complexity for Analyzing Electrically Large Problems
    Zhou, Bangda
    Jiao, Dan
    2015 31ST INTERNATIONAL REVIEW OF PROGRESS IN APPLIED COMPUTATIONAL ELECTROMAGNETICS (ACES) VOL 31, 2015,
  • [6] Sparse Matrix and Solver Objects for Parallel Finite Element Simulation of Multi-field Problems
    Wang, Wenqing
    Kolditz, Olaf
    HIGH PERFORMANCE COMPUTING AND APPLICATIONS, 2010, 5938 : 418 - 425
  • [7] An efficient parallel direct solver for finite element applications
    Anderheggen, E
    DEVELOPMENTS IN ENGINEERING COMPUTATIONAL TECHNOLOGY, 2000, : 259 - 264
  • [8] Performance of a fully parallel sparse solver
    Heath, MT
    Raghavan, P
    INTERNATIONAL JOURNAL OF SUPERCOMPUTER APPLICATIONS AND HIGH PERFORMANCE COMPUTING, 1997, 11 (01): : 49 - 64
  • [9] Replicated Computational Results (RCR) Report for A Sparse Symmetric Indefinite Direct Solver for GPU Architectures
    Bavier, Eric T.
    ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 2016, 42 (01):
  • [10] A Parallel Randomized Finite Element Domain Decomposition Solver for Large-scale EM Problems
    Wang, Wei
    Vouvakis, Marinos N.
    2016 IEEE ANTENNAS AND PROPAGATION SOCIETY INTERNATIONAL SYMPOSIUM, 2016, : 1125 - 1126