Application of AVX (Advanced Vector Extensions) for Improved Performance of the PARFES - Finite Element Parallel Direct Solver

被引:0
|
作者
Fialko, Sergiy [1 ]
机构
[1] Tadeusz Kosciuszko Cracow Univ Technol, PL-31155 Krakow, Poland
来源
2013 FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (FEDCSIS) | 2013年
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The paper considers application of the AVX (Advanced Vector Extensions) technique to improve the performance of the PARFES parallel finite element solver, intended for finite element analysis of large-scale problems of structural and solid mechanics using multi-core computers. The basis for this paper was the fact that the dgemm matrix multiplication procedure implemented in the Intel MKL (Math Kernel Library) and ACML (AMID Core Math Library) libraries, which lays down the foundations for achieving high performance of direct methods for sparse matrices, does not provide for satisfactory performance with the AMID Opteron 6276 processor, Bulldozer architecture, when used with the algorithm required for PARFES. The procedure presented herein significantly improves the performance of PARFES on computers with processors of the above architecture, while maintaining the competitiveness of PARFES with the Intel MKL dgemm procedure on computers with Intel processors.
引用
收藏
页码:447 / 454
页数:8
相关论文
共 29 条
  • [1] Parallel finite element solver PARFES for the structural analysis in NUMA architecture
    Fialko, Sergiy
    ADVANCES IN ENGINEERING SOFTWARE, 2022, 174
  • [2] Performance of multi level parallel direct solver for hp Finite Element Method
    Paszynski, Maciej
    PARALLEL PROCESSING AND APPLIED MATHEMATICS, 2008, 4967 : 1303 - 1312
  • [3] An efficient parallel direct solver for finite element applications
    Anderheggen, E
    DEVELOPMENTS IN ENGINEERING COMPUTATIONAL TECHNOLOGY, 2000, : 259 - 264
  • [4] Performance study of the domain decomposition method with direct equation solver for parallel finite element analysis
    Nikishkov, GP
    Makinouchi, A
    Yagawa, G
    Yoshimura, S
    COMPUTATIONAL MECHANICS, 1996, 19 (02) : 84 - 93
  • [5] Performance Evaluation of Matrix-Matrix Multiplications Using Intel's Advanced Vector Extensions (AVX)
    Hassana, Somaia Awad
    Hemeida, A. M.
    Mahmoud, Mountasser M. M.
    MICROPROCESSORS AND MICROSYSTEMS, 2016, 47 : 369 - 374
  • [6] Fine-grained heterogeneous parallel direct solver for finite element problems
    Wang, Yujie
    Wang, Shengquan
    Zhang, Xuerui
    Li, Guangyao
    Cai, Yong
    COMPUTER PHYSICS COMMUNICATIONS, 2023, 284
  • [7] A parallel direct solver for the self-adaptive hp Finite Element Method
    Paszynski, Maciej
    Pardo, David
    Torres-Verdin, Carlos
    Demkowicz, Leszek
    Calo, Victor
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2010, 70 (03) : 270 - 281
  • [8] A Parallel Direct Solver for a Hierarchical H-Adaptive Finite Element Code
    Rodenas, J. J.
    Corral, C.
    Mas, J.
    Olmeda, F.
    Albelda, J.
    PROCEEDINGS OF THE SEVENTH INTERNATIONAL CONFERENCE ON ENGINEERING COMPUTATIONAL TECHNOLOGY, 2010, 94
  • [9] Performance Evaluation of Parallel Finite-Element Eddy-Current Analysis Using Direct Method as Subdomain Solver
    Mizuma, Takehito
    Takei, Amane
    IEEE TRANSACTIONS ON MAGNETICS, 2020, 56 (02)
  • [10] Fully parallel and pipelined sparse direct solver for large symmetric indefinite finite element problems
    Wang, Yujie
    Wang, Shengquan
    Cai, Yong
    Wang, Guidong
    Li, Guangyao
    COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2024, 175 : 447 - 469