Application of AVX (Advanced Vector Extensions) for Improved Performance of the PARFES - Finite Element Parallel Direct Solver

Author
Fialko, Sergiy [1]
Affiliation
[1] Tadeusz Kosciuszko Cracow Univ Technol, PL-31155 Krakow, Poland
Source
2013 FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (FEDCSIS) | 2013
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The paper considers the application of AVX (Advanced Vector Extensions) to improve the performance of PARFES, a parallel finite element direct solver intended for the analysis of large-scale problems in structural and solid mechanics on multi-core computers. The motivation is that the dgemm matrix multiplication procedure implemented in the Intel MKL (Math Kernel Library) and ACML (AMD Core Math Library) libraries, which underpins the high performance of direct methods for sparse matrices, does not deliver satisfactory performance on the AMD Opteron 6276 processor (Bulldozer architecture) when used with the algorithm required by PARFES. The procedure presented in the paper significantly improves the performance of PARFES on computers with processors of this architecture, while keeping PARFES competitive with the Intel MKL dgemm procedure on computers with Intel processors.
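As a rough illustration of the kind of kernel the abstract refers to, the following C sketch shows how a dgemm-style update C += A·B can use 256-bit AVX registers to process four doubles per instruction. It is not the procedure from the paper: the function name `dgemm_sketch`, the row-major layout, and the loop order are assumptions made for this example, and a production kernel would also add cache and register blocking. The AVX path is compiled only when the compiler defines `__AVX__` (for example with `-mavx`); otherwise the scalar loop does all the work.

```c
#include <stddef.h>
#if defined(__AVX__)
#include <immintrin.h>
#endif

/* Hypothetical helper, not the PARFES or MKL routine.
   Computes C (m x n) += A (m x k) * B (k x n); all matrices are
   row-major, double precision, and stored contiguously. */
void dgemm_sketch(size_t m, size_t n, size_t k,
                  const double *A, const double *B, double *C)
{
    for (size_t i = 0; i < m; ++i) {
        for (size_t p = 0; p < k; ++p) {
            const double a = A[i * k + p];
            size_t j = 0;
#if defined(__AVX__)
            /* Broadcast A[i][p] to all four lanes, then update
               four consecutive entries of row i of C at a time. */
            const __m256d va = _mm256_set1_pd(a);
            for (; j + 4 <= n; j += 4) {
                __m256d vb = _mm256_loadu_pd(&B[p * n + j]);
                __m256d vc = _mm256_loadu_pd(&C[i * n + j]);
                vc = _mm256_add_pd(vc, _mm256_mul_pd(va, vb));
                _mm256_storeu_pd(&C[i * n + j], vc);
            }
#endif
            /* Scalar tail (or the whole row when AVX is unavailable). */
            for (; j < n; ++j)
                C[i * n + j] += a * B[p * n + j];
        }
    }
}
```

The sketch uses separate multiply and add instructions rather than fused multiply-add, so it depends only on the base AVX instruction set. Unaligned loads and stores are used so that the caller does not need to guarantee 32-byte alignment.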
Pages: 447-454
Page count: 8