BLASFEO: Basic Linear Algebra Subroutines for Embedded Optimization

被引:64
|
作者
Frison, Gianluca [1 ]
Kouzoupis, Dimitris [1 ]
Sartor, Tommaso [1 ]
Zanelli, Andrea [1 ]
Diehl, Moritz [1 ,2 ]
机构
[1] Univ Freiburg, Dept Microsyst Engn, Georges Koehler Allee 102, D-79110 Freiburg, Germany
[2] Univ Freiburg, Dept Math, Georges Koehler Allee 102, D-79110 Freiburg, Germany
来源
关键词
Linear algebra; libraries; high-performance; matrix; BLAS;
D O I
10.1145/3210754
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Basic Linear Algebra Subroutines for Embedded Optimization (BLASFEO) is a dense linear algebra library providing high-performance implementations of BLAS-and LAPACK-like routines for use in embedded optimization and small-scale high-performance computing, in general. A key difference with respect to existing high-performance implementations of BLAS is that the computational performance is optimized for smallto medium-scale matrices, i.e., for sizes up to a few hundred. BLASFEO comes with three different implementations: a high-performance implementation aimed at providing the highest performance for matrices fitting in cache, a reference implementation providing portability and embeddability and optimized for very small matrices, and a wrapper to standard BLAS and LAPACK providing high performance on large matrices. The three implementations of BLASFEO together provide high-performance dense linear algebra routines for matrices ranging from very small to large. Compared to both open-source and proprietary highly tuned BLAS libraries, for matrices of size up to about 100, the high-performance implementation of BLASFEO is about 20-30% faster than the corresponding level 3 BLAS routines and two to three times faster than the corresponding LAPACK routines.
引用
收藏
页数:30
相关论文
共 50 条
  • [1] Compiling basic linear algebra subroutines for quantum computers
    Zhao, Liming
    Zhao, Zhikuan
    Rebentrost, Patrick
    Fitzsimons, Joseph
    QUANTUM MACHINE INTELLIGENCE, 2021, 3 (02)
  • [2] Compiling basic linear algebra subroutines for quantum computers
    Liming Zhao
    Zhikuan Zhao
    Patrick Rebentrost
    Joseph Fitzsimons
    Quantum Machine Intelligence, 2021, 3
  • [3] Evaluating the performance of basic linear algebra subroutines on a torus array processor
    Zekri, Ahmed S.
    Sedukhin, Stanislav G.
    2007 CIT: 7TH IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY, PROCEEDINGS, 2007, : 300 - 305
  • [4] Performance evaluation of basic linear algebra subroutines on a matrix co-processor
    Zekri, Ahmed S.
    Sedukhin, Stanislav G.
    PARALLEL PROCESSING AND APPLIED MATHEMATICS, 2008, 4967 : 1190 - 1199
  • [5] A Basic Linear Algebra Compiler for Embedded Processors
    Kyrtatas, Nikolaos
    Spampinato, Daniele G.
    Pueschel, Markus
    2015 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2015, : 1054 - 1059
  • [6] LOCAL BASIC LINEAR ALGEBRA SUBROUTINES (LBLAS) FOR DISTRIBUTED MEMORY ARCHITECTURES AND LANGUAGES WITH ARRAY SYNTAX
    JOHNSSON, SL
    ORTIZ, LF
    INTERNATIONAL JOURNAL OF SUPERCOMPUTER APPLICATIONS AND HIGH PERFORMANCE COMPUTING, 1992, 6 (04): : 322 - 350
  • [7] Local basic linear algebra subroutines (LBLAS) for the CM-5/5E
    Kramer, D
    Johnsson, SL
    Hu, Y
    INTERNATIONAL JOURNAL OF SUPERCOMPUTER APPLICATIONS AND HIGH PERFORMANCE COMPUTING, 1996, 10 (04): : 300 - 335
  • [8] A PARALLEL AND VECTORIAL IMPLEMENTATION OF BASIC LINEAR ALGEBRA SUBROUTINES IN ITERATIVE SOLVING OF LARGE SPARSE LINEAR-SYSTEMS OF EQUATIONS
    MAGNIN, H
    COULOMB, JL
    IEEE TRANSACTIONS ON MAGNETICS, 1989, 25 (04) : 2895 - 2897
  • [9] The Impact of Data Distribution in Accuracy and Performance of Parallel Linear Algebra Subroutines
    Rocker, Bjoern
    Kolberg, Mariana
    Heuveline, Vincent
    HIGH PERFORMANCE COMPUTING FOR COMPUTATIONAL SCIENCE - VECPAR 2010, 2011, 6449 : 394 - +
  • [10] Basic Principles of Linear Algebra
    Camacho, Luis
    TELOS-REVISTA DE ESTUDIOS INTERDISCIPLINARIOS EN CIENCIAS SOCIALES, 2009, 11 (01): : 126 - 126