Tuning high-level synthesis SpMV kernels in Alveo FPGAs

Authors
Favaro, Federico [1]
Dufrechou, Ernesto [2]
Oliver, Juan P. [1]
Ezzatti, Pablo [2]
Affiliations
[1] Univ Republica, Fac Ingn, Inst Ingn Elect, Montevideo 11300, Uruguay
[2] Univ Republica, Fac Ingn, Inst Computac, Montevideo 11300, Uruguay
Keywords
SpMV; Benchmarking; FPGA; Architecture; Memory
DOI
10.1016/j.micpro.2024.105104
Chinese Library Classification
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Sparse Matrix-Vector Multiplication (SpMV) is an essential operation in scientific and engineering fields, with applications in areas such as finite element analysis, image processing, and machine learning. To address the need for faster and more energy-efficient computing, this paper investigates the acceleration of SpMV on Field-Programmable Gate Arrays (FPGAs), leveraging High-Level Synthesis (HLS) for design simplicity. Our study focuses on the AMD-Xilinx Alveo U280 FPGA and assesses the performance of the SpMV kernel from Vitis Libraries, which is the state of the art in SpMV acceleration on FPGAs. We explore kernel modifications, a transition to single precision, and varying partition sizes, and we show the impact of these changes on execution time. Furthermore, we investigate matrix preprocessing techniques, including Reverse Cuthill-McKee (RCM) reordering and a hybrid sparse storage format, to enhance efficiency. Our findings reveal that the performance of FPGA-accelerated SpMV depends on matrix characteristics, and that smaller partition sizes and specific preprocessing techniques deliver notable performance improvements. By selecting the best results from these experiments, we achieved execution-time improvements of up to 3.2x. This study advances the understanding of FPGA-accelerated SpMV, providing insights into key factors that impact performance and potential avenues for further improvement.
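For readers unfamiliar with the operation named in the abstract, the following is a minimal reference sketch of SpMV over a matrix in Compressed Sparse Row (CSR) form. It is not the Vitis Libraries kernel and not the authors' code. The CSR layout, the function names `spmv_csr` and `bandwidth`, and the 3x3 example matrix are all assumptions chosen for illustration. The `bandwidth` helper computes the quantity that reordering schemes such as RCM typically aim to reduce.

```python
def spmv_csr(row_ptr, col_idx, values, x):
    """Compute y = A @ x for a matrix A stored in CSR format (illustrative only)."""
    n_rows = len(row_ptr) - 1
    y = [0.0] * n_rows
    for i in range(n_rows):
        acc = 0.0
        # Nonzeros of row i live in values[row_ptr[i]:row_ptr[i + 1]].
        for k in range(row_ptr[i], row_ptr[i + 1]):
            acc += values[k] * x[col_idx[k]]
        y[i] = acc
    return y


def bandwidth(row_ptr, col_idx):
    """Largest distance |row - column| over all stored nonzeros."""
    width = 0
    for i in range(len(row_ptr) - 1):
        for k in range(row_ptr[i], row_ptr[i + 1]):
            width = max(width, abs(i - col_idx[k]))
    return width


# Hypothetical 3x3 example matrix:
# [[4, 0, 1],
#  [0, 3, 0],
#  [2, 0, 5]]
row_ptr = [0, 2, 3, 5]
col_idx = [0, 2, 1, 0, 2]
values = [4.0, 1.0, 3.0, 2.0, 5.0]
x = [1.0, 2.0, 3.0]

y = spmv_csr(row_ptr, col_idx, values, x)
bw = bandwidth(row_ptr, col_idx)
print(y, bw)
```

The inner loop reads `x` at positions given by `col_idx`, so the access pattern into `x` depends on where the nonzeros sit. That is one reason matrix characteristics and reordering can plausibly affect SpMV performance on any platform.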
Pages: 9