Tuning high-level synthesis SpMV kernels in Alveo FPGAs

被引:0
|
作者
Favaro, Federico [1 ]
Dufrechou, Ernesto [2 ]
Oliver, Juan P. [1 ]
Ezzatti, Pablo [2 ]
机构
[1] Univ Republica, Fac Ingn, Inst Ingn Elect, Montevideo 11300, Uruguay
[2] Univ Republica, Fac Ingn, Inst Computac, Montevideo 11300, Uruguay
关键词
SpMV; Benchmarking; FPGA; ARCHITECTURE; MEMORY;
D O I
10.1016/j.micpro.2024.105104
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Sparse Matrix-Vector Multiplication (SpMV) is an essential operation in scientific and engineering fields, with applications in areas like finite element analysis, image processing, and machine learning. To address the need for faster and more energy-efficient computing, this paper investigates the acceleration of SpMV through Field-Programmable Gate Arrays (FPGAs), leveraging High-Level Synthesis (HLS) for design simplicity. Our study focuses on the AMD-Xilinx Alveo U280 FPGA, assessing the performance of the SpMV kernel from Vitis Libraries, which is the state of the art on SpMV acceleration on FPGAs. We explore kernel modifications, transition to single precision, and varying partition sizes, demonstrating the impact of these changes on execution time. Furthermore, we investigate matrix preprocessing techniques, including Reverse Cuthill-McKee (RCM) reordering and a hybrid sparse storage format, to enhance efficiency. Our findings reveal that the performance of FPGA-accelerated SpMV is influenced by matrix characteristics, by smaller partition sizes, and by specific preprocessing techniques delivering notable performance improvements. By selecting the best results from these experiments, we achieved execution time enhancements of up to 3.2x. This study advances the understanding of FPGA-accelerated SpMV, providing insights into key factors that impact performance and potential avenues for further improvement.
引用
收藏
页数:9
相关论文
共 50 条
  • [41] Learning from the Past: Efficient High-level Synthesis Design Space Exploration for FPGAs
    Wang, Zi
    Schafer, Benjamin Carrion
    ACM TRANSACTIONS ON DESIGN AUTOMATION OF ELECTRONIC SYSTEMS, 2022, 27 (04)
  • [42] High-level synthesis for large bit-width multipliers on FPGAs: A case study
    Quan, G
    Davis, JP
    Devarkal, S
    Buell, DA
    2005 INTERNATIONAL CONFERENCE ON HARDWARE/SOFTWARE CODESIGN AND SYSTEM SYNTHESIS, 2005, : 213 - 218
  • [43] Toward Automated Simulink Model Implementation and Optimization using High-Level Synthesis for FPGAs
    Kredo, Kurtis, II
    Mustafa, Hadil
    Crosbie, Roy
    Bednar, Richard
    Alavi, Zahrasadat
    2019 IEEE ELECTRIC SHIP TECHNOLOGIES SYMPOSIUM (ESTS 2019): EMERGING TECHNOLOGIES FOR FUTURE ELECTRIC SHIPS, 2019, : 172 - 180
  • [44] Open the Gates: Using High-level Synthesis Towards Programmable LDPC Decoders on FPGAs
    Pratas, Frederico
    Andrade, Joao
    Falcao, Gabriel
    Silva, Vitor
    Sousa, Leonel
    2013 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2013, : 1274 - 1277
  • [45] Constraint-Aware Multi-Technique Approximate High-Level Synthesis for FPGAs
    Leipnitz, Marcos T.
    Nazar, Gabriel L.
    ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2023, 16 (04)
  • [46] High-Level Synthesis Techniques to Generate Deeply Pipelined Circuits for FPGAs with Registered Routing
    Chen, Yu Ting
    Kim, Jin Hee
    Li, Kexin
    Hoyes, Graham
    Anderson, Jason H.
    2019 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (ICFPT 2019), 2019, : 375 - 378
  • [47] Acceleration of Trading System Back End with FPGAs Using High-Level Synthesis Flow
    Puranik, Sunil
    Barve, Mahesh
    Rodi, Swapnil
    Patrikar, Rajendra
    ELECTRONICS, 2023, 12 (03)
  • [48] HIGH-LEVEL SYNTHESIS
    PAWLAK, A
    MICROPROCESSING AND MICROPROGRAMMING, 1992, 35 (1-5): : 261 - 261
  • [49] TARO: Automatic Optimization for Free-Running Kernels in FPGA High-Level Synthesis
    Choi, Young-Kyu
    Chi, Yuze
    Lau, Jason
    Cong, Jason
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2023, 42 (07) : 2423 - 2427
  • [50] High-Level Designs of Complex FIR Filters on FPGAs for the SKA
    Wang, Haomiao
    Gante, Joao
    Zhang, Ming
    Falcao, Gabriel
    Sousa, Leonel
    Sinnen, Oliver
    PROCEEDINGS OF 2016 IEEE 18TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS; IEEE 14TH INTERNATIONAL CONFERENCE ON SMART CITY; IEEE 2ND INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS), 2016, : 797 - 804