Tuning high-level synthesis SpMV kernels in Alveo FPGAs

Authors
Favaro, Federico [1]
Dufrechou, Ernesto [2]
Oliver, Juan P. [1]
Ezzatti, Pablo [2]
Affiliations
[1] Univ Republica, Fac Ingn, Inst Ingn Elect, Montevideo 11300, Uruguay
[2] Univ Republica, Fac Ingn, Inst Computac, Montevideo 11300, Uruguay
Keywords
SpMV; Benchmarking; FPGA; Architecture; Memory
DOI
10.1016/j.micpro.2024.105104
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Sparse Matrix-Vector Multiplication (SpMV) is an essential operation in scientific and engineering fields, with applications in areas such as finite element analysis, image processing, and machine learning. To address the need for faster and more energy-efficient computing, this paper investigates the acceleration of SpMV on Field-Programmable Gate Arrays (FPGAs), leveraging High-Level Synthesis (HLS) for design simplicity. Our study focuses on the AMD-Xilinx Alveo U280 FPGA, assessing the performance of the SpMV kernel from Vitis Libraries, which is the state of the art in SpMV acceleration on FPGAs. We explore kernel modifications, a transition to single precision, and varying partition sizes, demonstrating the impact of these changes on execution time. Furthermore, we investigate matrix preprocessing techniques, including Reverse Cuthill-McKee (RCM) reordering and a hybrid sparse storage format, to enhance efficiency. Our findings reveal that the performance of FPGA-accelerated SpMV is influenced by matrix characteristics, and that smaller partition sizes and specific preprocessing techniques deliver notable performance improvements. By selecting the best results from these experiments, we achieved execution time improvements of up to 3.2x. This study advances the understanding of FPGA-accelerated SpMV, providing insights into key factors that impact performance and potential avenues for further improvement.
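For readers unfamiliar with the two building blocks named in the abstract, the following is a minimal software sketch of SpMV over a CSR matrix and of RCM reordering. It is an illustration only: it is not the Vitis Libraries kernel, not the authors' code, and not the hybrid storage format the paper evaluates. The function names (`spmv_csr`, `rcm_order`, `bandwidth`) and the five-vertex example are assumptions made up for this sketch, and the RCM variant shown (breadth-first search from a lowest-degree vertex, neighbors visited in increasing degree, order reversed) is a simple textbook form.

```python
from collections import deque


def spmv_csr(indptr, indices, data, x):
    """Compute y = A @ x for A stored in CSR (row pointers, column indices, values)."""
    y = [0.0] * (len(indptr) - 1)
    for row in range(len(y)):
        acc = 0.0
        # Nonzeros of this row occupy data[indptr[row]:indptr[row + 1]].
        for k in range(indptr[row], indptr[row + 1]):
            acc += data[k] * x[indices[k]]
        y[row] = acc
    return y


def rcm_order(adj):
    """Return an RCM vertex ordering for a symmetric sparsity pattern.

    adj[v] lists the neighbors of vertex v (off-diagonal nonzeros of row v).
    """
    n = len(adj)
    degree = [len(a) for a in adj]
    visited = [False] * n
    order = []
    # Start each connected component from its lowest-degree unvisited vertex.
    for start in sorted(range(n), key=lambda v: degree[v]):
        if visited[start]:
            continue
        visited[start] = True
        queue = deque([start])
        while queue:
            v = queue.popleft()
            order.append(v)
            # Cuthill-McKee visits neighbors in order of increasing degree.
            for w in sorted(adj[v], key=lambda u: degree[u]):
                if not visited[w]:
                    visited[w] = True
                    queue.append(w)
    order.reverse()  # the "reverse" in Reverse Cuthill-McKee
    return order


def bandwidth(adj, order):
    """Largest |i - j| over nonzeros once vertices are renumbered by `order`."""
    pos = {v: i for i, v in enumerate(order)}
    return max((abs(pos[v] - pos[w]) for v in range(len(adj)) for w in adj[v]),
               default=0)


# Hypothetical example: a path graph 0-3-1-4-2 with scrambled labels.
adj = [[3], [3, 4], [4], [0, 1], [1, 2]]
natural_bw = bandwidth(adj, list(range(len(adj))))
rcm_bw = bandwidth(adj, rcm_order(adj))

# Hypothetical 2x2 matrix [[2, 0], [1, 3]] in CSR form, multiplied by [1, 2].
y = spmv_csr([0, 1, 3], [0, 0, 1], [2.0, 1.0, 3.0], [1.0, 2.0])
```

In this toy case the reordering pulls the nonzeros toward the diagonal, which clusters the entries of `x` that each row touches. Whether that locality translates into shorter execution time on an FPGA depends on the matrix, in line with the abstract's statement that performance is influenced by matrix characteristics.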
Pages: 9