A High-Performance FPGA Accelerator for CUR Decomposition

被引:1
|
作者
Abdelgawad, M. A. A. [1 ]
Cheung, Ray C. C. [1 ]
Yan, Hong [1 ]
机构
[1] City Univ Hong Kong, Dept Elect Engn, Hong Kong, Peoples R China
关键词
CUR decomposition; low-rank decomposition; high level synthesis; SVD and QR decomposition;
D O I
10.1109/FPL57034.2022.00052
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
A matrix factorization is to decompose a matrix into a product of smaller matrices. It is widely used in machine learning algorithms. There are many matrix decomposition algorithms, and each has various applications. CUR matrix decomposition is a widely-used factorization tool that has been employed for dimension reduction and pattern recognition in many scientific and engineering applications, such as image processing, text mining, and wireless communications. In this paper we propose an efficient FPGA-based floating-point accelerator using high-level synthesis (HLS) for the CUR decomposition algorithm. Our experiment results demonstrate the better efficiency of our hardware design compared to the optimized CPU-based software solutions. The speedup of our FPGA-based architecture over the optimized software implementation ranges from 2.37 to 16.82 times for different dimensions of the data input matrix. We evaluated our design using large dimension matrices 1024x1024 and 2048 x 2048 and the experiment results demonstrated the efficiency of our design in terms of the utilized resources and latency. Finally, we have compared our design with other matrix decomposition algorithms such as SVD and QR decomposition, the experiment results demonstrated that CUR is more efficient than SVD and QR decomposition in terms of latency and required resources.
引用
收藏
页码:294 / 299
页数:6
相关论文
共 50 条
  • [21] ScalaBFS2: A High-performance BFS Accelerator on an HBM-enhanced FPGA Chip
    Li, Kexin
    Xu, Shaoxian
    Shao, Zhiyuan
    Zheng, Ran
    Liao, Xiaofei
    Jin, Hai
    ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2024, 17 (02)
  • [22] HPTA: A High Performance Transformer Accelerator Based on FPGA
    Han, Yuntao
    Liu, Qiang
    2023 33RD INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS, FPL, 2023, : 27 - 33
  • [23] A High-Performance and Ultra-Low-Power Accelerator Design for Advanced Deep Learning Algorithms on an FPGA
    Gundrapally, Achyuth
    Shah, Yatrik Ashish
    Alnatsheh, Nader
    Choi, Kyuwon Ken
    ELECTRONICS, 2024, 13 (13)
  • [24] FNNG: A High-Performance FPGA-based Accelerator for K-Nearest Neighbor Graph Construction
    Liu, Chaoqiang
    Liu, Haifeng
    Zheng, Long
    Huang, Yu
    Ye, Xiangyu
    Liao, Xiaofei
    Jin, Hai
    PROCEEDINGS OF THE 2023 ACM/SIGDA INTERNATIONAL SYMPOSIUM ON FIELD PROGRAMMABLE GATE ARRAYS, FPGA 2023, 2023, : 67 - 77
  • [25] MuDBN: An Energy-Efficient and High-Performance Multi-FPGA Accelerator for Deep Belief Networks
    Cheng, Yuming
    Wang, Chao
    Zhao, Yangyang
    Chen, Xianglan
    Zhou, Xuehai
    Li, Xi
    PROCEEDINGS OF THE 2018 GREAT LAKES SYMPOSIUM ON VLSI (GLSVLSI'18), 2018, : 435 - 438
  • [26] AOS: An Automated Overclocking System for High-Performance CNN Accelerator Through Timing Delay Measurement on FPGA
    Jiang, Weixiong
    Yu, Heng
    Chen, Fupeng
    Ha, Yajun
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2023, 42 (09) : 2952 - 2965
  • [27] A High-Performance and Flexible FPGA Inference Accelerator for Decision Forests Based on Prior Feature Space Partitioning
    Chu, Thiem Van
    Kitajima, Ryuichi
    Kawamura, Kazushi
    Yu, Jaehoon
    Motomura, Masato
    2021 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (ICFPT), 2021, : 29 - 38
  • [28] High-performance computing using a reconfigurable accelerator
    Hartenstein, RW
    Becker, J
    Kress, R
    Reinig, H
    CONCURRENCY-PRACTICE AND EXPERIENCE, 1996, 8 (06): : 429 - 443
  • [29] High-performance insulator structures for accelerator applications
    Sampayan, SE
    Caporaso, GJ
    Sanders, DM
    Stoddard, RD
    Trimble, DO
    Elizondo, J
    Krogh, ML
    Wieskamp, TF
    PROCEEDINGS OF THE 1997 PARTICLE ACCELERATOR CONFERENCE, VOLS 1-3: PLENARY AND SPECIAL SESSIONS ACCELERATORS AND STORAGE RINGS - BEAM DYNAMICS, INSTRUMENTATION, AND CONTROLS, 1998, : 1308 - 1310
  • [30] BOTTLE ACCELERATOR FOR HIGH-PERFORMANCE FILLING PLANTS
    MERNOE, E
    MONATSSCHRIFT FUR BRAUWISSENSCHAFT, 1983, 36 (05): : 207 - 207