Optimize FPGA-based Neural Network Accelerator with Bit-shift Quantization

被引：0

作者：

Liu, Yu ^{[1
]}

Liu, XueJiao ^{[1
]}

Liang, Luhong ^{[1
]}

机构：

[1] Hong Kong Appl Sci & Technol Res Inst ASTRI, Hong Kong, Peoples R China

来源：

2020 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS) | 2020年

关键词：

D O I：

10.1109/iscas45731.2020.9180919

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Well-programmed Field Programmable Gate Arrays (FPGAs) can accelerate Deep Neural Network (DNN) with high power efficiency. The dominant workloads of DNNs are Multiply Accumulates (MACs), which can be directly mapped to Digital Signal Processors (DSPs) in the FPGA. A DNN accelerator pursuing high performance can consume almost all the DSPs, but with a considerable amount of Look-up Tables (LUTs) in the FPGA unused or performing MACs inefficiently. To solve this problem, we present a Bit-Shift method for FPGA-based DNN accelerator to fully utilize the resources in the FPGA. The MAC is converted to a limited number of shift-and-add operations, which can be implemented by LUTs with significant improvement of efficiency. A quantization method based on Minimum Mean Absolute Error (MMAE) is proposed to preserve the accuracy of the DNN inference in the conversion of DNN parameters without re-training. The quantized parameters can be compressed to a fixed and fewer number of bits to reduce the memory bandwidth. Accordingly, a Bit-Shift architecture is designed to load the compressed parameters and perform the converted MAC calculations without extra decompression module. A large scale DNN accelerator with the proposed Bit-Shift architecture is implemented in a Xilinx VU095 FPGA. Experimental results show that the proposed method can boost the processing speed by 32% and reach 331 GOPS at 190MHz clock frequency for ResNet-34.

引用

页数：5

共 50 条

[21] Implementation of FPGA-based Accelerator for Deep Neural Networks
Tsai, Tsung-Han
Ho, Yuan-Chen
Sheu, Ming-Hwa
2019 IEEE 22ND INTERNATIONAL SYMPOSIUM ON DESIGN AND DIAGNOSTICS OF ELECTRONIC CIRCUITS & SYSTEMS (DDECS), 2019,
[22] The Shunt: An FPGA-Based Accelerator for Network Intrusion Prevention
Weaver, Nicholas
Paxson, Vern
Gonzalez, Jose M.
FPGA 2007: FIFTEENTH ACM/SIGDA INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE GATE ARRAYS, 2007, : 199 - 206
[23] FPGA-Based High-Performance Data Compression Deep Neural Network Accelerator
Wang, Hanze
Fu, Yingxun
Ma, Li
2022 INTERNATIONAL CONFERENCE ON BIG DATA, INFORMATION AND COMPUTER NETWORK (BDICN 2022), 2022, : 563 - 569
[24] FPGA-based Convolutional Neural Network Accelerator design using High Level Synthesize
Ghaffari, Sina
Sharifian, Saeed
2016 2ND INTERNATIONAL CONFERENCE OF SIGNAL PROCESSING AND INTELLIGENT SYSTEMS (ICSPIS), 2016, : 29 - 34
[25] A High Utilization FPGA-Based Accelerator for Variable-Scale Convolutional Neural Network
Li, Xin
Cai, Yujie
Han, Jun
Zeng, Xiaoyang
2017 IEEE 12TH INTERNATIONAL CONFERENCE ON ASIC (ASICON), 2017, : 944 - 947
[26] An FPGA-Based Reconfigurable Accelerator for Low-Bit DNN Training
Shao, Haikuo
Lu, Jinming
Lin, Jun
Wang, Zhongfeng
2021 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI (ISVLSI 2021), 2021, : 254 - 259
[27] An FPGA-based binary neural network accelerator with enhanced hardware efficiency and data reuse
Zhang, Dezheng
Cen, Rui
Pu, Han
Wan, Rui
Wang, Dong
MICROELECTRONICS JOURNAL, 2025, 156
[28] FPGA-Based Reconfigurable Convolutional Neural Network Accelerator Using Sparse and Convolutional Optimization
Gowda, Kavitha Malali Vishveshwarappa
Madhavan, Sowmya
Rinaldi, Stefano
Divakarachari, Parameshachari Bidare
Atmakur, Anitha
ELECTRONICS, 2022, 11 (10)
[29] Fast FPGA-based Emulation for ReRAM-Enabled Deep Neural Network Accelerator
Shi, Yongquan
Sun, Yongshuai
Jiang, Jianfei
He, Guanghui
Wang, Qin
Jing, Naifeng
2021 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2021,
[30] An Efficient FPGA-based Depthwise Separable Convolutional Neural Network Accelerator with Hardware Pruning
Liu, Zhengyan
Liu, Qiang
Yan, Shun
Cheung, Ray C. C.
ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2024, 17 (01)

← 1 2 3 4 5 →