Optimize FPGA-based Neural Network Accelerator with Bit-shift Quantization

被引：0

作者：

Liu, Yu ^{[1
]}

Liu, XueJiao ^{[1
]}

Liang, Luhong ^{[1
]}

机构：

[1] Hong Kong Appl Sci & Technol Res Inst ASTRI, Hong Kong, Peoples R China

来源：

2020 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS) | 2020年

关键词：

D O I：

10.1109/iscas45731.2020.9180919

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Well-programmed Field Programmable Gate Arrays (FPGAs) can accelerate Deep Neural Network (DNN) with high power efficiency. The dominant workloads of DNNs are Multiply Accumulates (MACs), which can be directly mapped to Digital Signal Processors (DSPs) in the FPGA. A DNN accelerator pursuing high performance can consume almost all the DSPs, but with a considerable amount of Look-up Tables (LUTs) in the FPGA unused or performing MACs inefficiently. To solve this problem, we present a Bit-Shift method for FPGA-based DNN accelerator to fully utilize the resources in the FPGA. The MAC is converted to a limited number of shift-and-add operations, which can be implemented by LUTs with significant improvement of efficiency. A quantization method based on Minimum Mean Absolute Error (MMAE) is proposed to preserve the accuracy of the DNN inference in the conversion of DNN parameters without re-training. The quantized parameters can be compressed to a fixed and fewer number of bits to reduce the memory bandwidth. Accordingly, a Bit-Shift architecture is designed to load the compressed parameters and perform the converted MAC calculations without extra decompression module. A large scale DNN accelerator with the proposed Bit-Shift architecture is implemented in a Xilinx VU095 FPGA. Experimental results show that the proposed method can boost the processing speed by 32% and reach 331 GOPS at 190MHz clock frequency for ResNet-34.

引用

页数：5

共 50 条

[31] FPGA-based Accelerator for Losslessly Quantized Convolutional Neural Networks
Sit, Mankit
Kazami, Ryosuke
Amano, Hideharu
2017 INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE TECHNOLOGY (ICFPT), 2017, : 295 - 298
[32] Customizable FPGA-based Accelerator for Binarized Graph Neural Networks
Wang, Ziwei
Que, Zhiqiang
Luk, Wayne
Fan, Hongxiang
2022 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS 22), 2022, : 1968 - 1972
[33] An FPGA-based Accelerator Implementation for Deep Convolutional Neural Networks
Zhou, Yongmei
Jiang, Jingfei
PROCEEDINGS OF 2015 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2015), 2015, : 829 - 832
[34] Composite FPGA-based Accelerator for Deep Convolutional Neural Networks
HuanZhang
YuanYang
YangXiao
2019 IEEE INTERNATIONAL CONFERENCE ON ELECTRON DEVICES AND SOLID-STATE CIRCUITS (EDSSC), 2019,
[35] A FPGA-based Hardware Accelerator for Multiple Convolutional Neural Networks
Yao, Yuchen
Duan, Qinghua
Zhang, Zhiqian
Gao, Jiabao
Wang, Jian
Yang, Meng
Tao, Xinxuan
Lai, Jinmei
2018 14TH IEEE INTERNATIONAL CONFERENCE ON SOLID-STATE AND INTEGRATED CIRCUIT TECHNOLOGY (ICSICT), 2018, : 1075 - 1077
[36] Optimizing a FPGA-based Neural Accelerator for Small IoT Devices
Hong, Seongmin
Lee, Inho
Park, Yongjun
2018 INTERNATIONAL CONFERENCE ON ELECTRONICS, INFORMATION, AND COMMUNICATION (ICEIC), 2018, : 176 - 177
[37] Optimizing Bayesian Recurrent Neural Networks on an FPGA-based Accelerator
Ferianc, Martin
Que, Zhiqiang
Fan, Hongxiang
Luk, Wayne
Rodrigues, Miguel
2021 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (ICFPT), 2021, : 19 - 28
[38] Customizable FPGA-based Accelerator for Binarized Graph Neural Networks
Wang, Ziwei
Que, Zhiqiang
Luk, Wayne
Fan, Hongxiang
Proceedings - IEEE International Symposium on Circuits and Systems, 2022, 2022-May : 1968 - 1972
[39] FPGA-based Acceleration of Neural Network Training
Sang, Ruoyu
Liu, Qiang
Zhang, Qijun
2016 IEEE MTT-S INTERNATIONAL CONFERENCE ON NUMERICAL ELECTROMAGNETIC AND MULTIPHYSICS MODELING AND OPTIMIZATION (NEMO), 2016,
[40] An FPGA-Based Convolutional Neural Network Coprocessor
Qiu, Changpei
Wang, Xin'an
Zhao, Tianxia
Li, Qiuping
Wang, Bo
Wang, Hu
WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2021, 2021

← 1 2 3 4 5 →