An FPGA-Based Energy-Efficient Reconfigurable Convolutional Neural Network Accelerator for Object Recognition Applications

被引：45

作者：

Li, Jixuan ^{[1
]}

Un, Ka-Fai ^{[1
]}

Yu, Wei-Han ^{[1
]}

Mak, Pui-In ^{[1
]}

Martins, Rui P. ^{[1
]}

机构：

[1] Univ Macau, Fac Sci & Technol, State Key Lab Analog & Mixed Signal VLSI IME & DE, Macau, Peoples R China

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS | 2021年 / 68卷 / 09期

关键词：

Frequency modulation; Kernel; Throughput; Parallel processing; Memory management; Field programmable gate arrays; Computational efficiency; Computation efficiency; convolutional neural network (CNN); FPGA; object recognition; reconfigurability; THROUGHPUT; CNN;

D O I：

10.1109/TCSII.2021.3095283

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

The computational efficiency is the prime concern of a computation-intensive deep convolutional neural network (CNN). In this Brief, we report an FPGA-based computation-efficient reconfigurable CNN accelerator. It innovates in the utilization of a kernel partition technique to substantially reduce the repeated access to the input feature maps and the kernels. As a result, it balances the ability for parallel computing while consuming less system power. Experimental results prove that the proposed CNN accelerator achieves a peak throughput of 220.0 GOP/s with an energy efficiency of 22.9 GOPs/W at 151.4 frames/s for the AlexNet. It is also reconfigurable to process VGG-16 befitting complex object recognition.

引用

页码：3143 / 3147

页数：5

共 50 条

[41] FPGA-based Convolutional Neural Network Accelerator design using High Level Synthesize
Ghaffari, Sina
Sharifian, Saeed
2016 2ND INTERNATIONAL CONFERENCE OF SIGNAL PROCESSING AND INTELLIGENT SYSTEMS (ICSPIS), 2016, : 29 - 34
[42] An FPGA-based Accelerator Implementation for Deep Convolutional Neural Networks
Zhou, Yongmei
Jiang, Jingfei
PROCEEDINGS OF 2015 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2015), 2015, : 829 - 832
[43] A High Utilization FPGA-Based Accelerator for Variable-Scale Convolutional Neural Network
Li, Xin
Cai, Yujie
Han, Jun
Zeng, Xiaoyang
2017 IEEE 12TH INTERNATIONAL CONFERENCE ON ASIC (ASICON), 2017, : 944 - 947
[44] Composite FPGA-based Accelerator for Deep Convolutional Neural Networks
HuanZhang
YuanYang
YangXiao
2019 IEEE INTERNATIONAL CONFERENCE ON ELECTRON DEVICES AND SOLID-STATE CIRCUITS (EDSSC), 2019,
[45] A FPGA-based Hardware Accelerator for Multiple Convolutional Neural Networks
Yao, Yuchen
Duan, Qinghua
Zhang, Zhiqian
Gao, Jiabao
Wang, Jian
Yang, Meng
Tao, Xinxuan
Lai, Jinmei
2018 14TH IEEE INTERNATIONAL CONFERENCE ON SOLID-STATE AND INTEGRATED CIRCUIT TECHNOLOGY (ICSICT), 2018, : 1075 - 1077
[46] Design framework for an energy-efficient binary convolutional neural network accelerator based on nonvolatile logic
Suzuki, Daisuke
Oka, Takahiro
Tamakoshi, Akira
Takako, Yasuhiro
Hanyu, Takahiro
IEICE NONLINEAR THEORY AND ITS APPLICATIONS, 2021, 12 (04): : 695 - 710
[47] A Dynamic Reconfigurable Architecture for Hybrid Spiking and Convolutional FPGA-Based Neural Network Designs
Irmak, Hasan
Corradi, Federico
Detterer, Paul
Alachiotis, Nikolaos
Ziener, Daniel
JOURNAL OF LOW POWER ELECTRONICS AND APPLICATIONS, 2021, 11 (03)
[48] Energy-Efficient Bit-Sparse Accelerator Design for Convolutional Neural Network
Xiao, Hang
Xu, Haobo
Wang, Ying
Li, Jiajun
Wang, Yujie
Han, Yinhe
Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2023, 35 (07): : 1122 - 1131
[49] Unified energy-efficient reconfigurable MAC for dynamic Convolutional Neural Network based on Winograd algorithm
Yang, Dong-Sheng
Xu, Chong-Hao
Ruan, Shanq-Jang
Huang, Chun-Ming
MICROPROCESSORS AND MICROSYSTEMS, 2022, 93
[50] A Flexible and Energy-Efficient Convolutional Neural Network Acceleration With Dedicated ISA and Accelerator
Chen, Xiaobai
Yu, Zhiyi
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2018, 26 (07) : 1408 - 1412

← 1 2 3 4 5 →