FPGA-based acceleration for binary neural networks in edge computing

被引:1
|
作者
Zhan J.-Y. [1 ]
Yu A.-T. [1 ]
Jiang W. [1 ]
Yang Y.-J. [1 ]
Xie X.-N. [2 ]
Chang Z.-W. [3 ]
Yang J.-H. [4 ]
机构
[1] School of Information and Software Engineering, University of Electronic Science and Technology of China, Chengdu
[2] School of Automation, Chengdu University of Information Technology, Chengdu
[3] State Grid Sichuan Electric Power Research Institute, Chengdu
[4] Department of Information Sciences and Technology, George Mason University, Fairfax
基金
中国国家自然科学基金;
关键词
Accelerator; Binarization; Field-programmable gate array (FPGA); Neural networks; Quantification;
D O I
10.1016/j.jnlest.2023.100204
中图分类号
学科分类号
摘要
As a core component in intelligent edge computing, deep neural networks (DNNs) will increasingly play a critically important role in addressing the intelligence-related issues in the industry domain, like smart factories and autonomous driving. Due to the requirement for a large amount of storage space and computing resources, DNNs are unfavorable for resource-constrained edge computing devices, especially for mobile terminals with scarce energy supply. Binarization of DNN has become a promising technology to achieve a high performance with low resource consumption in edge computing. Field-programmable gate array (FPGA)-based acceleration can further improve the computation efficiency to several times higher compared with the central processing unit (CPU) and graphics processing unit (GPU). This paper gives a brief overview of binary neural networks (BNNs) and the corresponding hardware accelerator designs on edge computing environments, and analyzes some significant studies in detail. The performances of some methods are evaluated through the experiment results, and the latest binarization technologies and hardware acceleration methods are tracked. We first give the background of designing BNNs and present the typical types of BNNs. The FPGA implementation technologies of BNNs are then reviewed. Detailed comparison with experimental evaluation on typical BNNs and their FPGA implementation is further conducted. Finally, certain interesting directions are also illustrated as future work. © 2023 The Authors
引用
收藏
相关论文
共 50 条
  • [21] Convolutional Neural Networks using FPGA-based Pipelining
    Ali G.A.
    Ali A.H.
    Iraqi Journal for Computer Science and Mathematics, 2023, 4 (02): : 215 - 223
  • [22] Implementation of FPGA-based Accelerator for Deep Neural Networks
    Tsai, Tsung-Han
    Ho, Yuan-Chen
    Sheu, Ming-Hwa
    2019 IEEE 22ND INTERNATIONAL SYMPOSIUM ON DESIGN AND DIAGNOSTICS OF ELECTRONIC CIRCUITS & SYSTEMS (DDECS), 2019,
  • [23] An Efficient FPGA-Based Architecture for Convolutional Neural Networks
    Hwang, Wen-Jyi
    Jhang, Yun-Jie
    Tai, Tsung-Ming
    2017 40TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2017, : 582 - 588
  • [24] A survey of FPGA-based accelerators for convolutional neural networks
    Sparsh Mittal
    Neural Computing and Applications, 2020, 32 : 1109 - 1139
  • [25] Energy-Efficient Architecture for FPGA-based Deep Convolutional Neural Networks with Binary Weights
    Duan, Yunzhi
    Li, Shuai
    Zhang, Ruipeng
    Wang, Qi
    Chen, Jienan
    Sobelman, Gerald E.
    2018 IEEE 23RD INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2018,
  • [26] A survey of FPGA-based accelerators for convolutional neural networks
    Mittal, Sparsh
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (04): : 1109 - 1139
  • [27] FPGA-BASED EDGE COMPUTING FRAMEWORK: MODELING OF COMPUTATION TASK SCHEDULING
    Tan, Jianfei
    Yang, Hao
    Zhao, Chun
    Zhang, Lin
    PROCEEDINGS OF ASME 2023 18TH INTERNATIONAL MANUFACTURING SCIENCE AND ENGINEERING CONFERENCE, MSEC2023, VOL 2, 2023,
  • [28] FPGA-based reconfigurable computing
    Chang, J. Morris
    Lo, C. Dan
    MICROPROCESSORS AND MICROSYSTEMS, 2006, 30 (06) : 281 - 282
  • [29] An FPGA-based Emulation Platform for Edge Computing Node Design Exploration
    Soriano, Theo
    Novo, David
    Benoit, Pascal
    PROCEEDINGS OF THE 2021 32ND INTERNATIONAL WORKSHOP ON RAPID SYSTEM PROTOTYPING (RSP): SHORTENING THE PATH FROM SPECIFICATION TO PROTOTYPE, 2021, : 8 - 14
  • [30] Scalable and Efficient Architecture for Random Forest on FPGA-Based Edge Computing
    Cuong Pham-Quoc
    EURO-PAR 2023: PARALLEL PROCESSING WORKSHOPS, PT I, EURO-PAR 2023, 2024, 14351 : 42 - 54