FPGA based convolution and memory architecture for Convolutional Neural Network

Cited by: 2
Authors
Shahan, K. A. [1 ]
Rani, Sheeba J. [1 ]
Affiliations
[1] Indian Inst Space Sci & Technol, Dept Avion, Thiruvananthapuram, Kerala, India
Keywords
convolution; neural network; winograd efficient; hardware; architecture; deep convolutional neural network; memory reuse; FPGA;
DOI
10.1109/VLSID49098.2020.00049
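Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract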
Convolutional Neural Networks (CNNs) are widely used in vision-based applications to improve performance, but at the cost of higher storage and computation requirements. Hardware implementations of CNNs are limited by computational complexity and by the bandwidth available when accessing off-chip memory. This work proposes a novel FPGA-based hardware architecture to accelerate CNNs. It combines a 2D convolution unit with reduced computational complexity, based on Winograd's 2D minimal filtering algorithm, with a memory architecture that reduces the on-chip read operations needed to access adjacent input data tiles for convolution. An on-chip memory bank reuse architecture is also used to reduce the number of read and write operations to off-chip memory. The proposed convolution architecture achieves lower computational complexity by reducing the number of multiplication operations without a proportionate increase in the number of addition operations compared to prior implementations. The number of data read operations from on-chip memory is reduced by a factor of 4, and the on-chip memory bank reuse scheme reduces the latency associated with accessing intermediate data. The implementation uses a 16-bit fixed-point representation, which keeps the bit width low to save area and energy. The design is implemented on a Virtex UltraScale+ VCU118 Evaluation Board 2.0 populated with an XCVU9P-L2FLGA2104 device, using a VGG Net based CNN. The computation time for individual convolutional layers is also estimated and found to be reduced. For a 3x3 kernel, the number of multiplications is reduced from 9 to 4 compared to the standard convolution operation, and the number of addition operations is reduced from 14 to 12 compared to prior hardware implementations of Winograd's 2D minimal filtering algorithm.
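
The multiplication saving mentioned in the abstract can be illustrated with a small software sketch of Winograd's 2D minimal filtering for a 2x2 output tile and a 3x3 kernel, commonly written F(2x2, 3x3). This is not the authors' hardware design: the transform matrices below are the commonly published ones for F(2x2, 3x3), and the function names (`winograd_f2x2_3x3`, `direct_conv`) are hypothetical, chosen only for this example. The sketch uses 16 element-wise multiplications to produce 4 outputs, where the direct method needs 36. One reading of the abstract's "9 to 4" figure is that it counts multiplications per output.

```python
# Software sketch of Winograd F(2x2, 3x3); illustrative only, not the paper's RTL.

def matmul(a, b):
    """Plain nested-list matrix product."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def transpose(m):
    return [list(row) for row in zip(*m)]

# Commonly published transform matrices for F(2x2, 3x3).
BT = [[1, 0, -1, 0],
      [0, 1, 1, 0],
      [0, -1, 1, 0],
      [0, 1, 0, -1]]
G = [[1, 0, 0],
     [0.5, 0.5, 0.5],
     [0.5, -0.5, 0.5],
     [0, 0, 1]]
AT = [[1, 1, 1, 0],
      [0, 1, -1, -1]]

def winograd_f2x2_3x3(tile, kernel):
    """4x4 input tile and 3x3 kernel -> 2x2 output tile."""
    u = matmul(matmul(G, kernel), transpose(G))   # kernel transform (can be precomputed)
    v = matmul(matmul(BT, tile), transpose(BT))   # input transform (additions only)
    # The only general multiplications: 16 element-wise products.
    m = [[u[i][j] * v[i][j] for j in range(4)] for i in range(4)]
    return matmul(matmul(AT, m), transpose(AT))   # output transform (additions only)

def direct_conv(tile, kernel):
    """Reference sliding-window result: 9 multiplications per output, 36 in total."""
    return [[sum(tile[i + r][j + c] * kernel[r][c]
                 for r in range(3) for c in range(3))
             for j in range(2)] for i in range(2)]

if __name__ == "__main__":
    tile = [[1, 2, 3, 4],
            [5, 6, 7, 8],
            [9, 10, 11, 12],
            [13, 14, 15, 16]]
    kernel = [[1, 0, -1],
              [2, 0, -2],
              [1, 0, -1]]
    print(winograd_f2x2_3x3(tile, kernel) == direct_conv(tile, kernel))
```

In hardware, the transforms with `BT` and `AT` reduce to additions and subtractions, and the kernel transform with `G` can be computed once per filter. This is why the count of additions, rather than multiplications, becomes the quantity that implementations try to trim further.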
Pages: 183 - 188
Number of pages: 6