Hardware Efficient Convolution Processing Unit for Deep Neural Networks

被引:2
|
作者
Hazarika, Anakhi [1 ]
Poddar, Soumyajit [1 ]
Rahaman, Hafizur [2 ]
机构
[1] Indian Inst Informat Technol Guwahati, Gauhati 781015, India
[2] Indian Inst Engn Sci & Technol, Sibpur 711103, Howrah, India
关键词
Deep Neural Network; CNN Hardware Accelerator; Field Programmable Gate Array (FPGA);
D O I
10.1109/isdcs.2019.8719278
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Convolutional Neural Network (CNN) is a type of deep neural networks that are commonly used for object detection and classification. State-of-the-art hardware for training and inference of CNN architectures require a considerable amount of computation and memory intensive resources. CNN achieves greater accuracy at the cost of high computational complexity and large power consumption. To optimize the memory requirement, processing speed and power, it is crucial to design more efficient accelerator architecture for CNN computation. In this work, an overlap of spatially adjacent data is exploited in order to parallelize the movement of data. A fast, re-configurable hardware accelerator architecture along with optimized kernel design suitable for a variety of CNN models is proposed. Our design achieves 2.1x computational benefits over state-of-the-art accelerator architectures.
引用
收藏
页数:4
相关论文
共 50 条
  • [21] Implementation and Evaluation of Deep Neural Networks in Commercially Available Processing in Memory Hardware
    Das, Prangon
    Sutradhar, Purab Ranjan
    Indovina, Mark
    Dinakarrao, Sai Manoj Pudukotai
    Ganguly, Amlan
    2022 IEEE 35TH INTERNATIONAL SYSTEM-ON-CHIP CONFERENCE (IEEE SOCC 2022), 2022, : 293 - 298
  • [22] A Pipelined Energy-efficient Hardware Accelaration for Deep Convolutional Neural Networks
    Alaeddine, Hmidi
    Jihene, Malek
    2019 IEEE INTERNATIONAL CONFERENCE ON DESIGN & TEST OF INTEGRATED MICRO & NANO-SYSTEMS (DTS), 2019,
  • [23] From Algorithm to Hardware: A Survey on Efficient and Safe Deployment of Deep Neural Networks
    Geng, Xue
    Wang, Zhe
    Chen, Chunyun
    Xu, Qing
    Xu, Kaixin
    Jin, Chao
    Gupta, Manas
    Yang, Xulei
    Chen, Zhenghua
    Aly, Mohamed M. Sabry
    Lin, Jie
    Wu, Min
    Li, Xiaoli
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, : 1 - 21
  • [24] An Updated Survey of Efficient Hardware Architectures for Accelerating Deep Convolutional Neural Networks
    Capra, Maurizio
    Bussolino, Beatrice
    Marchisio, Alberto
    Shafique, Muhammad
    Masera, Guido
    Martina, Maurizio
    FUTURE INTERNET, 2020, 12 (07):
  • [25] Quantized Deep Neural Networks for Energy Efficient Hardware-based Inference
    Ding, Ruizhou
    Liu, Zeye
    Blanton, R. D.
    Marculescu, Diana
    2018 23RD ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC), 2018, : 1 - 8
  • [26] An Architecture to Accelerate Convolution in Deep Neural Networks
    Ardakani, Arash
    Condo, Carlo
    Ahmadi, Mehdi
    Gross, Warren J.
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2018, 65 (04) : 1349 - 1362
  • [27] Deep Convolution Neural Networks for Image Classification
    Kulkarni, Arun D.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (06) : 18 - 23
  • [28] Calculate Deep Convolution NeurAl Network on Cell Unit
    Lu, Haofang
    Zhou, Ying
    Zhang, Zi-Ke
    INFORMATION SCIENCE AND APPLICATIONS 2017, ICISA 2017, 2017, 424 : 526 - 532
  • [29] MEPAD: A Memory-Efficient Parallelized Direct Convolution Algorithm for Deep Neural Networks
    Fiorin, Leandro
    Silvano, Cristina
    EURO-PAR 2024: PARALLEL PROCESSING, PART II, EURO-PAR 2024, 2024, 14802 : 167 - 181
  • [30] THE COMBINATION OF CONVOLUTION NEURAL NETWORKS AND DEEP NEURAL NETWORKS FOR FAKE NEWS DETECTION
    Jawad, Zainab A.
    Obaid, Ahmed J.
    JOURNAL OF ENGINEERING SCIENCE AND TECHNOLOGY, 2023, 18 (01): : 814 - 826