Quantization aware approximate multiplier and hardware accelerator for edge computing of deep learning applications

被引:6
|
作者
Reddy, K. Manikantta [1 ]
Vasantha, M. H. [1 ]
Kumar, Y. B. Nithin [1 ]
Gopal, Ch. Keshava [2 ]
Dwivedi, Devesh [1 ]
机构
[1] Natl Inst Technol Goa, Dept Elect & Commun Engn, Ponda 403401, Goa, India
[2] Xilinx India Technol Serv Pvt Ltd, Syst Integrat & Validat Grp, Hyderabad 500032, India
关键词
Approximate computing; Approximate multiplier; Hardware accelerator; Edge computing; Matrix multiplication; LOW-POWER; NEURAL-NETWORK; COMPRESSORS; DESIGN; ADDER;
D O I
10.1016/j.vlsi.2021.08.001
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Approximate computing has emerged as an efficient design methodology for improving the performance and power-efficiency of digital systems by allowing a negligible loss in the output accuracy. Dedicated hardware accelerators built using approximate circuits can solve power-performance trade-off in the computationally complex applications like deep learning. This paper proposes an approximate radix-4 Booth multiplier and hardware accelerator for deploying deep learning applications on power-restricted mobile/edge computing devices. The proposed accelerator uses approximate multiplier based parallel processing elements to accelerate the workloads. The proposed accelerator is tested with matrix-vector multiplication (MVM) and matrix-matrix multiplication (MMM) workloads on Zynq ZCU102 evaluation board. The experimental results show that the average power consumption of the proposed accelerator reduces by 34% and 40% for MVM and MMM respectively, as compared to the conventional multiply-accumulate unit that was used in the literature to implement similar workloads. Moreover, the proposed accelerator achieved an average performance of 5 GOP/s and 42.5 GOP/s for MVM and MMM respectively at 275 MHz, which are 14x and 5x respective improvements over the conventional design.
引用
收藏
页码:268 / 279
页数:12
相关论文
共 50 条
  • [1] Energy-aware Adaptive Approximate Computing for Deep Learning Applications
    TaheriNejad, Nima
    Shakibhamedan, Salar
    2022 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI (ISVLSI 2022), 2022, : 328 - 328
  • [2] Reconfigurable FET Approximate Computing-based Accelerator for Deep Learning Applications
    Saravanan, Raghul
    Bavikadi, Sathwika
    Rai, Shubham
    Kumar, Akash
    Dinakarrao, Sai Manoj Pudukotai
    2023 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS, 2023,
  • [3] A Survey on Hardware Accelerator Design of Deep Learning for Edge Devices
    Samanta, Anu
    Hatai, Indranil
    Mal, Ashis Kumar
    WIRELESS PERSONAL COMMUNICATIONS, 2024, 137 (03) : 1715 - 1760
  • [4] Hardware-oriented deep reinforcement learning for edge computing
    Yamagishi, Yoshiharu
    Kaneko, Tatsuya
    Akai-Kasaya, Megumi
    Asai, Tetsuya
    IEICE NONLINEAR THEORY AND ITS APPLICATIONS, 2021, 12 (03): : 526 - 544
  • [5] Low power AI hardware platform for deep learning in edge computing
    Ohbuchi, Eisaku
    2018 IEEE CPMT SYMPOSIUM JAPAN (ICSJ), 2018, : 89 - 90
  • [6] Design Environment of Quantization-Aware Edge AI Hardware for Few-Shot Learning
    Kanda, R.
    Onizawa, N.
    Leonardon, M.
    Gripon, V
    Hanyu, T.
    2024 IEEE 67TH INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, MWSCAS 2024, 2024, : 928 - 931
  • [7] Hardware Efficient Approximate Multiplier Architecture for Image Processing Applications
    Chandaka, Shravani
    Narayanam, Balaji
    JOURNAL OF ELECTRONIC TESTING-THEORY AND APPLICATIONS, 2022, 38 (02): : 217 - 230
  • [8] Hardware Efficient Approximate Multiplier Architecture for Image Processing Applications
    Shravani Chandaka
    Balaji Narayanam
    Journal of Electronic Testing, 2022, 38 : 217 - 230
  • [9] AVEC: Accelerator Virtualization in Cloud-Edge Computing for Deep Learning Libraries
    Kennedy, Jason
    Varghese, Blesson
    Reano, Carlos
    5TH IEEE INTERNATIONAL CONFERENCE ON FOG AND EDGE COMPUTING (ICFEC 2021), 2021, : 37 - 44
  • [10] Low Cost and Low Power Stacked Sparse Autoencoder Hardware Acceleration for Deep Learning Edge Computing Applications
    Belabed, Tarek
    Coutinho, Maria Gracielly F.
    Fernandes, Marcelo A. C.
    Carlos, Valderrama
    Souani, Chokri
    2020 5TH INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP'2020), 2020,