Quantization aware approximate multiplier and hardware accelerator for edge computing of deep learning applications

被引:6
|
作者
Reddy, K. Manikantta [1 ]
Vasantha, M. H. [1 ]
Kumar, Y. B. Nithin [1 ]
Gopal, Ch. Keshava [2 ]
Dwivedi, Devesh [1 ]
机构
[1] Natl Inst Technol Goa, Dept Elect & Commun Engn, Ponda 403401, Goa, India
[2] Xilinx India Technol Serv Pvt Ltd, Syst Integrat & Validat Grp, Hyderabad 500032, India
关键词
Approximate computing; Approximate multiplier; Hardware accelerator; Edge computing; Matrix multiplication; LOW-POWER; NEURAL-NETWORK; COMPRESSORS; DESIGN; ADDER;
D O I
10.1016/j.vlsi.2021.08.001
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Approximate computing has emerged as an efficient design methodology for improving the performance and power-efficiency of digital systems by allowing a negligible loss in the output accuracy. Dedicated hardware accelerators built using approximate circuits can solve power-performance trade-off in the computationally complex applications like deep learning. This paper proposes an approximate radix-4 Booth multiplier and hardware accelerator for deploying deep learning applications on power-restricted mobile/edge computing devices. The proposed accelerator uses approximate multiplier based parallel processing elements to accelerate the workloads. The proposed accelerator is tested with matrix-vector multiplication (MVM) and matrix-matrix multiplication (MMM) workloads on Zynq ZCU102 evaluation board. The experimental results show that the average power consumption of the proposed accelerator reduces by 34% and 40% for MVM and MMM respectively, as compared to the conventional multiply-accumulate unit that was used in the literature to implement similar workloads. Moreover, the proposed accelerator achieved an average performance of 5 GOP/s and 42.5 GOP/s for MVM and MMM respectively at 275 MHz, which are 14x and 5x respective improvements over the conventional design.
引用
收藏
页码:268 / 279
页数:12
相关论文
共 50 条
  • [21] Deep Learning for Edge Computing Applications: A State-of-the-Art Survey
    Wang, Fangxin
    Zhang, Miao
    Wang, Xiangxiang
    Ma, Xiaoqiang
    Liu, Jiangchuan
    IEEE ACCESS, 2020, 8 : 58322 - 58336
  • [22] Characterizing Perception Deep Learning Algorithms and Applications for Vehicular Edge Computing
    Feng, Wang
    Tang, Sihai
    Wang, Shengze
    He, Ying
    Chen, Donger
    Yang, Qing
    Fu, Song
    ALGORITHMS, 2025, 18 (01)
  • [23] Reliability-Aware Personalized Deployment of Approximate Computation IoT Applications in Serverless Mobile Edge Computing
    Cao, Kun
    Chen, Mingsong
    Karnouskos, Stamatis
    Hu, Shiyan
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2025, 44 (02) : 430 - 443
  • [24] A New Hardware-Efficient VLSI-Architecture of GoogLeNet CNN-Model Based Hardware Accelerator for Edge Computing Applications
    Islam, Md. Najrul
    Shrestha, Rahul
    Chowdhury, Shubhajit Roy
    2022 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI (ISVLSI 2022), 2022, : 414 - 417
  • [25] Designing Deep Learning Hardware Accelerator and Efficiency Evaluation
    Qi, Zhi
    Chen, Weijian
    Naqvi, Rizwan Ali
    Siddique, Kamran
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [26] Designing Deep Learning Hardware Accelerator and Efficiency Evaluation
    Qi, Zhi
    Chen, Weijian
    Naqvi, Rizwan Ali
    Siddique, Kamran
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022
  • [27] Mobility-Aware Deep Reinforcement Learning with Glimpse Mobility Prediction in Edge Computing
    Wu, Chao-Lun
    Chiu, Te-Chuan
    Wang, Chih-Yu
    Pang, Ai-Chun
    ICC 2020 - 2020 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2020,
  • [28] Deep Reinforcement Learning for Social-Aware Edge Computing and Caching in Urban Informatics
    Zhang, Ke
    Cao, Jiayu
    Liu, Hong
    Maharjan, Sabita
    Zhang, Yan
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2020, 16 (08) : 5467 - 5477
  • [29] Deep Reinforcement Learning for Online Latency Aware Workload Offloading in Mobile Edge Computing
    Akhavan, Zeinab
    Esmaeili, Mona
    Badnava, Babak
    Yousefi, Mohammad
    Sun, Xiang
    Devetsikiotis, Michael
    Zarkesh-Ha, Payman
    2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 2218 - 2223
  • [30] Mobility-Aware Edge Caching and Computing in Vehicle Networks: A Deep Reinforcement Learning
    Le Thanh Tan
    Hu, Rose Qingyang
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2018, 67 (11) : 10190 - 10203