Quantization aware approximate multiplier and hardware accelerator for edge computing of deep learning applications

被引:6
|
作者
Reddy, K. Manikantta [1 ]
Vasantha, M. H. [1 ]
Kumar, Y. B. Nithin [1 ]
Gopal, Ch. Keshava [2 ]
Dwivedi, Devesh [1 ]
机构
[1] Natl Inst Technol Goa, Dept Elect & Commun Engn, Ponda 403401, Goa, India
[2] Xilinx India Technol Serv Pvt Ltd, Syst Integrat & Validat Grp, Hyderabad 500032, India
关键词
Approximate computing; Approximate multiplier; Hardware accelerator; Edge computing; Matrix multiplication; LOW-POWER; NEURAL-NETWORK; COMPRESSORS; DESIGN; ADDER;
D O I
10.1016/j.vlsi.2021.08.001
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Approximate computing has emerged as an efficient design methodology for improving the performance and power-efficiency of digital systems by allowing a negligible loss in the output accuracy. Dedicated hardware accelerators built using approximate circuits can solve power-performance trade-off in the computationally complex applications like deep learning. This paper proposes an approximate radix-4 Booth multiplier and hardware accelerator for deploying deep learning applications on power-restricted mobile/edge computing devices. The proposed accelerator uses approximate multiplier based parallel processing elements to accelerate the workloads. The proposed accelerator is tested with matrix-vector multiplication (MVM) and matrix-matrix multiplication (MMM) workloads on Zynq ZCU102 evaluation board. The experimental results show that the average power consumption of the proposed accelerator reduces by 34% and 40% for MVM and MMM respectively, as compared to the conventional multiply-accumulate unit that was used in the literature to implement similar workloads. Moreover, the proposed accelerator achieved an average performance of 5 GOP/s and 42.5 GOP/s for MVM and MMM respectively at 275 MHz, which are 14x and 5x respective improvements over the conventional design.
引用
收藏
页码:268 / 279
页数:12
相关论文
共 50 条
  • [31] Location Aware Workflow Migration Based on Deep Reinforcement Learning in Mobile Edge Computing
    Gao, Yongqiang
    Liu, Xiaolei
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2021, PT I, 2022, 13155 : 509 - 528
  • [32] Deep Reinforcement Learning for QoS-Aware Package Caching in Serverless Edge Computing
    Jeon, Hongseok
    Shin, Seungjae
    Cho, Chunglae
    Yoon, Seunghyun
    2021 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2021,
  • [33] Distributed Deep Reinforcement Learning-Based Gradient Quantization for Federated Learning Enabled Vehicle Edge Computing
    Zhang, Cui
    Zhang, Wenjun
    Wu, Qiong
    Fan, Pingyi
    Fan, Qiang
    Wang, Jiangzhou
    Letaief, Khaled B.
    IEEE INTERNET OF THINGS JOURNAL, 2025, 12 (05): : 4899 - 4913
  • [34] Deep Learning With Edge Computing: A Review
    Chen, Jiasi
    Ran, Xukan
    PROCEEDINGS OF THE IEEE, 2019, 107 (08) : 1655 - 1674
  • [35] AI Models for Edge Computing: Hardware-aware Optimizations for Efficiency
    Li, Hai ''Helen''
    2024 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, DATE, 2024,
  • [36] Exploiting Approximate Computing for Deep Learning Acceleration
    Chen, Chia-Yu
    Choi, Jungwook
    Gopalakrishnan, Kailash
    Srinivasan, Viji
    Venkataramani, Swagath
    PROCEEDINGS OF THE 2018 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2018, : 821 - 826
  • [37] RCAM: Resource Constraint Approximate Multiplier Design for Deep Convolutional Neural Network Accelerator
    Saleh O.S.
    SN Computer Science, 4 (4)
  • [38] Editorial for special issue on "Edge computing accelerated deep learning: Technologies and applications"
    Liu, Xiao
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2024, 36 (10):
  • [39] Learning IoT in Edge: Deep Learning for the Internet of Things with Edge Computing
    Li, He
    Ota, Kaoru
    Dong, Mianxiong
    IEEE NETWORK, 2018, 32 (01): : 96 - 101
  • [40] RASM: Resource-Aware Service Migration in Edge Computing based on Deep Reinforcement Learning
    Mwasinga, Lusungu Josh
    Le, Duc-Tai
    Raza, Syed M.
    Challa, Rajesh
    Kim, Moonseong
    Choo, Hyunseung
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2023, 182