Quantization aware approximate multiplier and hardware accelerator for edge computing of deep learning applications

被引：6

作者：

Reddy, K. Manikantta ^{[1
]}

Vasantha, M. H. ^{[1
]}

Kumar, Y. B. Nithin ^{[1
]}

Gopal, Ch. Keshava ^{[2
]}

Dwivedi, Devesh ^{[1
]}

机构：

[1] Natl Inst Technol Goa, Dept Elect & Commun Engn, Ponda 403401, Goa, India

[2] Xilinx India Technol Serv Pvt Ltd, Syst Integrat & Validat Grp, Hyderabad 500032, India

来源：

INTEGRATION-THE VLSI JOURNAL | 2021年 / 81卷

关键词：

Approximate computing; Approximate multiplier; Hardware accelerator; Edge computing; Matrix multiplication; LOW-POWER; NEURAL-NETWORK; COMPRESSORS; DESIGN; ADDER;

D O I：

10.1016/j.vlsi.2021.08.001

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Approximate computing has emerged as an efficient design methodology for improving the performance and power-efficiency of digital systems by allowing a negligible loss in the output accuracy. Dedicated hardware accelerators built using approximate circuits can solve power-performance trade-off in the computationally complex applications like deep learning. This paper proposes an approximate radix-4 Booth multiplier and hardware accelerator for deploying deep learning applications on power-restricted mobile/edge computing devices. The proposed accelerator uses approximate multiplier based parallel processing elements to accelerate the workloads. The proposed accelerator is tested with matrix-vector multiplication (MVM) and matrix-matrix multiplication (MMM) workloads on Zynq ZCU102 evaluation board. The experimental results show that the average power consumption of the proposed accelerator reduces by 34% and 40% for MVM and MMM respectively, as compared to the conventional multiply-accumulate unit that was used in the literature to implement similar workloads. Moreover, the proposed accelerator achieved an average performance of 5 GOP/s and 42.5 GOP/s for MVM and MMM respectively at 275 MHz, which are 14x and 5x respective improvements over the conventional design.

引用

页码：268 / 279

页数：12

共 50 条

[31] Location Aware Workflow Migration Based on Deep Reinforcement Learning in Mobile Edge Computing
Gao, Yongqiang
Liu, Xiaolei
ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2021, PT I, 2022, 13155 : 509 - 528
[32] Deep Reinforcement Learning for QoS-Aware Package Caching in Serverless Edge Computing
Jeon, Hongseok
Shin, Seungjae
Cho, Chunglae
Yoon, Seunghyun
2021 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2021,
[33] Distributed Deep Reinforcement Learning-Based Gradient Quantization for Federated Learning Enabled Vehicle Edge Computing
Zhang, Cui
Zhang, Wenjun
Wu, Qiong
Fan, Pingyi
Fan, Qiang
Wang, Jiangzhou
Letaief, Khaled B.
IEEE INTERNET OF THINGS JOURNAL, 2025, 12 (05): : 4899 - 4913
[34] Deep Learning With Edge Computing: A Review
Chen, Jiasi
Ran, Xukan
PROCEEDINGS OF THE IEEE, 2019, 107 (08) : 1655 - 1674
[35] AI Models for Edge Computing: Hardware-aware Optimizations for Efficiency
Li, Hai ''Helen''
2024 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, DATE, 2024,
[36] Exploiting Approximate Computing for Deep Learning Acceleration
Chen, Chia-Yu
Choi, Jungwook
Gopalakrishnan, Kailash
Srinivasan, Viji
Venkataramani, Swagath
PROCEEDINGS OF THE 2018 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2018, : 821 - 826
[37] RCAM: Resource Constraint Approximate Multiplier Design for Deep Convolutional Neural Network Accelerator
Saleh O.S.
SN Computer Science, 4 (4)
[38] Editorial for special issue on "Edge computing accelerated deep learning: Technologies and applications"
Liu, Xiao
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2024, 36 (10):
[39] Learning IoT in Edge: Deep Learning for the Internet of Things with Edge Computing
Li, He
Ota, Kaoru
Dong, Mianxiong
IEEE NETWORK, 2018, 32 (01): : 96 - 101
[40] RASM: Resource-Aware Service Migration in Edge Computing based on Deep Reinforcement Learning
Mwasinga, Lusungu Josh
Le, Duc-Tai
Raza, Syed M.
Challa, Rajesh
Kim, Moonseong
Choo, Hyunseung
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2023, 182

← 1 2 3 4 5 →