Quantization aware approximate multiplier and hardware accelerator for edge computing of deep learning applications

被引：6

作者：

Reddy, K. Manikantta ^{[1
]}

Vasantha, M. H. ^{[1
]}

Kumar, Y. B. Nithin ^{[1
]}

Gopal, Ch. Keshava ^{[2
]}

Dwivedi, Devesh ^{[1
]}

机构：

[1] Natl Inst Technol Goa, Dept Elect & Commun Engn, Ponda 403401, Goa, India

[2] Xilinx India Technol Serv Pvt Ltd, Syst Integrat & Validat Grp, Hyderabad 500032, India

来源：

INTEGRATION-THE VLSI JOURNAL | 2021年 / 81卷

关键词：

Approximate computing; Approximate multiplier; Hardware accelerator; Edge computing; Matrix multiplication; LOW-POWER; NEURAL-NETWORK; COMPRESSORS; DESIGN; ADDER;

D O I：

10.1016/j.vlsi.2021.08.001

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Approximate computing has emerged as an efficient design methodology for improving the performance and power-efficiency of digital systems by allowing a negligible loss in the output accuracy. Dedicated hardware accelerators built using approximate circuits can solve power-performance trade-off in the computationally complex applications like deep learning. This paper proposes an approximate radix-4 Booth multiplier and hardware accelerator for deploying deep learning applications on power-restricted mobile/edge computing devices. The proposed accelerator uses approximate multiplier based parallel processing elements to accelerate the workloads. The proposed accelerator is tested with matrix-vector multiplication (MVM) and matrix-matrix multiplication (MMM) workloads on Zynq ZCU102 evaluation board. The experimental results show that the average power consumption of the proposed accelerator reduces by 34% and 40% for MVM and MMM respectively, as compared to the conventional multiply-accumulate unit that was used in the literature to implement similar workloads. Moreover, the proposed accelerator achieved an average performance of 5 GOP/s and 42.5 GOP/s for MVM and MMM respectively at 275 MHz, which are 14x and 5x respective improvements over the conventional design.

引用

页码：268 / 279

页数：12

共 50 条

[21] Deep Learning for Edge Computing Applications: A State-of-the-Art Survey
Wang, Fangxin
Zhang, Miao
Wang, Xiangxiang
Ma, Xiaoqiang
Liu, Jiangchuan
IEEE ACCESS, 2020, 8 : 58322 - 58336
[22] Characterizing Perception Deep Learning Algorithms and Applications for Vehicular Edge Computing
Feng, Wang
Tang, Sihai
Wang, Shengze
He, Ying
Chen, Donger
Yang, Qing
Fu, Song
ALGORITHMS, 2025, 18 (01)
[23] Reliability-Aware Personalized Deployment of Approximate Computation IoT Applications in Serverless Mobile Edge Computing
Cao, Kun
Chen, Mingsong
Karnouskos, Stamatis
Hu, Shiyan
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2025, 44 (02) : 430 - 443
[24] A New Hardware-Efficient VLSI-Architecture of GoogLeNet CNN-Model Based Hardware Accelerator for Edge Computing Applications
Islam, Md. Najrul
Shrestha, Rahul
Chowdhury, Shubhajit Roy
2022 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI (ISVLSI 2022), 2022, : 414 - 417
[25] Designing Deep Learning Hardware Accelerator and Efficiency Evaluation
Qi, Zhi
Chen, Weijian
Naqvi, Rizwan Ali
Siddique, Kamran
COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
[26] Designing Deep Learning Hardware Accelerator and Efficiency Evaluation
Qi, Zhi
Chen, Weijian
Naqvi, Rizwan Ali
Siddique, Kamran
WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022
[27] Mobility-Aware Deep Reinforcement Learning with Glimpse Mobility Prediction in Edge Computing
Wu, Chao-Lun
Chiu, Te-Chuan
Wang, Chih-Yu
Pang, Ai-Chun
ICC 2020 - 2020 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2020,
[28] Deep Reinforcement Learning for Social-Aware Edge Computing and Caching in Urban Informatics
Zhang, Ke
Cao, Jiayu
Liu, Hong
Maharjan, Sabita
Zhang, Yan
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2020, 16 (08) : 5467 - 5477
[29] Deep Reinforcement Learning for Online Latency Aware Workload Offloading in Mobile Edge Computing
Akhavan, Zeinab
Esmaeili, Mona
Badnava, Babak
Yousefi, Mohammad
Sun, Xiang
Devetsikiotis, Michael
Zarkesh-Ha, Payman
2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 2218 - 2223
[30] Mobility-Aware Edge Caching and Computing in Vehicle Networks: A Deep Reinforcement Learning
Le Thanh Tan
Hu, Rose Qingyang
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2018, 67 (11) : 10190 - 10203

← 1 2 3 4 5 →