Quantization aware approximate multiplier and hardware accelerator for edge computing of deep learning applications

被引：6

作者：

Reddy, K. Manikantta ^{[1
]}

Vasantha, M. H. ^{[1
]}

Kumar, Y. B. Nithin ^{[1
]}

Gopal, Ch. Keshava ^{[2
]}

Dwivedi, Devesh ^{[1
]}

机构：

[1] Natl Inst Technol Goa, Dept Elect & Commun Engn, Ponda 403401, Goa, India

[2] Xilinx India Technol Serv Pvt Ltd, Syst Integrat & Validat Grp, Hyderabad 500032, India

来源：

INTEGRATION-THE VLSI JOURNAL | 2021年 / 81卷

关键词：

Approximate computing; Approximate multiplier; Hardware accelerator; Edge computing; Matrix multiplication; LOW-POWER; NEURAL-NETWORK; COMPRESSORS; DESIGN; ADDER;

D O I：

10.1016/j.vlsi.2021.08.001

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Approximate computing has emerged as an efficient design methodology for improving the performance and power-efficiency of digital systems by allowing a negligible loss in the output accuracy. Dedicated hardware accelerators built using approximate circuits can solve power-performance trade-off in the computationally complex applications like deep learning. This paper proposes an approximate radix-4 Booth multiplier and hardware accelerator for deploying deep learning applications on power-restricted mobile/edge computing devices. The proposed accelerator uses approximate multiplier based parallel processing elements to accelerate the workloads. The proposed accelerator is tested with matrix-vector multiplication (MVM) and matrix-matrix multiplication (MMM) workloads on Zynq ZCU102 evaluation board. The experimental results show that the average power consumption of the proposed accelerator reduces by 34% and 40% for MVM and MMM respectively, as compared to the conventional multiply-accumulate unit that was used in the literature to implement similar workloads. Moreover, the proposed accelerator achieved an average performance of 5 GOP/s and 42.5 GOP/s for MVM and MMM respectively at 275 MHz, which are 14x and 5x respective improvements over the conventional design.

引用

页码：268 / 279

页数：12

共 50 条

[1] Energy-aware Adaptive Approximate Computing for Deep Learning Applications
TaheriNejad, Nima
Shakibhamedan, Salar
2022 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI (ISVLSI 2022), 2022, : 328 - 328
[2] Reconfigurable FET Approximate Computing-based Accelerator for Deep Learning Applications
Saravanan, Raghul
Bavikadi, Sathwika
Rai, Shubham
Kumar, Akash
Dinakarrao, Sai Manoj Pudukotai
2023 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS, 2023,
[3] A Survey on Hardware Accelerator Design of Deep Learning for Edge Devices
Samanta, Anu
Hatai, Indranil
Mal, Ashis Kumar
WIRELESS PERSONAL COMMUNICATIONS, 2024, 137 (03) : 1715 - 1760
[4] Hardware-oriented deep reinforcement learning for edge computing
Yamagishi, Yoshiharu
Kaneko, Tatsuya
Akai-Kasaya, Megumi
Asai, Tetsuya
IEICE NONLINEAR THEORY AND ITS APPLICATIONS, 2021, 12 (03): : 526 - 544
[5] Low power AI hardware platform for deep learning in edge computing
Ohbuchi, Eisaku
2018 IEEE CPMT SYMPOSIUM JAPAN (ICSJ), 2018, : 89 - 90
[6] Design Environment of Quantization-Aware Edge AI Hardware for Few-Shot Learning
Kanda, R.
Onizawa, N.
Leonardon, M.
Gripon, V
Hanyu, T.
2024 IEEE 67TH INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, MWSCAS 2024, 2024, : 928 - 931
[7] Hardware Efficient Approximate Multiplier Architecture for Image Processing Applications
Chandaka, Shravani
Narayanam, Balaji
JOURNAL OF ELECTRONIC TESTING-THEORY AND APPLICATIONS, 2022, 38 (02): : 217 - 230
[8] Hardware Efficient Approximate Multiplier Architecture for Image Processing Applications
Shravani Chandaka
Balaji Narayanam
Journal of Electronic Testing, 2022, 38 : 217 - 230
[9] AVEC: Accelerator Virtualization in Cloud-Edge Computing for Deep Learning Libraries
Kennedy, Jason
Varghese, Blesson
Reano, Carlos
5TH IEEE INTERNATIONAL CONFERENCE ON FOG AND EDGE COMPUTING (ICFEC 2021), 2021, : 37 - 44
[10] Low Cost and Low Power Stacked Sparse Autoencoder Hardware Acceleration for Deep Learning Edge Computing Applications
Belabed, Tarek
Coutinho, Maria Gracielly F.
Fernandes, Marcelo A. C.
Carlos, Valderrama
Souani, Chokri
2020 5TH INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP'2020), 2020,

← 1 2 3 4 5 →