A Flexible Sparsity-Aware Accelerator with High Sensitivity and Efficient Operation for Convolutional Neural Networks

被引：1

作者：

Yuan, Haiying ^{[1
]}

Zeng, Zhiyong ^{[1
]}

Cheng, Junpeng ^{[1
]}

Li, Minghao ^{[1
]}

机构：

[1] Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China

来源：

CIRCUITS SYSTEMS AND SIGNAL PROCESSING | 2022年 / 41卷 / 08期

基金：

中国国家自然科学基金;

关键词：

Convolutional neural network; Sparsity perceptron; Parallel computing; FPGA accelerator;

D O I：

10.1007/s00034-022-01992-x

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In view of the technical challenge that convolutional neural networks involve in a large amount of computation caused by the information redundancy of the interlayer activation, a flexible sparsity-aware accelerator is proposed in this paper. It realizes the basic data transmission with coarse-grained control and realizes the transmission of sparse data with fine-grained control. In addition, the corresponding data arrangement scheme is designed to fully utilize the off-chip bandwidth. In order to improve the inference performance without accuracy reduction, the sparse activation is compressed to eliminate ineffectual activation while preserving topology information with the sparsity perceptron module. To improve power efficiency, the computational load is rationally allocated for multiplication accumulator array, and the convolution operation is decoupled by adder tree with FIFO. The accelerator is implemented on Xilinx VCU108, and 97.27% of the operations are non-zero activation operations. The accelerator running in sparsity mode is more than 2.5 times faster than that in density mode, and power consumption is reduced to 8.3 W. Furthermore, this flexible sparsity-aware accelerator architecture can be widely applied to large-scale deep convolutional neural networks.

引用

页码：4370 / 4389

页数：20

共 50 条

[21] HISPOC: A High-Performance Irregular Activation Sparsity-Aware Point Cloud Network Accelerator
Zhao, Pan
Chang, Liang
Zeng, Jiahao
Wu, Licheng
Zhou, Liang
Zhou, Jun
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2024, 71 (04) : 2294 - 2298
[22] TSUNAMI: Triple Sparsity-Aware Ultra Energy-Efficient Neural Network Training Accelerator With Multi-Modal Iterative Pruning
Kim, Sangyeob
Lee, Juhyoung
Kang, Sanghoon
Han, Donghyeon
Jo, Wooyoung
Yoo, Hoi-Jun
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2022, 69 (04) : 1494 - 1506
[23] Group sparsity-aware convolutional neural network for continuous missing data recovery of structural health monitoring
Tang, Zhiyi
Bao, Yuequan
Li, Hui
STRUCTURAL HEALTH MONITORING-AN INTERNATIONAL JOURNAL, 2021, 20 (04): : 1738 - 1759
[24] Sparsity-Aware Communication for Distributed Graph Neural Network Training
Mukhopadhyay, Ujjaini
Tripathy, Alok
Selvitopi, Oguz
Yelick, Katherine
Buluc, Aydin
53RD INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2024, 2024, : 117 - 126
[25] A Sparsity-aware QR Decomposition Algorithm for Efficient Cooperative Localization
Zhou, Ke X.
Roumeliotis, Stergios I.
2012 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2012, : 799 - 806
[26] SuperHCA: A Super-Resolution Accelerator with Sparsity-Aware Heterogeneous Core Architecture
Hu, Zhicheng
Zeng, Jiahao
Zhao, Xin
Zhou, Liang
Chang, Liang
2024 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS 2024, 2024,
[27] An Efficient Accelerator with Winograd for Novel Convolutional Neural Networks
Lin, Zhijian
Zhang, Meng
Weng, Dongpeng
Liu, Fei
2022 5TH INTERNATIONAL CONFERENCE ON CIRCUITS, SYSTEMS AND SIMULATION (ICCSS 2022), 2022, : 126 - 130
[28] A Power-efficient Accelerator for Convolutional Neural Networks
Sun, Fan
Wang, Chao
Gong, Lei
Xu, Chongchong
Zhang, Yiwei
Lu, Yuntao
Li, Xi
Zhou, Xuehai
2017 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2017, : 631 - 632
[29] An Efficient FIFO Based Accelerator for Convolutional Neural Networks
Panchbhaiyye, Vineet
Ogunfunmi, Tokunbo
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2021, 93 (10): : 1117 - 1129
[30] An Efficient FIFO Based Accelerator for Convolutional Neural Networks
Vineet Panchbhaiyye
Tokunbo Ogunfunmi
Journal of Signal Processing Systems, 2021, 93 : 1117 - 1129

← 1 2 3 4 5 →