Efficient Hardware Accelerator for Compressed Sparse Deep Neural Network

被引：3

作者：

Xiao, Hao ^{[1
]}

Zhao, Kaikai ^{[1
]}

Liu, Guangzhu ^{[1
]}

机构：

[1] HeFei Univ Technol, Sch Microelect, Hefei, Peoples R China

来源：

IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS | 2021年 / E104D卷 / 05期

基金：

中国国家自然科学基金;

关键词：

deep neural networks; filed programmable gate array; run-length compression; sparse data;

D O I：

10.1587/transinf.2020EDL8153

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This work presents a DNN accelerator architecture specifically designed for performing efficient inference on compressed and sparse DNN models. Leveraging the data sparsity, a runtime processing scheme is proposed to deal with the encoded weights and activations directly in the compressed domain without decompressing. Furthermore, a new data flow is proposed to facilitate the reusage of input activations across the fully-connected (FC) layers. The proposed design is implemented and verified using the Xilinx Virtex-7 FPGA. Experimental results show it achieves 1.99x, 1.95x faster and 20.38x, 3.04x more energy efficient than CPU and mGPU platforms, respectively, running AlexNet.

引用

页码：772 / 775

页数：4

共 50 条

[31] DEEP NEURAL NETWORK BASED SPARSE MEASUREMENT MATRIX FOR IMAGE COMPRESSED SENSING
Cui, Wenxue
Jiang, Feng
Gao, Xinwei
Tao, Wen
Zhao, Debin
2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 3883 - 3887
[32] SCNN: An Accelerator for Compressed-sparse Convolutional Neural Networks
Parashar, Angshuman
Rhu, Minsoo
Mukkara, Anurag
Puglielli, Antonio
Venkatesan, Rangharajan
Khailany, Brucek
Emer, Joel
Keckler, Stephen W.
Dally, William J.
44TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA 2017), 2017, : 27 - 40
[33] Deep Compression and EIE: Efficient Inference Engine on Compressed Deep Neural Network
Han, Song
Liu, Xingyu
Mao, Huizi
Pu, Jing
Pedram, Ardavan
Horowitz, Mark
Daily, Bill
2016 IEEE HOT CHIPS 28 SYMPOSIUM (HCS), 2016,
[34] An Efficient Accelerator for Sparse Convolutional Neural Networks
You, Weijie
Wu, Chang
2019 IEEE 13TH INTERNATIONAL CONFERENCE ON ASIC (ASICON), 2019,
[35] Design of artificial neural network hardware accelerator
Kuznar, Damian
Szczygiel, Robert
Maj, Piotr
Koziol, Anna
JOURNAL OF INSTRUMENTATION, 2023, 18 (04):
[36] SPARCNet: A Hardware Accelerator for Efficient Deployment of Sparse Convolutional Networks
Page, Adam
Jafari, Ali
Shea, Colin
Mohsenin, Tinoosh
ACM JOURNAL ON EMERGING TECHNOLOGIES IN COMPUTING SYSTEMS, 2017, 13 (03)
[37] An Efficient Deep Learning Accelerator for Compressed Video Analysis
Wang, Yongchen
Wang, Ying
Li, Huawei
Han, Yinhe
Li, Xiaowei
PROCEEDINGS OF THE 2020 57TH ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2020,
[38] An Efficient Optical-Based Binary Neural Network Hardware Accelerator for Harsh Environments
Jahannia, Belal
Ye, Jiachi
Altaleb, Salem
Patil, Chandraman
Heidari, Elham
Dalir, Hamed
SILICON PHOTONICS XIX, 2024, 12891
[39] Structured Sparse Ternary Weight Coding of Deep Neural Networks for Efficient Hardware Implementations
Boo, Yoonho
Sung, Wonyong
2017 IEEE INTERNATIONAL WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS), 2017,
[40] Energy-Efficient Bit-Sparse Accelerator Design for Convolutional Neural Network
Xiao, Hang
Xu, Haobo
Wang, Ying
Li, Jiajun
Wang, Yujie
Han, Yinhe
Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2023, 35 (07): : 1122 - 1131

← 1 2 3 4 5 →