A Flexible Sparsity-Aware Accelerator with High Sensitivity and Efficient Operation for Convolutional Neural Networks

被引:1
|
作者
Yuan, Haiying [1 ]
Zeng, Zhiyong [1 ]
Cheng, Junpeng [1 ]
Li, Minghao [1 ]
机构
[1] Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
基金
中国国家自然科学基金;
关键词
Convolutional neural network; Sparsity perceptron; Parallel computing; FPGA accelerator;
D O I
10.1007/s00034-022-01992-x
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In view of the technical challenge that convolutional neural networks involve in a large amount of computation caused by the information redundancy of the interlayer activation, a flexible sparsity-aware accelerator is proposed in this paper. It realizes the basic data transmission with coarse-grained control and realizes the transmission of sparse data with fine-grained control. In addition, the corresponding data arrangement scheme is designed to fully utilize the off-chip bandwidth. In order to improve the inference performance without accuracy reduction, the sparse activation is compressed to eliminate ineffectual activation while preserving topology information with the sparsity perceptron module. To improve power efficiency, the computational load is rationally allocated for multiplication accumulator array, and the convolution operation is decoupled by adder tree with FIFO. The accelerator is implemented on Xilinx VCU108, and 97.27% of the operations are non-zero activation operations. The accelerator running in sparsity mode is more than 2.5 times faster than that in density mode, and power consumption is reduced to 8.3 W. Furthermore, this flexible sparsity-aware accelerator architecture can be widely applied to large-scale deep convolutional neural networks.
引用
收藏
页码:4370 / 4389
页数:20
相关论文
共 50 条
  • [21] HISPOC: A High-Performance Irregular Activation Sparsity-Aware Point Cloud Network Accelerator
    Zhao, Pan
    Chang, Liang
    Zeng, Jiahao
    Wu, Licheng
    Zhou, Liang
    Zhou, Jun
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2024, 71 (04) : 2294 - 2298
  • [22] TSUNAMI: Triple Sparsity-Aware Ultra Energy-Efficient Neural Network Training Accelerator With Multi-Modal Iterative Pruning
    Kim, Sangyeob
    Lee, Juhyoung
    Kang, Sanghoon
    Han, Donghyeon
    Jo, Wooyoung
    Yoo, Hoi-Jun
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2022, 69 (04) : 1494 - 1506
  • [23] Group sparsity-aware convolutional neural network for continuous missing data recovery of structural health monitoring
    Tang, Zhiyi
    Bao, Yuequan
    Li, Hui
    STRUCTURAL HEALTH MONITORING-AN INTERNATIONAL JOURNAL, 2021, 20 (04): : 1738 - 1759
  • [24] Sparsity-Aware Communication for Distributed Graph Neural Network Training
    Mukhopadhyay, Ujjaini
    Tripathy, Alok
    Selvitopi, Oguz
    Yelick, Katherine
    Buluc, Aydin
    53RD INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2024, 2024, : 117 - 126
  • [25] A Sparsity-aware QR Decomposition Algorithm for Efficient Cooperative Localization
    Zhou, Ke X.
    Roumeliotis, Stergios I.
    2012 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2012, : 799 - 806
  • [26] SuperHCA: A Super-Resolution Accelerator with Sparsity-Aware Heterogeneous Core Architecture
    Hu, Zhicheng
    Zeng, Jiahao
    Zhao, Xin
    Zhou, Liang
    Chang, Liang
    2024 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS 2024, 2024,
  • [27] An Efficient Accelerator with Winograd for Novel Convolutional Neural Networks
    Lin, Zhijian
    Zhang, Meng
    Weng, Dongpeng
    Liu, Fei
    2022 5TH INTERNATIONAL CONFERENCE ON CIRCUITS, SYSTEMS AND SIMULATION (ICCSS 2022), 2022, : 126 - 130
  • [28] A Power-efficient Accelerator for Convolutional Neural Networks
    Sun, Fan
    Wang, Chao
    Gong, Lei
    Xu, Chongchong
    Zhang, Yiwei
    Lu, Yuntao
    Li, Xi
    Zhou, Xuehai
    2017 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2017, : 631 - 632
  • [29] An Efficient FIFO Based Accelerator for Convolutional Neural Networks
    Panchbhaiyye, Vineet
    Ogunfunmi, Tokunbo
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2021, 93 (10): : 1117 - 1129
  • [30] An Efficient FIFO Based Accelerator for Convolutional Neural Networks
    Vineet Panchbhaiyye
    Tokunbo Ogunfunmi
    Journal of Signal Processing Systems, 2021, 93 : 1117 - 1129