Efficient Hardware Accelerator for Compressed Sparse Deep Neural Network

Cited by: 3
Authors
Xiao, Hao [1]
Zhao, Kaikai [1]
Liu, Guangzhu [1]
Affiliations
[1] Hefei University of Technology, School of Microelectronics, Hefei, People's Republic of China
Funding
National Natural Science Foundation of China
Keywords
deep neural networks; field programmable gate array; run-length compression; sparse data
DOI
10.1587/transinf.2020EDL8153
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This work presents a DNN accelerator architecture designed specifically for efficient inference on compressed, sparse DNN models. Exploiting data sparsity, a runtime processing scheme is proposed that operates on the encoded weights and activations directly in the compressed domain, without decompression. Furthermore, a new data flow is proposed to facilitate the reuse of input activations across the fully-connected (FC) layers. The proposed design is implemented and verified on a Xilinx Virtex-7 FPGA. Experimental results show that, running AlexNet, it is 1.99x and 1.95x faster and 20.38x and 3.04x more energy efficient than the CPU and mGPU platforms, respectively.
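To make the idea of computing in the compressed domain concrete, the following is a minimal Python sketch, not the paper's hardware design. It assumes a simple zero run-length format in which each nonzero value is stored with the count of zeros preceding it; the names `rle_encode` and `rle_dot` and this exact encoding are illustrative assumptions, since the abstract does not specify the format used.

```python
from typing import List, Sequence, Tuple

# Hypothetical encoding: (number of zeros before the value, nonzero value).
Encoded = List[Tuple[int, int]]


def rle_encode(dense: Sequence[int]) -> Encoded:
    """Zero run-length encode a vector; trailing zeros are dropped."""
    encoded: Encoded = []
    run = 0
    for x in dense:
        if x == 0:
            run += 1
        else:
            encoded.append((run, x))
            run = 0
    return encoded


def rle_dot(a: Encoded, b: Encoded) -> int:
    """Dot product of two encoded vectors without expanding either one.

    Both streams are walked in step; a multiply-accumulate happens only
    when the nonzero entries fall on the same index.
    """
    if not a or not b:
        return 0
    i = j = 0
    pos_a, pos_b = a[0][0], b[0][0]  # index of the current nonzero in each stream
    total = 0
    while True:
        if pos_a == pos_b:
            total += a[i][1] * b[j][1]
            i += 1
            j += 1
            if i == len(a) or j == len(b):
                break
            pos_a += a[i][0] + 1
            pos_b += b[j][0] + 1
        elif pos_a < pos_b:
            i += 1
            if i == len(a):
                break
            pos_a += a[i][0] + 1
        else:
            j += 1
            if j == len(b):
                break
            pos_b += b[j][0] + 1
    return total


weights = [0, 0, 3, 0, 5, 0, 0, 2]
activations = [1, 0, 4, 0, 0, 0, 7, 2]
result = rle_dot(rle_encode(weights), rle_encode(activations))
```

Only index pairs where both operands are nonzero contribute a multiplication, which is the property a sparse accelerator exploits to skip work.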
Pages: 772-775
Page count: 4
Related papers
50 in total
  • [31] DEEP NEURAL NETWORK BASED SPARSE MEASUREMENT MATRIX FOR IMAGE COMPRESSED SENSING
    Cui, Wenxue
    Jiang, Feng
    Gao, Xinwei
    Tao, Wen
    Zhao, Debin
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018: 3883-3887
  • [32] SCNN: An Accelerator for Compressed-sparse Convolutional Neural Networks
    Parashar, Angshuman
    Rhu, Minsoo
    Mukkara, Anurag
    Puglielli, Antonio
    Venkatesan, Rangharajan
    Khailany, Brucek
    Emer, Joel
    Keckler, Stephen W.
    Dally, William J.
    44TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA 2017), 2017: 27-40
  • [33] Deep Compression and EIE: Efficient Inference Engine on Compressed Deep Neural Network
    Han, Song
    Liu, Xingyu
    Mao, Huizi
    Pu, Jing
    Pedram, Ardavan
    Horowitz, Mark
    Dally, Bill
    2016 IEEE HOT CHIPS 28 SYMPOSIUM (HCS), 2016
  • [34] An Efficient Accelerator for Sparse Convolutional Neural Networks
    You, Weijie
    Wu, Chang
    2019 IEEE 13TH INTERNATIONAL CONFERENCE ON ASIC (ASICON), 2019
  • [35] Design of artificial neural network hardware accelerator
    Kuznar, Damian
    Szczygiel, Robert
    Maj, Piotr
    Koziol, Anna
    JOURNAL OF INSTRUMENTATION, 2023, 18 (04)
  • [36] SPARCNet: A Hardware Accelerator for Efficient Deployment of Sparse Convolutional Networks
    Page, Adam
    Jafari, Ali
    Shea, Colin
    Mohsenin, Tinoosh
    ACM JOURNAL ON EMERGING TECHNOLOGIES IN COMPUTING SYSTEMS, 2017, 13 (03)
  • [37] An Efficient Deep Learning Accelerator for Compressed Video Analysis
    Wang, Yongchen
    Wang, Ying
    Li, Huawei
    Han, Yinhe
    Li, Xiaowei
    PROCEEDINGS OF THE 2020 57TH ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2020
  • [38] An Efficient Optical-Based Binary Neural Network Hardware Accelerator for Harsh Environments
    Jahannia, Belal
    Ye, Jiachi
    Altaleb, Salem
    Patil, Chandraman
    Heidari, Elham
    Dalir, Hamed
    SILICON PHOTONICS XIX, 2024, 12891
  • [39] Structured Sparse Ternary Weight Coding of Deep Neural Networks for Efficient Hardware Implementations
    Boo, Yoonho
    Sung, Wonyong
    2017 IEEE INTERNATIONAL WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS), 2017
  • [40] Energy-Efficient Bit-Sparse Accelerator Design for Convolutional Neural Network
    Xiao, Hang
    Xu, Haobo
    Wang, Ying
    Li, Jiajun
    Wang, Yujie
    Han, Yinhe
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2023, 35 (07): 1122-1131