Efficient Hardware Accelerator for Compressed Sparse Deep Neural Network

被引:3
|
作者
Xiao, Hao [1 ]
Zhao, Kaikai [1 ]
Liu, Guangzhu [1 ]
机构
[1] HeFei Univ Technol, Sch Microelect, Hefei, Peoples R China
基金
中国国家自然科学基金;
关键词
deep neural networks; filed programmable gate array; run-length compression; sparse data;
D O I
10.1587/transinf.2020EDL8153
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This work presents a DNN accelerator architecture specifically designed for performing efficient inference on compressed and sparse DNN models. Leveraging the data sparsity, a runtime processing scheme is proposed to deal with the encoded weights and activations directly in the compressed domain without decompressing. Furthermore, a new data flow is proposed to facilitate the reusage of input activations across the fully-connected (FC) layers. The proposed design is implemented and verified using the Xilinx Virtex-7 FPGA. Experimental results show it achieves 1.99x, 1.95x faster and 20.38x, 3.04x more energy efficient than CPU and mGPU platforms, respectively, running AlexNet.
引用
收藏
页码:772 / 775
页数:4
相关论文
共 50 条
  • [41] A sparse convolution neural network accelerator with bandwidth-efficient data loopback structure
    Yuan, Haiying
    Zeng, Zhiyong
    MICROPROCESSORS AND MICROSYSTEMS, 2023, 98
  • [42] An Efficient Unstructured Sparse Convolutional Neural Network Accelerator for Wearable ECG Classification Device
    Lu, Jiahao
    Liu, Dongsheng
    Cheng, Xuan
    Wei, Lai
    Hu, Ang
    Zou, Xuecheng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2022, 69 (11) : 4572 - 4582
  • [43] A Fast Compressed Hardware Architecture for Deep Neural Networks
    Ansari, Anaam
    Shelton, Allen
    Ogunfunmi, Tokunbo
    Panchbhaiyye, Vineet
    2022 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS 22), 2022, : 370 - 374
  • [44] An Efficient Reconfigurable Hardware Accelerator for Convolutional Neural Networks
    Ansari, Anaam
    Gunnam, Kiran
    Ogunfunmi, Tokunbo
    2017 FIFTY-FIRST ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2017, : 1337 - 1341
  • [45] Snowflake: An Efficient Hardware Accelerator for Convolutional Neural Networks
    Gokhale, Vinayak
    Zaidy, Aliasger
    Chang, Andre Xian Ming
    Culurciello, Eugenio
    2017 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2017, : 2082 - 2085
  • [46] Hardware Accelerator Supporting Inhibitory Spiking Neural Network
    Qian, Ping
    Han, Rui
    Xie, Lingdong
    Luo, Wang
    Xu, Huarong
    Li, Songsong
    Zheng, Zhendong
    Computer Engineering and Applications, 2024, 60 (08) : 338 - 347
  • [47] Sequence Triggered Hardware Trojan in Neural Network Accelerator
    Liu, Zizhen
    Ye, Jing
    Hu, Xing
    Li, Huawei
    Li, Xiaowei
    Hu, Yu
    2020 IEEE 38TH VLSI TEST SYMPOSIUM (VTS 2020), 2020,
  • [48] Reliability Analysis of a Spiking Neural Network Hardware Accelerator
    Spyrou, Theofilos
    El-Sayed, Sarah A.
    Afacan, Engin
    Camunas-Mesa, Luis A.
    Linares-Barranco, Bernabe
    Stratigopoulos, Haralampos-G
    PROCEEDINGS OF THE 2022 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2022), 2022, : 370 - 375
  • [49] An efficient deep neural network accelerator using controlled ferroelectric domain dynamics
    Majumdar, Sayani
    NEUROMORPHIC COMPUTING AND ENGINEERING, 2022, 2 (04):
  • [50] RNSiM: Efficient Deep Neural Network Accelerator Using Residue Number Systems
    Roohi, Arman
    Taheri, MohammadReza
    Angizi, Shaahin
    Fan, Deliang
    2021 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED DESIGN (ICCAD), 2021,