PACENet: Energy Efficient Acceleration for Convolutional Network on Embedded Platform

被引:0
|
作者
Kulkarni, Adwaya [1 ]
Abtahi, Tahmid [1 ]
Shea, Colin [1 ]
Kulkarni, Amey [1 ]
Mohsenin, Tinoosh [1 ]
机构
[1] Univ Maryland Baltimore Cty, Dept Comp Sci & Elect Engn, Baltimore, MD 21228 USA
基金
美国国家科学基金会;
关键词
Energy efficient; Domain-specific many-core; Convolutional Neural Network (CNN); Accelerator; Machine Learning;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Lightweight convolutional neural network (CNN) on tiny embedded platforms can offer energy efficient solution for today's IoT devices. However, CNN implementation on embedded system faces processing bottleneck in convolutional layers and memory storage issues in fully connected layers. In past years, heterogeneous acceleration, where compute intensive tasks are performed on kernel specific cores, has gained attention. In this paper we propose, a domain specific and programmable accelerator "PACENet" Programmable many-core ACcElerator for convolution neural Network architecture. It consists of neural network kernel specific instruction set architecture such as convolution, maxpool and relu. To demonstrate efficiency of the proposed PACENet, we implemented ResNet-20 for CIFAR-10 dataset, where PACENet performs convolution layer, Relu activations, Maxpool layer, and fully-connected layer. We also implemented ResNet-20 for CIFAR-10 dataset on NVIDIA TX1 mobile GPU platform using Tensorflow and cuDNN libraries. Compared to NVIDIA TX1 platform implementation PACENet platform implementation performs 1.4x to 4.5x faster and saves 2.8x to 9x energy consumption respectively. PACENet achieves 2.9x to 9.3x higher throughput per watt as compared to TX1 platform implementation
引用
收藏
页码:448 / 451
页数:4
相关论文
共 50 条
  • [21] Efficient Neural Network Representations for Energy Data Analytics on Embedded Systems
    Bouchur, Mazen
    Reinhardt, Andreas
    PROCEEDINGS OF THE 2022 THE THIRTEENTH ACM INTERNATIONAL CONFERENCE ON FUTURE ENERGY SYSTEMS, E-ENERGY 2022, 2022, : 81 - 92
  • [22] Implementation of energy efficient circuit design using A* algorithm in embedded network
    Anil, G. L.
    Iqbal, J. L. Mazher
    MICROPROCESSORS AND MICROSYSTEMS, 2020, 74
  • [23] A Survey on Efficient Convolutional Neural Networks and Hardware Acceleration
    Ghimire, Deepak
    Kil, Dayoung
    Kim, Seong-heum
    ELECTRONICS, 2022, 11 (06)
  • [24] An Energy-Efficient Embedded System Platform for Energy-Critical Real-Time Tasks
    Wu, Jun
    Wang, Jia-Lin
    ENGINEERING LETTERS, 2023, 31 (01) : 105 - 112
  • [25] CNNParted: An open source framework for efficient Convolutional Neural Network inference partitioning in embedded systems
    Kress, Fabian
    Sidorenko, Vladimir
    Schmidt, Patrick
    Hoefer, Julian
    Hotfilter, Tim
    Walter, Iris
    Harbaum, Tanja
    Becker, Jurgen
    COMPUTER NETWORKS, 2023, 229
  • [26] Towards Efficient Convolutional Neural Network for Embedded Hardware via Multi-Dimensional Pruning
    Kong, Hao
    Liu, Di
    Luo, Xiangzhong
    Huai, Shuo
    Subramaniam, Ravi
    Makaya, Christian
    Lin, Qian
    Liu, Weichen
    2023 60TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC, 2023,
  • [27] Real-Time Embedded Traffic Sign Recognition Using Efficient Convolutional Neural Network
    Xie Bangquan
    Xiong, Weng Xiao
    IEEE ACCESS, 2019, 7 : 53330 - 53346
  • [28] Research on Speech Recognition Acceleration Technology Based on Embedded Platform
    Wu, Da
    Ding, Ling
    Deng, Shuwen
    Lu, Shejie
    PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 3663 - 3668
  • [29] Cost and energy efficient reconfigurable embedded platform using Spartan-6 FPGAs
    Otero, A.
    LLinas, M.
    Lombardo, Miguel L.
    Portilla, Jorge
    de la Torre, E.
    Riesgo, T.
    VLSI CIRCUITS AND SYSTEMS V, 2011, 8067
  • [30] In-network Service Acceleration Platform (ISAP)
    Hayashi, Kentaro
    Hirai, Shiku
    Matsukawa, Tatsuya
    Baba, Hiroki
    NTT Technical Review, 2024, 22 (12): : 26 - 32