High Utilization Energy-Aware Real-Time Inference Deep Convolutional Neural Network Accelerator

被引:4
|
作者
Lin, Kuan-Ting [1 ]
Chiu, Ching-Te [1 ]
Chang, Jheng-Yi [2 ]
Hsiao, Shan-Chien [1 ]
机构
[1] Natl Tsing Hua Univ, Dept Comp Sci, Hsinchu, Taiwan
[2] Natl Tsing Hua Univ, Inst Commun Engn, Hsinchu, Taiwan
关键词
CNN; Accelerator; Energy-Aware; Real-Time Inference; High Utilization;
D O I
10.1109/ISCAS51556.2021.9401526
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Deep convolution Neural Network (DCNN) has been widely used in computer vision tasks. However, for edge device, even then inference has too large computational complexity and data access amount. Due to the mentioned shortcomings, the inference latency of state-of-the-art models are still impractical for real-world applications. In this paper, we proposed a high utilization energy-aware real-time inference deep convolutional neural network accelerator, which outperforms the current accelerators. First, we use 1x1 size convolution kernels as the smallest unit of the computing unit. And we design suitable computing unit for different models based on the requirement of each model. Second, we use Reuse Feature SRAM to store the output of current layer in the chip and use as the input of the next layer. Moreover, we import Output Reuse Strategy and Ring Stream Data flow not only to expand the reuse rate of data in the chip but to reduce the amount of data exchange between chips and DRAM. Finally, we present On-fly Pooling Module to let the calculation of the Pooling layer to be completed directly in the chip. With the aid of the proposed method in this paper, the implemented CNN acceleration chip has extreme high hardware utilization rate. We reduce a generous amount of data transfer on the specific module, ECNN [1]. Compared to the methods without reuse strategy, we can reduce 533 times of data access amount. At the same time, we have enough computing power to perform real-time execution of the existing image classification model, VGG16 [2] and MobileNet [3]. Compared with the design in [4], we can speed up 7.52 times and have 1.92x energy efficiency.
引用
收藏
页数:5
相关论文
共 50 条
  • [41] On reliability- and energy-aware scheduling of real-time embedded systems
    Xie, X. N.
    Zhu, Q. X.
    Zhang, Y. W.
    INFORMATION SCIENCE AND MANAGEMENT ENGINEERING, VOLS 1-3, 2014, 46 : 1139 - 1144
  • [42] Energy-Aware Real-Time Task Scheduling Exploiting Temporal Locality
    Kim, Yong-Hee
    Jung, Myoung-Jo
    Lee, Cheol-Hoon
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2010, E93D (05): : 1147 - 1153
  • [43] Eciton: Very Low-power Recurrent Neural Network Accelerator for Real-time Inference at the Edge
    Chen, Jeffrey
    Jun, Sang-Woo
    Hong, Sehwan
    He, Warrick
    Moon, Jinyeong
    ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2024, 17 (01)
  • [44] Endurance-Aware Deep Neural Network Real-Time Scheduling on ReRAM Accelerators
    Sha, Shi
    Yang, Xiaokun
    Szczecinski, Trent M.
    Whitman, Daniel
    Wen, Wujie
    Quart, Gang
    2023 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE, CSCI 2023, 2023, : 404 - 410
  • [45] Evaluation framework for energy-aware multiprocessor scheduling in real-Time systems
    Mejia-Alvarez, Pedro
    Moncada-Madero, David
    Aydin, Hakan
    Diaz-Ramirez, Arnoldo
    JOURNAL OF SYSTEMS ARCHITECTURE, 2019, 98 : 388 - 402
  • [46] Energy-aware Adaptive MAC Protocol for Real-time Sensor Networks
    Fateh, Benazir
    Govindarasu, Manimaran
    2011 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2011,
  • [47] Energy-aware dynamic reconfiguration algorithms for real-time multitasking systems
    Wang, Weixun
    Ranka, Sanjay
    Mishra, Prabhat
    SUSTAINABLE COMPUTING-INFORMATICS & SYSTEMS, 2011, 1 (01): : 35 - 45
  • [48] Energy-aware dynamic slack allocation for real-time multitasking systems
    Wang, Weixun
    Ranka, Sanjay
    Mishra, Prabhat
    SUSTAINABLE COMPUTING-INFORMATICS & SYSTEMS, 2012, 2 (03): : 128 - 137
  • [49] A voltage and resource synthesis technique for energy-aware real-time systems
    Kang, Dong-In
    Crago, Stephen P.
    Suh, Jinwoo
    McMahon, Janice
    13TH IEEE INTERNATIONAL CONFERENCE ON EMBEDDED AND REAL-TIME COMPUTING SYSTEMS AND APPLICATIONS, PROCEEDINGS, 2007, : 20 - +
  • [50] Real-Time Tasks Oriented Energy-Aware Scheduling in Virtualized Clouds
    Zhu, Xiaomin
    Yang, Laurence T.
    Chen, Huangke
    Wang, Ji
    Yin, Shu
    Liu, Xiaocheng
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2014, 2 (02) : 168 - 180