High Utilization Energy-Aware Real-Time Inference Deep Convolutional Neural Network Accelerator

被引:4
|
作者
Lin, Kuan-Ting [1 ]
Chiu, Ching-Te [1 ]
Chang, Jheng-Yi [2 ]
Hsiao, Shan-Chien [1 ]
机构
[1] Natl Tsing Hua Univ, Dept Comp Sci, Hsinchu, Taiwan
[2] Natl Tsing Hua Univ, Inst Commun Engn, Hsinchu, Taiwan
关键词
CNN; Accelerator; Energy-Aware; Real-Time Inference; High Utilization;
D O I
10.1109/ISCAS51556.2021.9401526
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Deep convolution Neural Network (DCNN) has been widely used in computer vision tasks. However, for edge device, even then inference has too large computational complexity and data access amount. Due to the mentioned shortcomings, the inference latency of state-of-the-art models are still impractical for real-world applications. In this paper, we proposed a high utilization energy-aware real-time inference deep convolutional neural network accelerator, which outperforms the current accelerators. First, we use 1x1 size convolution kernels as the smallest unit of the computing unit. And we design suitable computing unit for different models based on the requirement of each model. Second, we use Reuse Feature SRAM to store the output of current layer in the chip and use as the input of the next layer. Moreover, we import Output Reuse Strategy and Ring Stream Data flow not only to expand the reuse rate of data in the chip but to reduce the amount of data exchange between chips and DRAM. Finally, we present On-fly Pooling Module to let the calculation of the Pooling layer to be completed directly in the chip. With the aid of the proposed method in this paper, the implemented CNN acceleration chip has extreme high hardware utilization rate. We reduce a generous amount of data transfer on the specific module, ECNN [1]. Compared to the methods without reuse strategy, we can reduce 533 times of data access amount. At the same time, we have enough computing power to perform real-time execution of the existing image classification model, VGG16 [2] and MobileNet [3]. Compared with the design in [4], we can speed up 7.52 times and have 1.92x energy efficiency.
引用
收藏
页数:5
相关论文
共 50 条
  • [31] A real-time hourly ozone prediction system using deep convolutional neural network
    Ebrahim Eslami
    Yunsoo Choi
    Yannic Lops
    Alqamah Sayeed
    Neural Computing and Applications, 2020, 32 : 8783 - 8797
  • [32] Real-Time Fuel Truck Detection Algorithm Based on Deep Convolutional Neural Network
    Alsanad, Hamid R.
    Ucan, Osman N.
    Ilyas, Muhammad
    Khan, Atta Ur Rehman
    Bayat, Oguz
    IEEE ACCESS, 2020, 8 : 118808 - 118817
  • [33] Real-Time Prediction of Transarterial Drug Delivery Based on a Deep Convolutional Neural Network
    Yuan, Xin-Yi
    Hua, Yue
    Aubry, Nadine
    Zhussupbekov, Mansur
    Antaki, James F.
    Zhou, Zhi-Fu
    Peng, Jiang-Zhou
    APPLIED SCIENCES-BASEL, 2022, 12 (20):
  • [34] A Novel Real-time Driver Monitoring System Based on Deep Convolutional Neural Network
    Zhao, Yiheng
    Mammeri, Abdelhamid
    Boukerche, Azzedine
    2019 IEEE INTERNATIONAL SYMPOSIUM ON ROBOTIC AND SENSORS ENVIRONMENTS (ROSE 2019), 2019, : 198 - 204
  • [35] A real-time hourly ozone prediction system using deep convolutional neural network
    Eslami, Ebrahim
    Choi, Yunsoo
    Lops, Yannic
    Sayeed, Alqamah
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (13): : 8783 - 8797
  • [36] Real-time Road Cracks Detection based on Improved Deep Convolutional Neural Network
    Hassan, Syed Ali
    Han, Seung Heon
    Shin, Soo Young
    2020 IEEE CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE), 2020,
  • [37] Real-time Obstacle Detection Over Rails Using Deep Convolutional Neural Network
    Xu, Yuchuan
    Gao, Chunhai
    Yuan, Lei
    Tang, Simon
    Wei, Guodong
    2019 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2019, : 1007 - 1012
  • [38] Dynamic harvesting- and energy-aware real-time task scheduling
    Hasanloo, Mahmoud
    Kargahi, Mehdi
    Jalilian, Shahrokh
    SUSTAINABLE COMPUTING-INFORMATICS & SYSTEMS, 2020, 28
  • [39] Energy-Aware Embedded Classifier Design for Real-Time Emotion Analysis
    Padmanabhan, Manoj
    Murali, Srinivasan
    Rincon, Francisco
    Atienza, David
    2015 37TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2015, : 2275 - 2278
  • [40] A real-time energy-aware routing strategy for Wireless Sensor Networks
    Khalid, Zubair
    Ahmed, Ghufran
    Khan, Noor M.
    2007 ASIA-PACIFIC CONFERENCE ON COMMUNICATIONS, 2007, : 381 - +