Hardware Architecture Exploration for Deep Neural Networks

Cited by: 2
Authors
Zheng, Wenqi [1]
Zhao, Yangyi [1]
Chen, Yunfan [1]
Park, Jinhong [2]
Shin, Hyunchul [1]
Affiliations
[1] Hanyang Univ, Dept Elect Engn, Ansan, South Korea
[2] Samsung Elect Inc, Suwon, South Korea
Keywords
AI architecture; Neural network architecture; CNN; Design space exploration;
DOI
10.1007/s13369-021-05455-4
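Chinese Library Classification
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification codes
07; 0710; 09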
Abstract
Owing to their strong performance, deep convolutional neural networks (CNNs) are rapidly gaining popularity across a broad range of applications. Because high-accuracy CNNs are both computation-intensive and memory-intensive, many researchers have shown significant interest in accelerator design. Furthermore, as the AI chip market grows, competition on the performance, cost, and power consumption of artificial intelligence SoC designs is increasing. It is therefore important to develop design techniques and platforms that support the efficient design of optimized AI architectures that satisfy given specifications within a short design time. In this research, we have developed design space exploration techniques and environments for the optimal design of the overall system, including computing modules and memories. Our current design platform is built using the NVIDIA Deep Learning Accelerator as the computing model, SRAM as the on-chip buffer, and GDDR6 DRAM as the off-chip memory. We also developed a program to estimate the processing time of a given neural network. By modifying both the on-chip SRAM size and the computing module size, a designer can explore the design space efficiently and then choose the optimal architecture, that is, the one with minimal cost that still satisfies the performance specification. To illustrate the operation of the design platform, two well-known deep CNNs, YOLOv3 and Faster R-CNN, are used. This technology can be used to explore and optimize hardware architectures for CNNs so that cost is minimized.
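The exploration loop described in the abstract (vary the computing module size and the on-chip SRAM size, estimate the processing time, keep the cheapest design that meets the performance specification) can be sketched roughly as follows. This is a minimal illustration, not the authors' platform: the timing model, the cost model, and every constant (`CLOCK_HZ`, `DRAM_BYTES_PER_S`, the cost weights, the workload numbers) are hypothetical placeholders chosen only to make the example run.

```python
from dataclasses import dataclass
from itertools import product
from typing import Iterable, Optional

# All constants below are assumptions for illustration, not values from the paper.
CLOCK_HZ = 1.0e9            # assumed accelerator clock
DRAM_BYTES_PER_S = 16.0e9   # assumed off-chip memory bandwidth


@dataclass(frozen=True)
class Design:
    macs: int       # multiply-accumulate units in the computing module
    sram_kib: int   # on-chip SRAM buffer size in KiB


def estimate_time_s(d: Design, workload_macs: float, base_traffic_bytes: float) -> float:
    """Toy estimate: the slower of compute time and off-chip transfer time."""
    compute_s = workload_macs / (d.macs * CLOCK_HZ)
    # Hypothetical reuse model: a larger buffer reduces off-chip traffic.
    traffic_bytes = base_traffic_bytes * (1.0 + 256.0 / d.sram_kib)
    memory_s = traffic_bytes / DRAM_BYTES_PER_S
    return max(compute_s, memory_s)


def cost(d: Design) -> float:
    """Hypothetical cost in arbitrary units (for example, relative silicon area)."""
    return 1.0 * d.macs + 2.0 * d.sram_kib


def explore(
    mac_options: Iterable[int],
    sram_options: Iterable[int],
    workload_macs: float,
    base_traffic_bytes: float,
    max_time_s: float,
) -> Optional[Design]:
    """Return the cheapest design meeting the time limit, or None if none does."""
    feasible = [
        d
        for d in (Design(m, s) for m, s in product(mac_options, sram_options))
        if estimate_time_s(d, workload_macs, base_traffic_bytes) <= max_time_s
    ]
    return min(feasible, key=cost, default=None)


if __name__ == "__main__":
    best = explore(
        mac_options=[512, 1024, 2048, 4096],
        sram_options=[128, 256, 512, 1024],
        workload_macs=30e9,        # hypothetical per-frame workload
        base_traffic_bytes=100e6,  # hypothetical per-frame off-chip traffic
        max_time_s=0.015,          # hypothetical performance specification
    )
    print(best, None if best is None else cost(best))
```

An exhaustive sweep is used here because the two-parameter grid is small; a real platform would replace `estimate_time_s` with a layer-by-layer estimator for the target network.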
Pages: 9703-9712
Page count: 10