A High-Level Modeling Framework for Estimating Hardware Metrics of CNN Accelerators

被引:9
|
作者
Juracy, Leonardo Rezende [1 ]
Moreira, Matheus Trevisan [2 ]
Amory, Alexandre de Morais [3 ]
Hampel, Alexandre F. [1 ]
Moraes, Fernando Gehm [1 ]
机构
[1] Pontifical Catholic Univ Rio Grande Sul PUCRS, Sch Technol, BR-90619900 Porto Alegre, RS, Brazil
[2] Chronos Tech, San Diego, CA 92122 USA
[3] TeCIP Inst, Scuola Super SantAnna, I-56124 Pisa, Italy
关键词
Convolutional neural networks; Space exploration; Estimation; Computer architecture; Training; Hardware acceleration; Convolution; CNN; convolution hardware accelerator; system simulator; PPA; design space exploration;
D O I
10.1109/TCSI.2021.3104644
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
GPUs became the reference platform for both training and inference phases of Convolutional Neural Networks (CNN) due to their tailored architecture to the CNN operators. However, GPUs are power-hungry architectures. A path to enable the deployment of CNNs in energy-constrained devices is adopting hardware accelerators for the inference phase. The design space exploration of CNNs using standard approaches, such as RTL, is limited due to their complexity. Thus, designers need frameworks enabling design space exploration that delivers accurate hardware estimation metrics to deploy CNNs. This work proposes a framework to explore CNNs design space, providing power, performance, and area (PPA) estimations. The heart of the framework is a system simulator. The system simulator front-end is TensorFlow, and the back-end is performance estimations obtained from the physical synthesis of hardware accelerators, not only from components like multipliers and adders. The first set of results evaluate the CNN accuracy using integer quantization, the accelerators PPA after physical synthesis, and the benefits of using a system simulator. These results allow a rich design space exploration, enabling selecting the best set of CNN parameters to meet the design constraints.
引用
收藏
页码:4783 / 4795
页数:13
相关论文
共 50 条
  • [21] Hardware Description Language Enhancements for High Level Synthesis of Hardware Accelerators
    Kasivinayagam, Gurusankar
    Skanda, Romaanchan
    Burli, Aditya G.
    Jadon, Shruti
    Sidhu, Reetinder
    ADVANCES IN COMPUTING AND DATA SCIENCES (ICACDS 2022), PT I, 2022, 1613 : 1 - 12
  • [22] High-Throughput FPGA-Based Hardware Accelerators for Deflate Compression and Decompression Using High-Level Synthesis
    Ledwon, Morgan
    Cockburn, Bruce F.
    Han, Jie
    IEEE ACCESS, 2020, 8 : 62207 - 62217
  • [23] HAPE: A high-level area-power estimation framework for FPGA-based accelerators
    Makni, Mariem
    Niar, Smail
    Baklouti, Mouna
    Abid, Mohamed
    MICROPROCESSORS AND MICROSYSTEMS, 2018, 63 : 11 - 27
  • [24] High-Level Synthesis of Accelerators in Embedded Scalable Platforms
    Mantovani, Paolo
    Di Guglielmo, Giuseppe
    Carloni, Luca P.
    2016 21ST ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC), 2016, : 204 - 211
  • [25] A High-Level Programming Approach for Distributed Systems with Accelerators
    Steuwer, Michel
    Kegel, Philipp
    Gorlatch, Sergei
    NEW TRENDS IN SOFTWARE METHODOLOGIES, TOOLS AND TECHNIQUES, 2012, 246 : 430 - 441
  • [26] HIGH-LEVEL METRICS FOR ENERGY EFFICIENCY EVALUATION
    Marcu, Marius
    Tudor, Dacian
    UNIVERSITY POLITEHNICA OF BUCHAREST SCIENTIFIC BULLETIN SERIES C-ELECTRICAL ENGINEERING AND COMPUTER SCIENCE, 2010, 72 (03): : 131 - 146
  • [27] High-level metrics for energy efficiency evaluation
    Marcu, Marius
    Tudor, Dacian
    UPB Scientific Bulletin, Series C: Electrical Engineering, 2010, 72 (03): : 131 - 146
  • [28] A Design Approach to Automatically Synthesize ANSI-C Assertions during High-Level Synthesis of Hardware Accelerators
    Ben Hammouda, Mohamed
    Coussy, Philippe
    Lagadec, Loic
    2014 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2014, : 165 - 168
  • [29] A high-level abstraction for graphics hardware programming
    Tuler, D
    Celes, W
    SIBGRAPI 2002: XV BRAZILIAN SYMPOSIUM ON COMPUTER GRAPHICS AND IMAGE PROCESSING, PROCEEDINGS, 2002, : 306 - 312
  • [30] A Unified Design Flow to Automatically Generate On-Chip Monitors During High-Level Synthesis of Hardware Accelerators
    Ben Hammouda, Mohamed
    Coussy, Philippe
    Lagadec, Loic
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2017, 36 (03) : 384 - 397