EENet: Energy Efficient Neural Networks with Run-time Power Management

Cited by: 0
Authors
Li, Xiangjie [1]
Shen, Yingtao [1]
Zou, An [1]
Ma, Yehan [1]
Affiliations
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
Keywords
Neural Networks; Early Exit; Energy Efficiency; Inference Time; Feedback Control;
DOI
10.1109/DAC56929.2023.10247701
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Deep learning approaches, such as convolutional neural networks (CNNs), have achieved tremendous success in a wide range of applications. However, one of the challenges in deploying deep learning models on resource-constrained systems is their huge energy cost. As a dynamic inference approach, early exit adds exiting layers to the network, which can terminate inference early with accurate results to save energy. The current passive decision-making for energy regulation of early exit cannot adapt to the ongoing inference status, varying inference workloads, and timing constraints, let alone guide a reasonable configuration of the computing platform as the inference proceeds for potential energy savings. In this paper, we propose Energy Efficient Neural Networks (EENet), which introduces a plug-in module to state-of-the-art networks by incorporating run-time power management. Within each inference, we predict where the network will exit and adjust the computing configuration (i.e., frequency and voltage) accordingly over a small timescale. Considering multiple inferences over a large timescale, we provide frequency and voltage calibration advice, given the inference workloads and timing constraints. Finally, the dynamic voltage and frequency scaling (DVFS) governor configures voltage and frequency to execute the network according to the prediction and calibration. Extensive experimental results demonstrate that EENet achieves up to 63.8% energy savings compared with classic deep learning networks and 21.5% energy savings compared with early exit under state-of-the-art exiting strategies, together with improved timing performance.
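The abstract gives no implementation details, so the following Python sketch is only a rough illustration of the general idea: given a predicted exit and a timing constraint, choose the lowest frequency/voltage pair that still meets the deadline. The operating-point table, the cycle counts per exit, the capacitance value, and the function names are all hypothetical assumptions, and the energy estimate uses the standard dynamic CMOS model (energy proportional to C·V² per cycle), not EENet's actual predictor or governor.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class OperatingPoint:
    freq_hz: float  # clock frequency in Hz
    volt: float     # supply voltage in V


# Hypothetical DVFS table, sorted by ascending frequency (assumed values).
OPERATING_POINTS = [
    OperatingPoint(0.5e9, 0.7),
    OperatingPoint(1.0e9, 0.9),
    OperatingPoint(1.5e9, 1.1),
]

# Hypothetical cumulative cycle counts needed to reach each exit (assumed values).
CYCLES_TO_EXIT = [2.0e8, 5.0e8, 9.0e8]


def pick_operating_point(predicted_exit: int, deadline_s: float) -> OperatingPoint:
    """Return the slowest operating point that finishes by the deadline.

    Falls back to the fastest point if no entry meets the deadline.
    """
    cycles = CYCLES_TO_EXIT[predicted_exit]
    for op in OPERATING_POINTS:
        if cycles / op.freq_hz <= deadline_s:
            return op
    return OPERATING_POINTS[-1]


def dynamic_energy(cycles: float, op: OperatingPoint, capacitance: float = 1e-9) -> float:
    """Estimate dynamic energy in joules as C * V^2 * cycles (assumed capacitance)."""
    return capacitance * op.volt ** 2 * cycles


if __name__ == "__main__":
    op = pick_operating_point(predicted_exit=0, deadline_s=0.5)
    print(op, dynamic_energy(CYCLES_TO_EXIT[0], op))
```

Under this simplified model, an inference predicted to exit early can run at a lower voltage and frequency and still meet its deadline, which is where the energy saving would come from.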
Pages: 6
Related papers
50 in total
  • [1] Approximation for Run-time Power Management
    Kanduri, Anil
    Haghbayan, Mohammad-Hashem
    Rahmani, Amir M.
    Liljeberg, Pasi
    2018 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2018,
  • [2] Run-time Energy and Time Management for Intermittent LoRaWAN Communications
    Mileiko, Sergey
    Ritom, Firdaus
    Shafik, Rishad
    Yakovlev, Alex
    Al-Akaidi, Mohammed A.
    2024 IEEE SENSORS APPLICATIONS SYMPOSIUM, SAS 2024, 2024,
  • [3] Run-time Energy Management for Intermittent LoRaWAN Communications
    Mileiko, Sergey
    Bramwell, Connor
    Ritom, Firdaus
    De Roure, David
    Cetinkaya, Oktay
    Balsamo, Domenico
    PROCEEDINGS OF THE 2023 11TH INTERNATIONAL WORKSHOP ON ENERGY HARVESTING & ENERGY-NEUTRAL SENSING SYSTEMS, ENSSYS 2023, 2023, : 23 - 29
  • [4] Energy Efficient On-Chip Power Delivery with Run-Time Voltage Regulator Clustering
    Pathak, Divya
    Hajkazemi, Mohammad Hossein
    Tavana, Mohammad Khavari
    Homayoun, Houman
    Savidis, Ioannis
    2016 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2016, : 1210 - 1213
  • [5] Run-time Mapping of Spiking Neural Networks to Neuromorphic Hardware
    Adarsha Balaji
    Thibaut Marty
    Anup Das
    Francky Catthoor
    Journal of Signal Processing Systems, 2020, 92 : 1293 - 1302
  • [6] Run-time Mapping of Spiking Neural Networks to Neuromorphic Hardware
    Balaji, Adarsha
    Marty, Thibaut
    Das, Anup
    Catthoor, Francky
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2020, 92 (11): : 1293 - 1302
  • [7] Reliable Power Efficient Systems through Run-time Reconfiguration
    El-Araby, Nahla
    Jantsch, Axel
    2022 20TH IEEE INTERREGIONAL NEWCAS CONFERENCE (NEWCAS), 2022, : 347 - 351
  • [8] Efficient thermal simulation for run-time temperature tracking and management
    Li, H
    Liu, P
    Qi, ZY
    Jin, LL
    Wu, W
    Tan, SXD
    Yang, J
    2005 IEEE INTERNATIONAL CONFERENCE ON COMPUTER DESIGN: VLSI IN COMPUTERS & PROCESSORS, PROCEEDINGS, 2005, : 130 - 133
  • [9] Energy Efficient Run-Time Incremental Mapping for 3-D Networks-on-Chip
    Wang, Xiao-Hang
    Liu, Peng
    Yang, Mei
    Palesi, Maurizio
    Jiang, Ying-Tao
    Huang, Michael C.
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2013, 28 (01) : 54 - 71
  • [10] Energy Efficient Run-Time Incremental Mapping for 3-D Networks-on-Chip
    王小航
    刘鹏
    杨梅
    Maurizio Palesi
    蒋颖涛
    黄巍
    Journal of Computer Science & Technology, 2013, 28 (01) : 54 - 71