EENet: Energy Efficient Neural Networks with Run-time Power Management

被引:0
|
作者
Li, Xiangjie [1 ]
Shen, Yingtao [1 ]
Zou, An [1 ]
Ma, Yehan [1 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
关键词
Neural Networks; Early Exit; Energy Efficiency; Inference Time; Feedback Control;
D O I
10.1109/DAC56929.2023.10247701
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning approaches, such as convolution neural networks (CNNs), have achieved tremendous success in versatile applications. However, one of the challenges to deploy the deep learning models on resource-constrained systems is its huge energy cost. As a dynamic inference approach, early exit adds exiting layers to the networks, which can terminate the inference earlier with accurate results to save energy. The current passive decision-making for energy regulation of early exit cannot adapt to ongoing inference status, varying inference workloads, and timing constraints, let alone guide the reasonable configuration of the computing platforms alongside the inference proceeds for potential energy saving. In this paper, we propose an Energy Efficient Neural Networks (EENet), which introduces a plug-in module to the state-of-the-art networks by incorporating run-time power management. Within each inference, we establish prediction of where the network will exit and adjust computing configurations (i.e., frequency and voltage) accordingly over a small timescale. Considering multiple inferences over a large timescale, we provide frequency and voltage calibration advice, given inference workloads and timing constraints. Finally, the dynamic voltage and frequency scaling (DVFS) governor configures voltage and frequency to execute the network according to the prediction and calibration. Extensive experimental results demonstrate that EENet achieves up to 63.8% energy-saving compared with classic deep learning networks and 21.5% energy-saving compared with the early exit under state-of-the-art exiting strategies, together with improved timing performance.
引用
收藏
页数:6
相关论文
共 50 条
  • [21] Run-Time Efficient Probabilistic Model Checking
    Filieri, Antonio
    Ghezzi, Carlo
    Tamburrelli, Giordano
    2011 33RD INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE), 2011, : 341 - 350
  • [22] Efficient run-time monitoring of timing constraints
    Mok, Aloysius K.
    Liu, Guangtian
    Real-Time Technology and Applications - Proceedings, 1997, : 252 - 262
  • [23] Efficient run-time parallelization for DO loops
    Yang, CT
    Tseng, SS
    Hsieh, MH
    Kao, SH
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 1998, 14 (01) : 237 - 253
  • [24] Boosting SIMD Benefits through a Run-time and Energy Efficient DLP Detection
    Jordan, Michael Guilherme
    Knorst, Tiago
    Vicenzi, Julio
    Rutzig, Mateus Beck
    2019 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2019, : 722 - 727
  • [25] Run-time Compilation of Bytecode in Sensor Networks
    Ellul, Joshua
    Martinez, Kirk
    2010 FOURTH INTERNATIONAL CONFERENCE ON SENSOR TECHNOLOGIES AND APPLICATIONS (SENSORCOMM), 2008, : 133 - 138
  • [26] An Energy Efficient Run-Time Security Monitor for True Random Number Generators
    Al Khas, Ahmad
    Cicek, Ihsan
    2019 11TH INTERNATIONAL CONFERENCE ON ELECTRICAL AND ELECTRONICS ENGINEERING (ELECO 2019), 2019, : 488 - 491
  • [27] Energy Efficient MIMO Processing: a Case Study of Opportunistic Run-Time Approximations
    Novo, David
    Farahpour, Nazanin
    Ienne, Paolo
    Ahmad, Ubaid
    Catthoor, Francky
    2014 DESIGN, AUTOMATION AND TEST IN EUROPE CONFERENCE AND EXHIBITION (DATE), 2014,
  • [28] Efficient management of custom instructions for run-time reconfigurable instruction set processors
    Lam, Siew-Kei
    Krishnan, Bharathi N.
    Srikanthan, Thambipillai
    2006 IEEE INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE TECHNOLOGY, PROCEEDINGS, 2006, : 261 - +
  • [29] A Survey on Run-time Power Monitors at the Edge
    Zoni, Davide
    Galimberti, Andrea
    Fornaciari, William
    ACM COMPUTING SURVEYS, 2023, 55 (14S)
  • [30] Power Mac's run-time architecture
    Thelen, Randy
    Byte, 1994, 19 (04):