EENet: Energy Efficient Neural Networks with Run-time Power Management

被引:0
|
作者
Li, Xiangjie [1 ]
Shen, Yingtao [1 ]
Zou, An [1 ]
Ma, Yehan [1 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
关键词
Neural Networks; Early Exit; Energy Efficiency; Inference Time; Feedback Control;
D O I
10.1109/DAC56929.2023.10247701
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning approaches, such as convolution neural networks (CNNs), have achieved tremendous success in versatile applications. However, one of the challenges to deploy the deep learning models on resource-constrained systems is its huge energy cost. As a dynamic inference approach, early exit adds exiting layers to the networks, which can terminate the inference earlier with accurate results to save energy. The current passive decision-making for energy regulation of early exit cannot adapt to ongoing inference status, varying inference workloads, and timing constraints, let alone guide the reasonable configuration of the computing platforms alongside the inference proceeds for potential energy saving. In this paper, we propose an Energy Efficient Neural Networks (EENet), which introduces a plug-in module to the state-of-the-art networks by incorporating run-time power management. Within each inference, we establish prediction of where the network will exit and adjust computing configurations (i.e., frequency and voltage) accordingly over a small timescale. Considering multiple inferences over a large timescale, we provide frequency and voltage calibration advice, given inference workloads and timing constraints. Finally, the dynamic voltage and frequency scaling (DVFS) governor configures voltage and frequency to execute the network according to the prediction and calibration. Extensive experimental results demonstrate that EENet achieves up to 63.8% energy-saving compared with classic deep learning networks and 21.5% energy-saving compared with the early exit under state-of-the-art exiting strategies, together with improved timing performance.
引用
收藏
页数:6
相关论文
共 50 条
  • [41] A Fuzzy Logic Based Power-Efficient Run-Time Reconfigurable Multicore System
    Shaheryar Najam
    Muhammad Yasir Qadri
    Zohaib Najam
    Jameel Ahmed
    Nadia N.Qadri
    ChineseJournalofElectronics, 2018, 27 (03) : 549 - 555
  • [42] Run-time Exploitation of Application Dynamism for Energy-efficient Exascale Computing (READEX)
    Oleynik, Yury
    Gerndt, Michael
    Schuchart, Joseph
    Kjeldsberg, Per Gunnar
    Nagel, Wolfgang E.
    2015 IEEE 18TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (CSE), 2015, : 347 - +
  • [43] Efficient Run-time Resource Management of a Manycore Accelerator for Stream-based Applications
    Lee, Chanhee
    Kim, Sungchan
    Ha, Soonhoi
    2013 IEEE 11TH SYMPOSIUM ON EMBEDDED SYSTEMS FOR REAL-TIME MULTIMEDIA (ESTIMEDIA), 2013, : 51 - 60
  • [44] Towards Run-time Efficient Hierarchical Reinforcement Learning
    Abramowitz, Sasha
    Nitschke, Geoff
    2022 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2022,
  • [45] An efficient run-time parallelizing scheme for wavefront scheduling
    Huang, TC
    Hsu, PH
    PROCEEDINGS OF THE 7TH JOINT CONFERENCE ON INFORMATION SCIENCES, 2003, : 580 - 583
  • [46] Efficient implementation of run-time generic types for java
    Allen, Eric
    Cartwright, Robert
    Stoler, Brian
    IFIP Advances in Information and Communication Technology, 1868, (207-236):
  • [47] AN EFFICIENT RUN-TIME ROUTER FOR CONNECTING MODULES IN FPGAS
    Suris, Jorge
    Patterson, Cameron
    Athanas, Peter
    2008 INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE AND LOGIC APPLICATIONS, VOLS 1 AND 2, 2008, : 125 - 130
  • [48] Efficient run-time verification of web service composition
    Yau, Yik-Shiung
    Chua, Fang-Fang
    International Journal of Web Engineering and Technology, 2015, 10 (02) : 170 - 198
  • [49] Incremental Model Synchronization for Efficient Run-Time Monitoring
    Vogel, Thomas
    Neumann, Stefan
    Hildebrandt, Stephan
    Giese, Holger
    Becker, Basil
    MODELS IN SOFTWARE ENGINEERING, 2010, 6002 : 124 - 139
  • [50] CADSE: communication aware design space exploration for efficient run-time MPSoC management
    Singh, Amit Kumar
    Kumar, Akash
    Wu, Jigang
    Srikanthan, Thambipillai
    FRONTIERS OF COMPUTER SCIENCE, 2013, 7 (03) : 416 - 430