ARADA: Adaptive Resource Allocation for Improving Energy Efficiency in Deep Learning Accelerators

被引:0
|
作者
Azhar, Muhammad Waqar [1 ]
Zouzoula, Stavroula [1 ]
Trancoso, Pedro [1 ]
机构
[1] Chalmers Univ Technol, Gothenburg, Sweden
关键词
CNNs; Energy Efficiency; Resource Allocation; Accelerators;
D O I
10.1145/3587135.3592207
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Deep Learning (DL) applications are entering every part of our life given their ability to solve complex problems. Nevertheless, energy efficiency is still a major concern due to the large computational and memory requirements. State-of-the-art accelerators strive to address this issue by optimizing the architecture to the compute requirements of DL algorithms. However, there is always a mismatch between compute and memory requirements and what is offered by a particular design. A way to close this gap is by providing run-time adaptation or resource allocation to improve efficiency. This paper proposes an adaptive resource allocation for deep learning applications (ARADA) with the goal of improving energy efficiency for deep learning accelerators. This is leveraged by having a layer-by-layer resource allocation. The rationale is that each layer in the DL model has a unique compute and memory bandwidth requirement and allocating fixed resources to all layers leads to inefficiencies. This can be achieved by means of resource allocation (e.g., voltage-frequency, memory bandwidth) to save energy without sacrificing performance. Experimental results show that applying ARADA to the execution of 9 state-of-the-art CNN models results in an energy savings of 38% on average compared to race-to-idle for an Edge TPU coupled with LPDDR4 off-chip memory.
引用
收藏
页码:63 / 72
页数:10
相关论文
共 50 条
  • [21] Adaptive Resource Allocation Considering Power-Consumption Outage: A Deep Reinforcement Learning Approach
    Luo, Jia
    Chen, Qianbin
    Tang, Lun
    Zhang, Zhicai
    Li, Yu
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (06) : 8111 - 8116
  • [22] Improving resource allocation efficiency in health care delivery systems
    Alkaabi, Rashed
    El Halim, A. O. Abd
    Mahmoud, Samy
    2006 CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1-5, 2006, : 2360 - +
  • [23] Improving resource recycling based on deep learning
    Xu, Yunjian
    Guo, Aiyin
    JOURNAL OF AMBIENT INTELLIGENCE AND SMART ENVIRONMENTS, 2024, 16 (04) : 527 - 540
  • [24] Resource Allocation for Energy Efficiency Optimization in Heterogeneous Networks
    Tang, Jie
    So, Daniel K. C.
    Alsusa, Emad
    Hamdi, Khairi Ashour
    Shojaeifard, Arman
    IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2015, 33 (10) : 2104 - 2117
  • [25] On-Device Deep Learning: Survey on Techniques Improving Energy Efficiency of DNNs
    Boumendil, Anais
    Bechkit, Walid
    Benatchba, Karima
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [26] Deep Reinforcement Learning for Resource Allocation in Business Processes
    Zbikowski, Kamil
    Ostapowicz, Michal
    Gawrysiak, Piotr
    PROCESS MINING WORKSHOPS, ICPM 2022, 2023, 468 : 177 - 189
  • [27] Predictive Resource Allocation with Interference Coordination by Deep Learning
    Xu, Zhaoqi
    Guo, Jia
    Yang, Chenyang
    2019 11TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2019,
  • [28] Deep Reinforcement Learning for Resource Allocation in Massive MIMO
    Chen, Liang
    Sun, Fanglei
    Li, Kai
    Chen, Ruiqing
    Yang, Yang
    Wang, Jun
    29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 1611 - 1615
  • [29] Deep Reinforcement Learning Based Resource Allocation for LoRaWAN
    Li, Aohan
    2022 IEEE 96TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2022-FALL), 2022,
  • [30] Deep Reinforcement Learning for Improving Resource Utilization Efficiency of URLLC With Imperfect Channel State Information
    Zhang, Litianyi
    She, Changyang
    Ying, Kai
    Li, Yonghui
    Vucetic, Branka
    IEEE WIRELESS COMMUNICATIONS LETTERS, 2023, 12 (10) : 1796 - 1800