ARADA: Adaptive Resource Allocation for Improving Energy Efficiency in Deep Learning Accelerators

被引:0
|
作者
Azhar, Muhammad Waqar [1 ]
Zouzoula, Stavroula [1 ]
Trancoso, Pedro [1 ]
机构
[1] Chalmers Univ Technol, Gothenburg, Sweden
关键词
CNNs; Energy Efficiency; Resource Allocation; Accelerators;
D O I
10.1145/3587135.3592207
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Deep Learning (DL) applications are entering every part of our life given their ability to solve complex problems. Nevertheless, energy efficiency is still a major concern due to the large computational and memory requirements. State-of-the-art accelerators strive to address this issue by optimizing the architecture to the compute requirements of DL algorithms. However, there is always a mismatch between compute and memory requirements and what is offered by a particular design. A way to close this gap is by providing run-time adaptation or resource allocation to improve efficiency. This paper proposes an adaptive resource allocation for deep learning applications (ARADA) with the goal of improving energy efficiency for deep learning accelerators. This is leveraged by having a layer-by-layer resource allocation. The rationale is that each layer in the DL model has a unique compute and memory bandwidth requirement and allocating fixed resources to all layers leads to inefficiencies. This can be achieved by means of resource allocation (e.g., voltage-frequency, memory bandwidth) to save energy without sacrificing performance. Experimental results show that applying ARADA to the execution of 9 state-of-the-art CNN models results in an energy savings of 38% on average compared to race-to-idle for an Edge TPU coupled with LPDDR4 off-chip memory.
引用
收藏
页码:63 / 72
页数:10
相关论文
共 50 条
  • [1] IMPROVING LEARNING EFFICIENCY FOR WIRELESS RESOURCE ALLOCATION WITH SYMMETRIC PRIOR
    Sun, Chengjian
    Wu, Jiajun
    Yang, Chenyang
    IEEE WIRELESS COMMUNICATIONS, 2022, 29 (02) : 162 - 168
  • [2] Diminishing Returns and Deep Learning for Adaptive CPU Resource Allocation of Containers
    Abdullah, Muhammad
    Iqbal, Waheed
    Bukhari, Faisal
    Erradi, Abdelkarim
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2020, 17 (04): : 2052 - 2063
  • [3] Deep Learning-Based Channel Adaptive Resource Allocation in LoRaWAN
    Farhad, Arshad
    Kim, Dae-Ho
    Yoon, Jeong-Sun
    Pyun, Jae-Young
    2022 INTERNATIONAL CONFERENCE ON ELECTRONICS, INFORMATION, AND COMMUNICATION (ICEIC), 2022,
  • [4] Optimizing energy efficiency in MEC networks: a deep learning approach with Cybertwin-driven resource allocation
    Lilhore, Umesh Kumar
    Simaiya, Sarita
    Dalal, Surjeet
    Faujdar, Neetu
    Alroobaea, Roobaea
    Alsafyani, Majed
    Baqasah, Abdullah M.
    Algarni, Sultan
    JOURNAL OF CLOUD COMPUTING-ADVANCES SYSTEMS AND APPLICATIONS, 2024, 13 (01):
  • [5] Resource allocation for joint energy and spectral efficiency in cloud radio access network based on deep reinforcement learning
    Iqbal, Amjad
    Tham, Mau-Luen
    Chang, Yoong Choon
    TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES, 2022, 33 (04)
  • [6] Predictive Resource Allocation with Deep Learning
    Guo, Jia
    Yang, Chenyang
    2018 IEEE 88TH VEHICULAR TECHNOLOGY CONFERENCE (VTC-FALL), 2018,
  • [7] Improving Efficiency in Resource Allocation Of OFDMA Femtocell Networks
    Aghababaiyan, Keyvan
    Pakravan, Mohammad Reza
    2013 21ST IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2013,
  • [8] Resource Allocation in HetNets with Green Energy Supply Based on Deep Reinforcement Learning
    Zheng, Weijun
    Fang, Jinghui
    Yuan, Siyu
    Guo, Da
    Zhang, Yong
    HUMAN CENTERED COMPUTING, 2019, 11956 : 671 - 682
  • [9] DeepVRM: Deep Learning Based Virtual Resource Management for Energy Efficiency
    Zakia Zaman
    Sabidur Rahman
    Fazle Rafsani
    Ishraq R. Rahman
    Mahmuda Naznin
    Journal of Network and Systems Management, 2023, 31
  • [10] DeepVRM: Deep Learning Based Virtual Resource Management for Energy Efficiency
    Zaman, Zakia
    Rahman, Sabidur
    Rafsani, Fazle
    Rahman, Ishraq R. R.
    Naznin, Mahmuda
    JOURNAL OF NETWORK AND SYSTEMS MANAGEMENT, 2023, 31 (04)