ARADA: Adaptive Resource Allocation for Improving Energy Efficiency in Deep Learning Accelerators

被引:0
|
作者
Azhar, Muhammad Waqar [1 ]
Zouzoula, Stavroula [1 ]
Trancoso, Pedro [1 ]
机构
[1] Chalmers Univ Technol, Gothenburg, Sweden
关键词
CNNs; Energy Efficiency; Resource Allocation; Accelerators;
D O I
10.1145/3587135.3592207
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Deep Learning (DL) applications are entering every part of our life given their ability to solve complex problems. Nevertheless, energy efficiency is still a major concern due to the large computational and memory requirements. State-of-the-art accelerators strive to address this issue by optimizing the architecture to the compute requirements of DL algorithms. However, there is always a mismatch between compute and memory requirements and what is offered by a particular design. A way to close this gap is by providing run-time adaptation or resource allocation to improve efficiency. This paper proposes an adaptive resource allocation for deep learning applications (ARADA) with the goal of improving energy efficiency for deep learning accelerators. This is leveraged by having a layer-by-layer resource allocation. The rationale is that each layer in the DL model has a unique compute and memory bandwidth requirement and allocating fixed resources to all layers leads to inefficiencies. This can be achieved by means of resource allocation (e.g., voltage-frequency, memory bandwidth) to save energy without sacrificing performance. Experimental results show that applying ARADA to the execution of 9 state-of-the-art CNN models results in an energy savings of 38% on average compared to race-to-idle for an Edge TPU coupled with LPDDR4 off-chip memory.
引用
收藏
页码:63 / 72
页数:10
相关论文
共 50 条
  • [31] Adaptive and Efficient Resource Allocation in Cloud Datacenters Using Actor-Critic Deep Reinforcement Learning
    Chen, Zheyi
    Hu, Jia
    Min, Geyong
    Luo, Chunbo
    El-Ghazawi, Tarek
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 33 (08) : 1911 - 1923
  • [32] Enhancing Dynamic Production Scheduling and Resource Allocation Through Adaptive Control Systems with Deep Reinforcement Learning
    Aderoba, Olugbenga Adegbemisola
    Mpofu, Kluunbu Ani
    Adenuga, Olukorede Tijani
    Nzengue, Alliance Gracia Bibili
    PROCEEDINGS OF THE CONFERENCE ON PRODUCTION SYSTEMS AND LOGISTICS, CPSL 2024, 2024, : 814 - 827
  • [33] A Deep Learning Approach for Mobility-Aware and Energy-Efficient Resource Allocation in MEC
    Ali, Zaiwar
    Khaf, Sadia
    Abbas, Ziaul Haq
    Abbas, Ghulam
    Muhammad, Fazal
    Kim, Sunghwan
    IEEE ACCESS, 2020, 8 : 179530 - 179546
  • [34] Research on development of digital finance in improving efficiency of tourism resource allocation
    Qin, Wang
    Li, Yang
    Yue, Zhonggang
    RESOURCES ENVIRONMENT AND SUSTAINABILITY, 2022, 8
  • [35] Improving Resource Efficiency in Internet Cafes by Virtualization and Optimal User Allocation
    Hamling, Isaac
    O'Sullivan, Michael
    Walker, Cameron
    Thielen, Clemens
    2015 IEEE/ACM 8TH INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING (UCC), 2015, : 26 - 34
  • [36] USING DATAFLOW TO OPTIMIZE ENERGY EFFICIENCY OF DEEP NEURAL NETWORK ACCELERATORS
    Chen, Yu-Hsin
    Emer, Joel
    Sze, Vivienne
    IEEE MICRO, 2017, 37 (03) : 12 - 21
  • [37] Improving Factory Resource and Energy Efficiency: The FREE Toolkit
    Despeisse, Melanie
    Evans, Steve
    ADVANCES IN PRODUCTION MANAGEMENT SYSTEMS: INNOVATIVE PRODUCTION MANAGEMENT TOWARDS SUSTAINABLE GROWTH (AMPS 2015), PT I, 2015, 459 : 640 - 646
  • [38] Improving Energy Efficiency In Climatic Test Chambers With Deep Learning and Absolute Humidity Methods
    Bekiroglu, Erdal
    Karaca, Hakan
    2023 11TH INTERNATIONAL CONFERENCE ON SMART GRID, ICSMARTGRID, 2023,
  • [39] On Self-adaptive Resource Allocation through Reinforcement Learning
    Panerati, Jacopo
    Sironi, Filippo
    Carminati, Matteo
    Maggio, Martina
    Beltrame, Giovanni
    Gmytrasiewicz, Piotr J.
    Sciuto, Donatella
    Santambrogio, Marco D.
    2013 NASA/ESA CONFERENCE ON ADAPTIVE HARDWARE AND SYSTEMS (AHS), 2013, : 23 - 30
  • [40] Adaptive Communication Resource Allocation for Federated Learning with UEP Strategies
    Lan, Muhang
    Xiao, Song
    Zhang, Wenyi
    2024 IEEE 99TH VEHICULAR TECHNOLOGY CONFERENCE, VTC2024-SPRING, 2024,