A Dynamic Deep Neural Network Design for Efficient Workload Allocation in Edge Computing

被引:34
|
作者
Lo, Chi [1 ]
Su, Yu-Yi [1 ]
Lee, Chun-Yi [1 ]
Chang, Shih-Chieh [1 ]
机构
[1] Natl Tsing Hua Univ, Dept Comp Sci, 101,Sec 2,Kuang Fu Rd, Hsinchu 30013, Taiwan
关键词
Deep neural network; workload allocation; edge computing; authentic operation; dynamic network structure;
D O I
10.1109/ICCD.2017.49
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Unreliable communication channels and limited computing resources at the edge end are two primary constraints of battery-powered movable devices, such as autonomous robots and unmanned aerial vehicles (UAVs). The impact is especially severe for those performing deep neural network (DNN) computations. With increasing demand for accuracy, the trend in modern DNN designs is the use of cascaded modularized layers. Implementing a deep network at the edge increases computational workloads and resource occupancy, leading to an increase in battery drain. Using a shallow network and offloading workloads to backbone servers, however, incur significant latency overheads caused by unstable communication channels. Hence, dynamic DNN design techniques for efficient workload allocation are urgently required to manage the amount of workload transmissions while achieving the required accuracy. In this paper, we explore the use of authentic operation (AO) unit and dynamic network structure to enhance DNNs. The AO unit defines a set of stochastic threshold values for different DNN output classes and determines at runtime if an input has to be transferred to backbone servers for further analysis. The dynamic network structure adjusts its depth according to channel availability. Experiments have been comprehensively performed on several well-known DNN models and datasets. Our results show that, on an average, the proposed techniques are able to reduce the amount of transmissions by up to 17% compared to previous methods under the same accuracy requirement.
引用
收藏
页码:273 / 280
页数:8
相关论文
共 50 条
  • [21] esDNN: Deep Neural Network Based Multivariate Workload Prediction in Cloud Computing Environments
    Xu, Minxian
    Song, Chenghao
    Wu, Huaming
    Gill, Sukhpal Singh
    Ye, Kejiang
    Xu, Chengzhong
    ACM TRANSACTIONS ON INTERNET TECHNOLOGY, 2022, 22 (03)
  • [22] Deep Reinforcement Learning for IoT Network Dynamic Clustering in Edge Computing
    Liu, Qingzhi
    Cheng, Long
    Ozcelebi, Tanir
    Murphy, John
    Lukkien, Johan
    2019 19TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2019, : 600 - 603
  • [23] AI-oriented Workload Allocation for Cloud-Edge Computing
    Hao, Tianshu
    Zhan, Jianfeng
    Hwang, Kai
    Gao, Wanling
    Wen, Xu
    21ST IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING (CCGRID 2021), 2021, : 555 - 564
  • [24] Application Aware Workload Allocation for Edge Computing-Based IoT
    Fan, Qiang
    Ansari, Nirwan
    IEEE INTERNET OF THINGS JOURNAL, 2018, 5 (03): : 2146 - 2153
  • [25] Energy Efficient Resource Allocation for Heterogeneous Workload in Cloud Computing
    Malik, Surbhi
    Saini, Poonam
    Rani, Sudesh
    PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON FRONTIERS IN INTELLIGENT COMPUTING: THEORY AND APPLICATIONS, FICTA 2016, VOL 1, 2017, 515 : 89 - 97
  • [26] Energy efficient computing task offloading strategy for deep neural networks in mobile edge computing
    Gao H.
    Li X.
    Zhou B.
    Liu X.
    Xu J.
    Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2020, 26 (06): : 1607 - 1615
  • [27] Energy-Aware Workload Allocation for Distributed Deep Neural Networks in Edge-Cloud Continuum
    Jin, Yi
    Xu, Jiawei
    Huan, Yuxiang
    Yan, Yulong
    Zheng, Lirong
    Zou, Zhuo
    32ND IEEE INTERNATIONAL SYSTEM ON CHIP CONFERENCE (IEEE SOCC 2019), 2019, : 213 - 217
  • [28] Energy-Efficient and Delay-Guaranteed Workload Allocation in IoT-Edge-Cloud Computing Systems
    Guo, Mian
    Li, Lei
    Guan, Quansheng
    IEEE ACCESS, 2019, 7 : 78685 - 78697
  • [29] Efficient Workload Allocation and User-Centric Utility Maximization for Task Scheduling in Collaborative Vehicular Edge Computing
    Huang, Xumin
    Yu, Rong
    Ye, Dongdong
    Shu, Lei
    Xie, Shengli
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (04) : 3773 - 3787
  • [30] Workload Prediction for Efficient Node Management in Mobile Edge Computing
    Oikonomou, Efthymios
    Plastras, Stefanos
    Tsoumatidis, Dimitrios
    Skoutas, Dimitrios N.
    Rouskas, Angelos
    2024 23RD IFIP NETWORKING CONFERENCE, IFIP NETWORKING 2024, 2024, : 461 - 467