A Dynamic Deep Neural Network Design for Efficient Workload Allocation in Edge Computing

Cited by: 34
Authors
Lo, Chi [1]
Su, Yu-Yi [1]
Lee, Chun-Yi [1]
Chang, Shih-Chieh [1]
Affiliations
[1] Natl Tsing Hua Univ, Dept Comp Sci, 101,Sec 2,Kuang Fu Rd, Hsinchu 30013, Taiwan
Keywords
Deep neural network; workload allocation; edge computing; authentic operation; dynamic network structure
DOI
10.1109/ICCD.2017.49
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Unreliable communication channels and limited computing resources at the edge are two primary constraints on battery-powered movable devices, such as autonomous robots and unmanned aerial vehicles (UAVs). The impact is especially severe for devices performing deep neural network (DNN) computations. With increasing demand for accuracy, modern DNN designs tend toward cascaded modularized layers. Implementing a deep network at the edge increases computational workload and resource occupancy, which in turn increases battery drain. Using a shallow network and offloading workloads to backbone servers, however, incurs significant latency overhead caused by unstable communication channels. Hence, dynamic DNN design techniques for efficient workload allocation are urgently required to manage the amount of workload transmission while achieving the required accuracy. In this paper, we explore the use of an authentic operation (AO) unit and a dynamic network structure to enhance DNNs. The AO unit defines a set of stochastic threshold values for different DNN output classes and determines at runtime whether an input has to be transferred to backbone servers for further analysis. The dynamic network structure adjusts its depth according to channel availability. Experiments have been performed comprehensively on several well-known DNN models and datasets. Our results show that, on average, the proposed techniques reduce the amount of transmissions by up to 17% compared to previous methods under the same accuracy requirement.
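The two mechanisms named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say how the thresholds are derived or compared, so the sketch assumes a per-class threshold on the softmax confidence of the predicted class, and it assumes the device runs a deeper local network when the channel is unavailable. All function names, parameters, and values (`should_offload`, `choose_depth`, `class_thresholds`) are hypothetical.

```python
import math
from typing import List, Tuple


def softmax(logits: List[float]) -> List[float]:
    """Convert raw scores to probabilities (shifted by the max for stability)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def should_offload(logits: List[float],
                   class_thresholds: List[float]) -> Tuple[int, bool]:
    """Return (predicted class, offload flag).

    Assumption: an input is sent to the backbone server when the edge
    network's confidence in its predicted class falls below the threshold
    stored for that class. The thresholds here are plain constants.
    """
    probs = softmax(logits)
    predicted = max(range(len(probs)), key=probs.__getitem__)
    offload = probs[predicted] < class_thresholds[predicted]
    return predicted, offload


def choose_depth(channel_available: bool,
                 shallow_depth: int,
                 full_depth: int) -> int:
    """Pick how many layers to run on the device.

    Assumption: with a usable channel the device runs a shallow network and
    relies on offloading; without one it runs the full-depth network locally.
    """
    return shallow_depth if channel_available else full_depth


if __name__ == "__main__":
    thresholds = [0.9, 0.9, 0.9]
    print(should_offload([4.0, 0.0, 0.0], thresholds))  # confident prediction
    print(should_offload([1.0, 0.9, 0.0], thresholds))  # ambiguous prediction
    print(choose_depth(channel_available=False, shallow_depth=4, full_depth=16))
```

Keeping one threshold per class lets classes that the shallow network handles well stay on the device more often, while harder classes are forwarded more readily.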
Pages: 273-280
Page count: 8
Related papers
50 in total
  • [41] Dynamic function allocation in edge serverless computing networks
    Li, Shuo
    Bastug, Ejder
    Di Martino, Catello
    Di Renzo, Marco
    IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 486 - 491
  • [42] A Dynamic Service Allocation Algorithm in Mobile Edge Computing
    Hu, Bo
    Chen, Jianye
    Li, Fengcun
    2017 INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY CONVERGENCE (ICTC), 2017, : 104 - 109
  • [43] Dynamic resource allocation scheme for mobile edge computing
    Changqing Gong
    Wanying He
    Ting Wang
    Abdullah Gani
    Han Qi
    The Journal of Supercomputing, 2023, 79 : 17187 - 17207
  • [44] Dynamic resource allocation scheme for mobile edge computing
    Gong, Changqing
    He, Wanying
    Wang, Ting
    Gani, Abdullah
    Qi, Han
    JOURNAL OF SUPERCOMPUTING, 2023, 79 (15): : 17187 - 17207
  • [45] Dynamic Adaptive User Allocation in Mobile Edge Computing
    Li, Jiajia
    Ji, Shunhui
    Jin, Huiying
    Dong, Hai
    Ge, Zhiyuan
    Zhang, Pengcheng
    2024 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE SERVICES ENGINEERING, SSE 2024, 2024, : 179 - 187
  • [46] Energy-efficient Workload Allocation and Computation Resource Configuration in Distributed Cloud/Edge Computing Systems With Stochastic Workloads
    Zhang, Wenyu
    Zhang, Zhenjiang
    Zeadally, Sherali
    Chao, Han-Chieh
    Leung, Victor C. M.
    IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2020, 38 (06) : 1118 - 1132
  • [47] Deep Reinforcement Learning-Based Workload Scheduling for Edge Computing
    Zheng, Tao
    Wan, Jian
    Zhang, Jilin
    Jiang, Congfeng
    JOURNAL OF CLOUD COMPUTING-ADVANCES SYSTEMS AND APPLICATIONS, 2022, 11 (01):
  • [48] Deep Reinforcement Learning-Based Workload Scheduling for Edge Computing
    Tao Zheng
    Jian Wan
    Jilin Zhang
    Congfeng Jiang
    Journal of Cloud Computing, 11
  • [49] EasiEdge: A Novel Global Deep Neural Networks Pruning Method for Efficient Edge Computing
    Yu, Fang
    Cui, Li
    Wang, Pengcheng
    Han, Chuanqi
    Huang, Ruoran
    Huang, Xi
    IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (03): : 1259 - 1271
  • [50] Pruning deep convolutional neural networks for efficient edge computing in condition assessment of infrastructures
    Wu, Rih-Teng
    Singla, Ankush
    Jahanshahi, Mohammad R.
    Bertino, Elisa
    Ko, Bong Jun
    Verma, Dinesh
    COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2019, 34 (09) : 774 - 789