Edge Computing with Early Exiting for Adaptive Inference in Mobile Autonomous Systems

被引:0
|
作者
Angelucci, Simone [1 ,2 ]
Valentini, Roberto [1 ,2 ]
Levorato, Marco [3 ]
Santucci, Fortunato [1 ,2 ]
Chiasserini, Carla Fabiana [4 ]
机构
[1] Univ Aquila, Dept Informat Engn Comp Sci & Math, Laquila, Italy
[2] Univ Aquila, Ctr Ex EMERGE, Laquila, Italy
[3] UC Irvine, Donald Bren Sch Informat & Comp Sci, Irvine, CA USA
[4] Politecn Torino, Dept Elect & Telecommun, Turin, Italy
关键词
Early exiting; Edge computing; Mobile edge applications; Markov decision process; Connected and automated vehicles;
D O I
10.1109/ICC51166.2024.10622411
中图分类号
学科分类号
摘要
Early Exiting (EE) is an emerging computing paradigm where Deep Neural Networks (DNNs) are equipped with earlier classifiers, enabling trading-off accuracy with inference latency. EE can be effectively combined with edge computing, a paradigm that allows mobile nodes to offload complex tasks, such as the execution of DNNs, to servers at the edge of the network, thus reducing computing times and energy consumption at the mobile devices. The integration of such technologies is particularly attractive for the support of applications for connected and automated driving. In this paper, we consider a system that jointly leverages the benefits of EE and edge computing, and we model their complex interactions by means of a Markov Decision Process (MDP). We then formulate an optimization problem to select the inference strategy that maximizes the average task accuracy. Importantly, such an optimization problem has low complexity, as the optimal policy can be derived by mapping the MDP into a linear program. Our numerical results focus on a use case centered on automated vehicles connected with an edge server under varying channel and network conditions, and show that our solution achieves up to 11% higher accuracy compared to the optimal policy with no EE.
引用
收藏
页码:2980 / 2985
页数:6
相关论文
共 50 条
  • [1] DNN Inference Acceleration with Partitioning and Early Exiting in Edge Computing
    Li, Chao
    Xu, Hongli
    Xu, Yang
    Wang, Zhiyuan
    Huang, Liusheng
    WIRELESS ALGORITHMS, SYSTEMS, AND APPLICATIONS, WASA 2021, PT I, 2021, 12937 : 465 - 478
  • [2] Resource-Efficient DNN Inference With Early Exiting in Serverless Edge Computing
    Guo, Xiaolin
    Dong, Fang
    Shen, Dian
    Huang, Zhaowu
    Zhang, Jinghui
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2025, 24 (05) : 3650 - 3666
  • [3] Improved Early Exiting Activation to Accelerate Edge Inference
    Park, Junyong
    Lee, Jong-Ryul
    Moon, Yong-Hyuk
    12TH INTERNATIONAL CONFERENCE ON ICT CONVERGENCE (ICTC 2021): BEYOND THE PANDEMIC ERA WITH ICT CONVERGENCE INNOVATION, 2021, : 1813 - 1817
  • [4] Resource Allocation for Batched Multiuser Edge Inference with Early Exiting
    Liu, Zhiyan
    Lan, Qiao
    Huang, Kaibin
    ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 3614 - 3620
  • [5] Resource Allocation for Multiuser Edge Inference With Batching and Early Exiting
    Liu, Zhiyan
    Lan, Qiao
    Huang, Kaibin
    IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2023, 41 (04) : 1186 - 1200
  • [6] On Model Coding for Distributed Inference and Transmission in Mobile Edge Computing Systems
    Zhang, Jingjing
    Simeone, Osvaldo
    IEEE COMMUNICATIONS LETTERS, 2019, 23 (06) : 1065 - 1068
  • [7] Adaptive and Resilient Model-Distributed Inference in Edge Computing Systems
    Li, Pengzhen
    Koyuncu, Erdem
    Seferoglu, Hulya
    IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY, 2023, 4 : 1263 - 1273
  • [8] Dynamic Batching and Early-Exiting for Accurate and Timely Edge Inference
    She, Yechao
    Shi, Tuo
    Wang, Jianping
    Liu, Bin
    2024 IEEE 99TH VEHICULAR TECHNOLOGY CONFERENCE, VTC2024-SPRING, 2024,
  • [9] Adaptive Replication for Mobile Edge Computing
    Chang, Wan-Chi
    Wang, Pi-Chung
    IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2018, 36 (11) : 2422 - 2432
  • [10] Adaptive Inference Reinforcement Learning for Task Offloading in Vehicular Edge Computing Systems
    Tang, Dian
    Zhang, Xuefei
    Li, Meng
    Tao, Xiaofeng
    2020 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS), 2020,