Improved Early Exiting Activation to Accelerate Edge Inference

被引:1
|
作者
Park, Junyong [1 ]
Lee, Jong-Ryul [1 ]
Moon, Yong-Hyuk [1 ,2 ]
机构
[1] Elect & Telecommun Res Inst ETRI, Daejeon, South Korea
[2] Univ Sci & Technol UST, Daejeon, South Korea
关键词
Edge Inference; Early Exiting;
D O I
10.1109/ICTC52510.2021.9621109
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
As mobile & edge devices are getting powerful, on-device deep learning is becoming a reality. However, there are still many challenges for deep learning edge inferences, such as limited resources such as computing power, memory space, and energy. To address these challenges, model compression such as channel pruning, low rank representation, network quantization, and early exiting has been introduce to reduce the computational load of neural networks at a whole. In this paper, we propose an improved method of implementing early exiting branches on a pre-defined neural network, so that it can determine whether the input data is easy to process, therefore use less resource to execute the task. Our method starts with an entire search for activations in a given network, then inserting early exiting modules, testing those early exit branches, resulting in selecting useful branches that are both accurate and fast. Our contribution is reducing the computing time of neural networks by breaking the flow of models using execution branches. Additionally, by testing on all activations in neural network, we gain knowledge of the neural network model and insight on where to place the ideal early exit auxiliary classifier. We test on ResNet model and show reduction in real computation time on single input images.
引用
收藏
页码:1813 / 1817
页数:5
相关论文
共 50 条
  • [1] Resource Allocation for Batched Multiuser Edge Inference with Early Exiting
    Liu, Zhiyan
    Lan, Qiao
    Huang, Kaibin
    ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 3614 - 3620
  • [2] DNN Inference Acceleration with Partitioning and Early Exiting in Edge Computing
    Li, Chao
    Xu, Hongli
    Xu, Yang
    Wang, Zhiyuan
    Huang, Liusheng
    WIRELESS ALGORITHMS, SYSTEMS, AND APPLICATIONS, WASA 2021, PT I, 2021, 12937 : 465 - 478
  • [3] Resource Allocation for Multiuser Edge Inference With Batching and Early Exiting
    Liu, Zhiyan
    Lan, Qiao
    Huang, Kaibin
    IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2023, 41 (04) : 1186 - 1200
  • [4] Dynamic Batching and Early-Exiting for Accurate and Timely Edge Inference
    She, Yechao
    Shi, Tuo
    Wang, Jianping
    Liu, Bin
    2024 IEEE 99TH VEHICULAR TECHNOLOGY CONFERENCE, VTC2024-SPRING, 2024,
  • [5] Edge Computing with Early Exiting for Adaptive Inference in Mobile Autonomous Systems
    Angelucci, Simone
    Valentini, Roberto
    Levorato, Marco
    Santucci, Fortunato
    Chiasserini, Carla Fabiana
    ICC 2024 - IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2024, : 2980 - 2985
  • [6] Resource-Efficient DNN Inference With Early Exiting in Serverless Edge Computing
    Guo, Xiaolin
    Dong, Fang
    Shen, Dian
    Huang, Zhaowu
    Zhang, Jinghui
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2025, 24 (05) : 3650 - 3666
  • [7] DeeBERT: Dynamic Early Exiting for Accelerating BERT Inference
    Xin, Ji
    Tang, Raphael
    Lee, Jaejun
    Yu, Yaoliang
    Lin, Jimmy
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 2246 - 2251
  • [8] Adaptive Early Exiting for Collaborative Inference over Noisy Wireless Channels
    Jankowski, Mikolaj
    Gunduz, Deniz
    Mikolajczyk, Krystian
    2024 IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING FOR COMMUNICATION AND NETWORKING, ICMLCN 2024, 2024, : 126 - 131
  • [9] Multimodal Adaptive Inference for Document Image Classification with Anytime Early Exiting
    Hamed, Omar
    Bakkali, Souhail
    Blaschko, Matthew
    Moens, Sien
    Van Landeghem, Jordy
    DOCUMENT ANALYSIS AND RECOGNITION-ICDAR 2024, PT IV, 2024, 14807 : 270 - 286
  • [10] SmartBERT: A Promotion of Dynamic Early Exiting Mechanism for Accelerating BERT Inference
    Hu, Boren
    Zhu, Yun
    Li, Jiacheng
    Tang, Siliang
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 5067 - 5075