Input-Dependent Edge-Cloud Mapping of Recurrent Neural Networks Inference

被引:7
|
作者
Pagliari, Daniele Jahier [1 ]
Chiaro, Roberta [1 ]
Chen, Yukai [1 ]
Vinco, Sara [1 ]
Macii, Enrico [2 ]
Poncino, Massimo [1 ]
机构
[1] Politecn Torino, Dept Control & Comp Engn, Turin, Italy
[2] Politecn Torino, Interuniv Dept Reg & Urban Studies & Planning, Turin, Italy
关键词
D O I
10.1109/dac18072.2020.9218595
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Given the computational complexity of Recurrent Neural Networks (RNNs) inference, IoT and mobile devices typically offload this task to the cloud. However, the execution time and energy consumption of RNN inference strongly depends on the length of the processed input. Therefore, considering also communication costs, it may be more convenient to process short input sequences locally and only offload long ones to the cloud. In this paper, we propose a low-overhead runtime tool that performs this choice automatically. Results based on real edge and cloud devices show that our method is able to simultaneously reduce the total execution time and energy consumption of the system compared to solutions that run RNN inference fully locally or fully in the cloud.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Optimal Input-Dependent Edge-Cloud Partitioning for RNN Inference
    Pagliari, Daniele Jahier
    Chiaro, Roberta
    Chen, Yukai
    Macii, Enrico
    Poncino, Massimo
    2019 26TH IEEE INTERNATIONAL CONFERENCE ON ELECTRONICS, CIRCUITS AND SYSTEMS (ICECS), 2019, : 442 - 445
  • [2] CRIME: Input-Dependent Collaborative Inference for Recurrent Neural Networks
    Pagliari, Daniele Jahier
    Chiaro, Roberta
    Macii, Enrico
    Poncino, Massimo
    IEEE TRANSACTIONS ON COMPUTERS, 2021, 70 (10) : 1626 - 1639
  • [3] On-demand inference acceleration for directed acyclic graph neural networks over edge-cloud collaboration
    Yang, Lei
    Shen, Xiaoyuan
    Zhong, Changyi
    Liao, Yuwei
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2023, 171 : 79 - 87
  • [4] Split Edge-Cloud Neural Networks for Better Adversarial Robustness
    Douch, Salmane
    Abid, Mohamed Riduan
    Zine-Dine, Khalid
    Bouzidi, Driss
    Benhaddou, Driss
    IEEE ACCESS, 2024, 12 : 158854 - 158865
  • [5] BBNet: A Novel Convolutional Neural Network Structure in Edge-Cloud Collaborative Inference
    Zhou, Hongbo
    Zhang, Weiwei
    Wang, Chengwei
    Ma, Xin
    Yu, Haoran
    SENSORS, 2021, 21 (13)
  • [6] Accelerating DNN Inference by Edge-Cloud Collaboration
    Chen, Jianan
    Qi, Qi
    Wang, Jingyu
    Sun, Haifeng
    Liao, Jianxin
    2021 IEEE INTERNATIONAL PERFORMANCE, COMPUTING, AND COMMUNICATIONS CONFERENCE (IPCCC), 2021,
  • [7] DeepSplit: Dynamic Splitting of Collaborative Edge-Cloud Convolutional Neural Networks
    Mehta, Rishabh
    Shorey, Rajeev
    2020 INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS & NETWORKS (COMSNETS), 2020,
  • [8] Hybrid SLM and LLM for Edge-Cloud Collaborative Inference
    Hao, Zixu
    Jiang, Huiqiang
    Jiang, Shiqi
    Ren, Ju
    Cao, Ting
    PROCEEDINGS OF THE 2024 WORKSHOP ON EDGE AND MOBILE FOUNDATION MODELS, EDGEFM 2024, 2024, : 36 - 41
  • [9] Neural quantile optimization for edge-cloud networking☆ ☆
    Du, Bin
    Zhang, He
    Cheng, Xiangle
    Zhang, Lei
    COMPUTER NETWORKS, 2024, 253
  • [10] Collaborative DNNs Inference with Joint Model Partition and Compression in Mobile Edge-Cloud Computing Networks
    Tang, Yaxin
    Li, Xiuhua
    Li, Hui
    Yang, Zhengyi
    Wang, Xiaofei
    Leung, Victor C. M.
    2024 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC 2024, 2024,