Input-Dependent Edge-Cloud Mapping of Recurrent Neural Networks Inference

被引:7
|
作者
Pagliari, Daniele Jahier [1 ]
Chiaro, Roberta [1 ]
Chen, Yukai [1 ]
Vinco, Sara [1 ]
Macii, Enrico [2 ]
Poncino, Massimo [1 ]
机构
[1] Politecn Torino, Dept Control & Comp Engn, Turin, Italy
[2] Politecn Torino, Interuniv Dept Reg & Urban Studies & Planning, Turin, Italy
关键词
D O I
10.1109/dac18072.2020.9218595
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Given the computational complexity of Recurrent Neural Networks (RNNs) inference, IoT and mobile devices typically offload this task to the cloud. However, the execution time and energy consumption of RNN inference strongly depends on the length of the processed input. Therefore, considering also communication costs, it may be more convenient to process short input sequences locally and only offload long ones to the cloud. In this paper, we propose a low-overhead runtime tool that performs this choice automatically. Results based on real edge and cloud devices show that our method is able to simultaneously reduce the total execution time and energy consumption of the system compared to solutions that run RNN inference fully locally or fully in the cloud.
引用
收藏
页数:6
相关论文
共 50 条
  • [31] Balancing Video Analytics Processing and Bandwidth for Edge-Cloud Networks
    O'Gorman, Lawrence
    Wang, Xiaoyang
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 2618 - 2623
  • [32] Task Offloading and Resource Scheduling in Hybrid Edge-Cloud Networks
    Zhang, Qi
    Gui, Lin
    Zhu, Shichao
    Lang, Xiupu
    IEEE ACCESS, 2021, 9 : 85350 - 85366
  • [33] Priority-aware Static Task Mapping for Edge-Cloud Platforms
    Yoshimoto, Jo
    Taniguchi, Ittetsu
    Tomiyama, Hiroyuki
    Onoye, Takao
    2022 INTERNATIONAL CONFERENCE ON ELECTRONICS, INFORMATION, AND COMMUNICATION (ICEIC), 2022,
  • [34] Poincare mapping of continuous recurrent neural networks excited by temporal external input
    Sato, S
    Gohara, K
    INTERNATIONAL JOURNAL OF BIFURCATION AND CHAOS, 2000, 10 (07): : 1677 - 1695
  • [35] Reliable adaptive edge-cloud collaborative DNN inference acceleration scheme combining computing and communication resources in optical networks
    Yin, Shan
    Jiao, Yurong
    You, Chenyu
    Cai, Mengru
    Jin, Tianyu
    Huang, Shanguo
    JOURNAL OF OPTICAL COMMUNICATIONS AND NETWORKING, 2023, 15 (10) : 750 - 764
  • [36] An Adaptive Neural Architecture Search Design for Collaborative Edge-Cloud Computing
    Lu, Haodong
    Du, Miao
    He, Xiaoming
    Qian, Kai
    Chen, Jianli
    Sun, Yanfei
    Wang, Kun
    IEEE NETWORK, 2021, 35 (05): : 83 - 89
  • [37] PriPro: Towards Effective Privacy Protection on Edge-Cloud System running DNN Inference
    Gao, Ruiyuan
    Yang, Hailong
    Huang, Shaohan
    Dun, Ming
    Li, Mingzhen
    Luan, Zerong
    Luan, Zhongzhi
    Qian, Depei
    21ST IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING (CCGRID 2021), 2021, : 334 - 343
  • [38] EINS: Edge-Cloud Deep Model Inference with Network-Efficiency Schedule in Serverless
    Peng, Shijie
    Lin, Yanying
    Chen, Wenyan
    Tang, Yingfei
    Duan, Xu
    Ye, Kejiang
    PROCEEDINGS OF THE 2024 27 TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024, 2024, : 1376 - 1381
  • [39] JAVP: Joint-Aware Video Processing with Edge-Cloud Collaboration for DNN Inference
    Yang, Zheming
    Ji, Wen
    Guo, Qi
    Wang, Zhi
    MM 2023 - Proceedings of the 31st ACM International Conference on Multimedia, 2023, : 9152 - 9160
  • [40] Efficient Computing Resource Sharing for Mobile Edge-Cloud Computing Networks
    Zhang, Yongmin
    Lan, Xiaolong
    Ren, Ju
    Cai, Lin
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2020, 28 (03) : 1227 - 1240