Input-Dependent Edge-Cloud Mapping of Recurrent Neural Networks Inference

Cited by: 7
Authors
Pagliari, Daniele Jahier [1 ]
Chiaro, Roberta [1 ]
Chen, Yukai [1 ]
Vinco, Sara [1 ]
Macii, Enrico [2 ]
Poncino, Massimo [1 ]
Affiliations
[1] Politecn Torino, Dept Control & Comp Engn, Turin, Italy
[2] Politecn Torino, Interuniv Dept Reg & Urban Studies & Planning, Turin, Italy
DOI
10.1109/dac18072.2020.9218595
Chinese Library Classification (CLC) number
TP31 [Computer Software]
Discipline classification code
081202; 0835
Abstract
Given the computational complexity of Recurrent Neural Network (RNN) inference, IoT and mobile devices typically offload this task to the cloud. However, the execution time and energy consumption of RNN inference strongly depend on the length of the processed input. Therefore, once communication costs are also considered, it may be more convenient to process short input sequences locally and offload only long ones to the cloud. In this paper, we propose a low-overhead runtime tool that performs this choice automatically. Results based on real edge and cloud devices show that our method simultaneously reduces the total execution time and energy consumption of the system compared to solutions that run RNN inference fully locally or fully in the cloud.
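The abstract does not say how the runtime tool makes its decision, so the following Python sketch is only one plausible reading of the idea, not the authors' method. It assumes a hypothetical linear cost model: local execution time grows with the input length, while offloading pays a fixed network round trip plus per-element transmission and cloud compute time. The names `CostModel`, `choose_target`, and `break_even_length`, and all coefficients, are illustrative assumptions.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CostModel:
    """Hypothetical linear cost model; every coefficient is an assumption."""
    edge_time_per_step: float   # seconds per input element on the edge device
    cloud_time_per_step: float  # seconds per input element on the cloud server
    tx_time_per_step: float     # seconds to transmit one input element
    round_trip_latency: float   # fixed network round-trip time, in seconds


def edge_time(model: CostModel, seq_len: int) -> float:
    """Estimated time to run inference locally on a sequence of seq_len elements."""
    return model.edge_time_per_step * seq_len


def cloud_time(model: CostModel, seq_len: int) -> float:
    """Estimated time to offload: fixed latency plus per-element transfer and compute."""
    per_step = model.tx_time_per_step + model.cloud_time_per_step
    return model.round_trip_latency + per_step * seq_len


def choose_target(model: CostModel, seq_len: int) -> str:
    """Return "edge" when local execution is estimated to be no slower, else "cloud"."""
    return "edge" if edge_time(model, seq_len) <= cloud_time(model, seq_len) else "cloud"


def break_even_length(model: CostModel) -> float:
    """Input length above which offloading is estimated to be faster.

    Returns infinity when the edge device is never slower per element,
    in which case offloading never pays off under this model.
    """
    per_step_gain = model.edge_time_per_step - (
        model.tx_time_per_step + model.cloud_time_per_step
    )
    if per_step_gain <= 0:
        return float("inf")
    return model.round_trip_latency / per_step_gain
```

With made-up coefficients such as 2 ms per element on the edge, 0.2 ms per element in the cloud, 0.3 ms per element of transmission, and a 50 ms round trip, short sequences stay local and long ones are offloaded. The same structure could use energy instead of time as the cost, which is the second metric the abstract mentions.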
Pages: 6
Related papers
50 in total
  • [41] An Efficient Edge-Cloud Partitioning of Random Forests for Distributed Sensor Networks
    Shen, Tianyi
    Mishra, Cyan Subhra
    Sampson, Jack
    Kandemir, Mahmut Taylan
    Narayanan, Vijaykrishnan
    IEEE EMBEDDED SYSTEMS LETTERS, 2024, 16 (01) : 21 - 24
  • [42] An Efficient Low Complexity Edge-Cloud Framework for Security in IoT Networks
    Truong Thu Huong
    Ta Phuong Bac
    Dao Minh Long
    Bui Doan Thang
    Tran Duc Luong
    Nguyen Thanh Binh
    IEEE ICCE 2020: 2020 IEEE EIGHTH INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND ELECTRONICS (ICCE), 2021, : 533 - 539
  • [43] Dynamic DNN Model Selection and Inference Offloading for Video Analytics with Edge-Cloud Collaboration
    Wang, Xuezhi
    Gao, Guanyu
    Wu, Xiaohu
    Lyu, Yan
    Wu, Weiwei
    PROCEEDINGS OF THE 32ND WORKSHOP ON NETWORK AND OPERATING SYSTEMS SUPPORT FOR DIGITAL AUDIO AND VIDEO, NOSSDAV 2022, 2022, : 64 - 70
  • [44] Preemptive Scheduling for Distributed Machine Learning Jobs in Edge-Cloud Networks
    Wang, Ne
    Zhou, Ruiting
    Jiao, Lei
    Zhang, Renli
    Li, Bo
    Li, Zongpeng
    IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2022, 40 (08) : 2411 - 2425
  • [45] An Edge-Cloud Approach for Dynamic Resource Allocation in Drone Communication Networks
    Das, Debashis
    Njilla, Laurent
    Ghosh, Uttam
    Shetty, Sachin
    Levin, Eugene
    IEEE INFOCOM 2024-IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS, INFOCOM WKSHPS 2024, 2024,
  • [46] Joint Cooperative Content Caching and Recommendation in Mobile Edge-Cloud Networks
    Ke, Zhihui
    Cheng, Meng
    Zhou, Xiaobo
    Li, Keqiu
    Qiu, Tie
    WEB AND BIG DATA, PT I, APWEB-WAIM 2020, 2020, 12317 : 424 - 438
  • [47] Service Configuration Optimization in Edge-Cloud Networks Leveraging Log Analysis
    Sun, Mengyu
    Zhou, Zhangbing
    Xue, Xiao
    Zhang, Wenbo
    Hung, Patrick C. K.
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (09) : 6719 - 6731
  • [48] JAVP: Joint-Aware Video Processing with Edge-Cloud Collaboration for DNN Inference
    Yang, Zheming
    Ji, Wen
    Guo, Qi
    Wang, Zhi
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 9152 - 9160
  • [49] Complexity-aware Adaptive Training and Inference for Edge-Cloud Distributed AI Systems
    Long, Yinghan
    Chakraborty, Indranil
    Srinivasan, Gopalakrishnan
    Roy, Kaushik
    2021 IEEE 41ST INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2021), 2021, : 573 - 583
  • [50] Edge-cloud computing oriented large-scale online music education mechanism driven by neural networks
    Wen Xing
    Adam Slowik
    J. Dinesh Peter
    Journal of Cloud Computing, 13