Joint compressing and partitioning of CNNs for fast edge-cloud collaborative intelligence for IoT

被引:7
|
作者
Zhang, Wanpeng [1 ]
Wang, Nuo [2 ]
Li, Liying [2 ]
Wei, Tongquan [2 ]
机构
[1] Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha, Peoples R China
[2] East China Normal Univ, Dept Comp Sci & Technol, Shanghai, Peoples R China
关键词
Edge-cloud collaborative intelligence; CNN acceleration; CNN partitioning; IoT; MODEL;
D O I
10.1016/j.sysarc.2022.102461
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The artificial intelligence (AI) empowered advanced technologies have been widely applied to process in real-time the vast amount of data in the internet of things (IoT) for a fast response. However, traditional approaches to deploying AI models impose overwhelming computation and communication overheads. In this paper, we propose a novel edge-cloud collaborative intelligence scheme that jointly compresses and partitions Convolutional Neural Network (CNN) models for fast response in IoT applications. The proposed approach first accelerates a CNN by using an acceleration technique to generate new layers that can serve as candidate partitioning since their outputs are smaller than the unaccelerated layers. It then designs fine-grained prediction models to accurately estimate the execution latency for each layer in the CNN model, and finds an optimal partitioning. The proposed approach splits the compressed CNN model into two parts according to the optimal partitioning. The obtained two parts are deployed at the edge device and in the cloud, respectively, which collaboratively minimize the overall latency without compromising the accuracy of the deep CNN model. To the best of our knowledge, this is the first work that jointly compresses and partitions CNN models for fast edge-cloud collaborative intelligence considering both execution latency and communication latency. Experimental results show that the proposed technique can reduce the latency by up to 73.14% compared to five benchmarking methods.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] Edge-Cloud Collaborative Computation Offloading for Mixed Traffic
    Li, Qirui
    Guo, Mian
    Peng, Zhiping
    Cui, Delong
    He, Jieguang
    IEEE SYSTEMS JOURNAL, 2023, 17 (03): : 5023 - 5034
  • [22] Hybrid SLM and LLM for Edge-Cloud Collaborative Inference
    Hao, Zixu
    Jiang, Huiqiang
    Jiang, Shiqi
    Ren, Ju
    Cao, Ting
    PROCEEDINGS OF THE 2024 WORKSHOP ON EDGE AND MOBILE FOUNDATION MODELS, EDGEFM 2024, 2024, : 36 - 41
  • [23] Edge-cloud Collaborative Learning with Federated and Centralized Features
    Li, Zexi
    Li, Qunwei
    Zhou, Yi
    Zhong, Wenliang
    Zhang, Guannan
    Wu, Chao
    PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 1949 - 1953
  • [24] A SLAM Algorithm Based on Edge-Cloud Collaborative Computing
    Lv, Taizhi
    Zhang, Juan
    Chen, Yong
    JOURNAL OF SENSORS, 2022, 2022
  • [25] Nebula: An Edge-Cloud Collaborative Learning Framework for Dynamic Edge Environments
    Zhuang, Yan
    Zheng, Zhenzhe
    Shao, Yunfeng
    Li, Bingshuai
    Wu, Fan
    Chen, Guihai
    53RD INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2024, 2024, : 782 - 791
  • [26] Collaborative DNNs Inference with Joint Model Partition and Compression in Mobile Edge-Cloud Computing Networks
    Tang, Yaxin
    Li, Xiuhua
    Li, Hui
    Yang, Zhengyi
    Wang, Xiaofei
    Leung, Victor C. M.
    2024 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC 2024, 2024,
  • [27] Communication-Efficient Quantized Deep Compressed Sensing for Edge-Cloud Collaborative Industrial IoT Networks
    Zhang, Mingqiang
    Zhang, Haixia
    Zhang, Chuanting
    Yuan, Dongfeng
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (05) : 6613 - 6623
  • [28] IoT data analytic algorithms on edge-cloud infrastructure: A review
    Edje, Abel E.
    Abd Latiff, M. S.
    Chan, Weng Howe
    DIGITAL COMMUNICATIONS AND NETWORKS, 2023, 9 (06) : 1486 - 1515
  • [29] A novel approach for IoT tasks offloading in edge-cloud environments
    Almutairi, Jaber
    Aldossary, Mohammad
    JOURNAL OF CLOUD COMPUTING-ADVANCES SYSTEMS AND APPLICATIONS, 2021, 10 (01):
  • [30] IoT data analytic algorithms on edge-cloud infrastructure: A review
    Abel EEdje
    MSAbd Latiff
    Weng Howe Chan
    Digital Communications and Networks, 2023, 9 (06) : 1486 - 1515