Joint compressing and partitioning of CNNs for fast edge-cloud collaborative intelligence for IoT

Cited by: 7
Authors
Zhang, Wanpeng [1]
Wang, Nuo [2]
Li, Liying [2]
Wei, Tongquan [2]
Affiliations
[1] Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha, Peoples R China
[2] East China Normal Univ, Dept Comp Sci & Technol, Shanghai, Peoples R China
Keywords
Edge-cloud collaborative intelligence; CNN acceleration; CNN partitioning; IoT; MODEL
DOI
10.1016/j.sysarc.2022.102461
Chinese Library Classification
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Artificial intelligence (AI) technologies are widely used to process the vast amount of data generated in the Internet of Things (IoT) in real time. However, traditional approaches to deploying AI models impose overwhelming computation and communication overheads. In this paper, we propose a novel edge-cloud collaborative intelligence scheme that jointly compresses and partitions Convolutional Neural Network (CNN) models for fast response in IoT applications. The proposed approach first accelerates a CNN with an acceleration technique that generates new layers; because their outputs are smaller than those of the unaccelerated layers, these new layers can serve as candidate partitioning points. It then builds fine-grained prediction models to accurately estimate the execution latency of each layer in the CNN model and finds an optimal partitioning. The compressed CNN model is split into two parts according to this partitioning, and the two parts are deployed on the edge device and in the cloud, respectively, where they collaboratively minimize the overall latency without compromising the accuracy of the deep CNN model. To the best of our knowledge, this is the first work that jointly compresses and partitions CNN models for fast edge-cloud collaborative intelligence while considering both execution latency and communication latency. Experimental results show that the proposed technique can reduce the latency by up to 73.14% compared to five benchmarking methods.
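The partitioning step described in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: it assumes that per-layer latencies on the edge and in the cloud are already predicted, that the model is a simple chain of layers, and that total latency is edge execution plus transmission of the intermediate output plus cloud execution. The function name `best_partition` and all parameters are hypothetical.

```python
from typing import Sequence, Tuple


def best_partition(
    edge_ms: Sequence[float],      # assumed: predicted latency of each layer on the edge device
    cloud_ms: Sequence[float],     # assumed: predicted latency of each layer in the cloud
    out_bytes: Sequence[float],    # assumed: output size of each layer
    input_bytes: float,            # size of the raw model input
    bandwidth_bytes_per_ms: float  # assumed constant uplink bandwidth
) -> Tuple[int, float]:
    """Return (k, latency): layers [0, k) run on the edge, layers [k, n) in the cloud."""
    n = len(edge_ms)
    if len(cloud_ms) != n or len(out_bytes) != n:
        raise ValueError("all per-layer sequences must have the same length")
    if bandwidth_bytes_per_ms <= 0:
        raise ValueError("bandwidth must be positive")

    best_k, best_latency = 0, float("inf")
    edge_prefix = 0.0             # edge time of layers [0, k)
    cloud_suffix = sum(cloud_ms)  # cloud time of layers [k, n)

    for k in range(n + 1):
        if k == 0:
            # Everything runs in the cloud: the raw input is uploaded.
            transfer = input_bytes / bandwidth_bytes_per_ms
        elif k == n:
            # Everything runs on the edge: assume the result is consumed locally.
            transfer = 0.0
        else:
            # The output of layer k-1 crosses the network.
            transfer = out_bytes[k - 1] / bandwidth_bytes_per_ms

        total = edge_prefix + transfer + cloud_suffix
        if total < best_latency:
            best_k, best_latency = k, total

        if k < n:
            edge_prefix += edge_ms[k]
            cloud_suffix -= cloud_ms[k]

    return best_k, best_latency


if __name__ == "__main__":
    k, latency = best_partition(
        edge_ms=[4.0, 6.0, 8.0],
        cloud_ms=[1.0, 1.0, 2.0],
        out_bytes=[1000, 200, 50],
        input_bytes=3000,
        bandwidth_bytes_per_ms=100,
    )
    print(k, latency)
```

In this toy setting, a layer with a small output is an attractive split point because little data has to cross the network, which is the intuition the abstract gives for using the accelerated layers as candidate partitioning points.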
Pages: 9