Joint compressing and partitioning of CNNs for fast edge-cloud collaborative intelligence for IoT

被引:7
|
作者
Zhang, Wanpeng [1 ]
Wang, Nuo [2 ]
Li, Liying [2 ]
Wei, Tongquan [2 ]
机构
[1] Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha, Peoples R China
[2] East China Normal Univ, Dept Comp Sci & Technol, Shanghai, Peoples R China
关键词
Edge-cloud collaborative intelligence; CNN acceleration; CNN partitioning; IoT; MODEL;
D O I
10.1016/j.sysarc.2022.102461
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The artificial intelligence (AI) empowered advanced technologies have been widely applied to process in real-time the vast amount of data in the internet of things (IoT) for a fast response. However, traditional approaches to deploying AI models impose overwhelming computation and communication overheads. In this paper, we propose a novel edge-cloud collaborative intelligence scheme that jointly compresses and partitions Convolutional Neural Network (CNN) models for fast response in IoT applications. The proposed approach first accelerates a CNN by using an acceleration technique to generate new layers that can serve as candidate partitioning since their outputs are smaller than the unaccelerated layers. It then designs fine-grained prediction models to accurately estimate the execution latency for each layer in the CNN model, and finds an optimal partitioning. The proposed approach splits the compressed CNN model into two parts according to the optimal partitioning. The obtained two parts are deployed at the edge device and in the cloud, respectively, which collaboratively minimize the overall latency without compromising the accuracy of the deep CNN model. To the best of our knowledge, this is the first work that jointly compresses and partitions CNN models for fast edge-cloud collaborative intelligence considering both execution latency and communication latency. Experimental results show that the proposed technique can reduce the latency by up to 73.14% compared to five benchmarking methods.
引用
收藏
页数:9
相关论文
共 50 条
  • [41] Design of Platform-Independent IoT Applications in the Edge-Cloud Continuum
    Marozzo, Fabrizio
    Vinci, Andrea
    2024 20TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING IN SMART SYSTEMS AND THE INTERNET OF THINGS, DCOSS-IOT 2024, 2024, : 589 - 594
  • [42] Light-Edge: A Lightweight Authentication Protocol for IoT Devices in an Edge-Cloud Environment
    Shahidinejad, Ali
    Ghobaei-Arani, Mostafa
    Souri, Alireza
    Shojafar, Mohammad
    Kumari, Saru
    IEEE CONSUMER ELECTRONICS MAGAZINE, 2022, 11 (02) : 57 - 63
  • [43] Evaluation of Failure Analysis of IoT Applications Using Edge-Cloud Architecture
    Jassas, Mohammad S.
    Mahmoud, Qusay H.
    SYSCON 2022: THE 16TH ANNUAL IEEE INTERNATIONAL SYSTEMS CONFERENCE (SYSCON), 2022,
  • [44] ARVMEC: Adaptive Recommendation of Virtual Machines for IoT in Edge-Cloud Environment
    Xu, Yajing
    Li, Junnan
    Lu, Zhihui
    Wu, Jie
    Hung, Patrick C. K.
    Alelaiwi, Abdulhameed
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2020, 141 : 23 - 34
  • [45] SDN/NFV architectures for edge-cloud oriented IoT: A systematic review
    Ray, Partha Pratim
    Kumar, Neeraj
    COMPUTER COMMUNICATIONS, 2021, 169 (169) : 129 - 153
  • [46] Collaborative Edge-Cloud Data Transfer Optimization for Industrial Internet of Things
    Zhang, Xinchang
    Wang, Maoli
    Zhu, Xiaomin
    Yan, Zhiwei
    Geng, Guanggang
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2025, 36 (03) : 580 - 597
  • [47] An edge-cloud collaborative computing platform for building AIoT applications efficiently
    Guoping Rong
    Yangchen Xu
    Xinxin Tong
    Haojun Fan
    Journal of Cloud Computing, 10
  • [48] EC2Detect: Real-Time Online Video Object Detection in Edge-Cloud Collaborative IoT
    Guo, Siyan
    Zhao, Cong
    Wang, Guiqin
    Yang, Jiaqing
    Yang, Shusen
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (20): : 20382 - 20392
  • [49] Edge-cloud collaborative intelligent production scheduling based on digital twin
    Yifan, Han
    Tao, Feng
    Xiaokai, Liü
    Fangmin, Xu
    Chenglin, Zhao
    Journal of China Universities of Posts and Telecommunications, 2022, 29 (02): : 108 - 120
  • [50] Context-Aware Edge-Cloud Collaborative Scene Text Recognition
    Zhang, Puning
    Liu, Changfeng
    Wang, Honggang
    Wu, Dapeng
    Wang, Ruyan
    Zou, Hong
    2024 INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKING AND COMMUNICATIONS, ICNC, 2024, : 611 - 617