Joint compressing and partitioning of CNNs for fast edge-cloud collaborative intelligence for IoT

被引:7
|
作者
Zhang, Wanpeng [1 ]
Wang, Nuo [2 ]
Li, Liying [2 ]
Wei, Tongquan [2 ]
机构
[1] Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha, Peoples R China
[2] East China Normal Univ, Dept Comp Sci & Technol, Shanghai, Peoples R China
关键词
Edge-cloud collaborative intelligence; CNN acceleration; CNN partitioning; IoT; MODEL;
D O I
10.1016/j.sysarc.2022.102461
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The artificial intelligence (AI) empowered advanced technologies have been widely applied to process in real-time the vast amount of data in the internet of things (IoT) for a fast response. However, traditional approaches to deploying AI models impose overwhelming computation and communication overheads. In this paper, we propose a novel edge-cloud collaborative intelligence scheme that jointly compresses and partitions Convolutional Neural Network (CNN) models for fast response in IoT applications. The proposed approach first accelerates a CNN by using an acceleration technique to generate new layers that can serve as candidate partitioning since their outputs are smaller than the unaccelerated layers. It then designs fine-grained prediction models to accurately estimate the execution latency for each layer in the CNN model, and finds an optimal partitioning. The proposed approach splits the compressed CNN model into two parts according to the optimal partitioning. The obtained two parts are deployed at the edge device and in the cloud, respectively, which collaboratively minimize the overall latency without compromising the accuracy of the deep CNN model. To the best of our knowledge, this is the first work that jointly compresses and partitions CNN models for fast edge-cloud collaborative intelligence considering both execution latency and communication latency. Experimental results show that the proposed technique can reduce the latency by up to 73.14% compared to five benchmarking methods.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] A DNN partitioning framework with controlled lossy mechanisms for edge-cloud collaborative intelligence
    Kim, Hyochan
    Choi, Ji Sub
    Kim, Jungrae
    Ko, Jong Hwan
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 154 : 426 - 439
  • [2] A Fast Hierarchical Physical Topology Update Scheme for Edge-Cloud Collaborative IoT Systems
    Yu, Tianqi
    Wang, Xianbin
    Hu, Jianling
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2021, 29 (05) : 2254 - 2266
  • [3] Collaborative Edge-Cloud AI for IoT Driven Secure Healthcare System
    Gupta, Lay
    2023 IEEE INTERNATIONAL SYSTEMS CONFERENCE, SYSCON, 2023,
  • [4] ACE: Toward Application-Centric, Edge-Cloud, Collaborative Intelligence
    Wang, Luhui
    Zhao, Cong
    Yang, Shusen
    Yang, Xinyu
    McCann, Julie
    COMMUNICATIONS OF THE ACM, 2023, 66 (01) : 62 - 73
  • [5] Adaptive joint configuration optimization for collaborative inference in edge-cloud systems
    Zheming YANG
    Wen JI
    Zhi WANG
    Science China(Information Sciences), 2024, 67 (04) : 335 - 336
  • [6] Adaptive joint configuration optimization for collaborative inference in edge-cloud systems
    Yang, Zheming
    Ji, Wen
    Wang, Zhi
    SCIENCE CHINA-INFORMATION SCIENCES, 2024, 67 (04)
  • [7] Smart Collaborative Tracking for Ubiquitous Power IoT in Edge-Cloud Interplay Domain
    Song, Fei
    Zhu, Mingqiang
    Zhou, Yutong
    You, Ilsun
    Zhang, Hongke
    IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (07) : 6046 - 6055
  • [8] Collaborative Edge-Cloud and Edge-Edge Video Analytics
    Gazzaz, Samaa
    Nawab, Faisal
    PROCEEDINGS OF THE 2019 TENTH ACM SYMPOSIUM ON CLOUD COMPUTING (SOCC '19), 2019, : 484 - 484
  • [9] Intelligent and Scalable IoT Edge-Cloud System
    Manihar, Shifa
    Patel, Ravindra
    Rehman, Tasneem Bano
    Agrawal, Sanjay
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (08) : 359 - 364
  • [10] Intelligent and scalable IoT edge-cloud system
    Manihar S.
    Patel R.
    Rehman T.B.
    Agrawal S.
    1600, Science and Information Organization (11): : 359 - 364