Auto-Split: A General Framework of Collaborative Edge-Cloud AI

被引:47
|
作者
Banitalebi-Dehkordi, Amin [1 ]
Vedula, Naveen [1 ]
Pei, Jian [2 ]
Xia, Fei [3 ]
Wang, Lanjun [1 ]
Zhang, Yong [1 ]
机构
[1] Huawei Technol Canada Co Ltd, Vancouver, BC, Canada
[2] Simon Fraser Univ, Sch Comp Sci, Vancouver, BC, Canada
[3] Huawei Technol, Shenzhen, Peoples R China
关键词
Edge-Cloud Collaboration; Network Splitting; Neural Networks; Mixed Precision; Collaborative Intelligence; Distributed Inference;
D O I
10.1145/3447548.3467078
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In many industry scale applications, large and resource consuming machine learning models reside in powerful cloud servers. At the same time, large amounts of input data are collected at the edge of cloud. The inference results are also communicated to users or passed to downstream tasks at the edge. The edge often consists of a large number of low-power devices. It is a big challenge to design industry products to support sophisticated deep model deployment and conduct model inference in an efficient manner so that the model accuracy remains high and the end-to-end latency is kept low. This paper describes the techniques and engineering practice behind AUTO-SPLIT, an edge-cloud collaborative prototype of Huawei Cloud. This patented technology is already validated on selected applications, is on its way for broader systematic edge cloud application integration, and is being made available for public use as an automated pipeline service for end-to-end cloud-edge collaborative intelligence deployment. To the best of our knowledge, there is no existing industry product that provides the capability of Deep Neural Network (DNN) splitting.
引用
收藏
页码:2543 / 2553
页数:11
相关论文
共 50 条
  • [31] ACE: Toward Application-Centric, Edge-Cloud, Collaborative Intelligence
    Wang, Luhui
    Zhao, Cong
    Yang, Shusen
    Yang, Xinyu
    McCann, Julie
    COMMUNICATIONS OF THE ACM, 2023, 66 (01) : 62 - 73
  • [32] Collaborative Edge-Cloud Data Transfer Optimization for Industrial Internet of Things
    Zhang, Xinchang
    Wang, Maoli
    Zhu, Xiaomin
    Yan, Zhiwei
    Geng, Guanggang
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2025, 36 (03) : 580 - 597
  • [33] An edge-cloud collaborative computing platform for building AIoT applications efficiently
    Guoping Rong
    Yangchen Xu
    Xinxin Tong
    Haojun Fan
    Journal of Cloud Computing, 10
  • [34] Reliable and Data-driven AI Applications in Edge-Cloud Environments
    Ko, In-Young
    Mrissa, Michael
    Srivastava, Abhishek
    FRONTIERS OF COMPUTER VISION, IW-FCV 2024, 2024, 2143 : 2 - 4
  • [35] An Edge-Cloud Collaboration Framework for Graph Processing in Smart Society
    Zhou, Jun
    Kondo, Masaaki
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2023, 11 (04) : 985 - 1001
  • [36] A framework for offloading and migration of serverless functions in the Edge-Cloud Continuum
    Russo, Gabriele Russo
    Cardellini, Valeria
    Lo Presti, Francesco
    PERVASIVE AND MOBILE COMPUTING, 2024, 100
  • [37] Edge-cloud collaborative intelligent production scheduling based on digital twin
    Yifan, Han
    Tao, Feng
    Xiaokai, Liü
    Fangmin, Xu
    Chenglin, Zhao
    Journal of China Universities of Posts and Telecommunications, 2022, 29 (02): : 108 - 120
  • [38] Context-Aware Edge-Cloud Collaborative Scene Text Recognition
    Zhang, Puning
    Liu, Changfeng
    Wang, Honggang
    Wu, Dapeng
    Wang, Ruyan
    Zou, Hong
    2024 INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKING AND COMMUNICATIONS, ICNC, 2024, : 611 - 617
  • [39] Distributed Photovoltaic Scenario Generation Based on Edge-Cloud Collaborative Architecture
    Huang, Jinju
    Mao, Zhihang
    Xie, Chenzheng
    Sun, Yingyun
    2022 IEEE/IAS INDUSTRIAL AND COMMERCIAL POWER SYSTEM ASIA (I&CPS ASIA 2022), 2022, : 1806 - 1810
  • [40] An edge-cloud collaborative computing platform for building AIoT applications efficiently
    Rong, Guoping
    Xu, Yangchen
    Tong, Xinxin
    Fan, Haojun
    JOURNAL OF CLOUD COMPUTING-ADVANCES SYSTEMS AND APPLICATIONS, 2021, 10 (01):