Auto-Split: A General Framework of Collaborative Edge-Cloud AI

被引:47
|
作者
Banitalebi-Dehkordi, Amin [1 ]
Vedula, Naveen [1 ]
Pei, Jian [2 ]
Xia, Fei [3 ]
Wang, Lanjun [1 ]
Zhang, Yong [1 ]
机构
[1] Huawei Technol Canada Co Ltd, Vancouver, BC, Canada
[2] Simon Fraser Univ, Sch Comp Sci, Vancouver, BC, Canada
[3] Huawei Technol, Shenzhen, Peoples R China
关键词
Edge-Cloud Collaboration; Network Splitting; Neural Networks; Mixed Precision; Collaborative Intelligence; Distributed Inference;
D O I
10.1145/3447548.3467078
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In many industry scale applications, large and resource consuming machine learning models reside in powerful cloud servers. At the same time, large amounts of input data are collected at the edge of cloud. The inference results are also communicated to users or passed to downstream tasks at the edge. The edge often consists of a large number of low-power devices. It is a big challenge to design industry products to support sophisticated deep model deployment and conduct model inference in an efficient manner so that the model accuracy remains high and the end-to-end latency is kept low. This paper describes the techniques and engineering practice behind AUTO-SPLIT, an edge-cloud collaborative prototype of Huawei Cloud. This patented technology is already validated on selected applications, is on its way for broader systematic edge cloud application integration, and is being made available for public use as an automated pipeline service for end-to-end cloud-edge collaborative intelligence deployment. To the best of our knowledge, there is no existing industry product that provides the capability of Deep Neural Network (DNN) splitting.
引用
收藏
页码:2543 / 2553
页数:11
相关论文
共 50 条
  • [41] Adaptive joint configuration optimization for collaborative inference in edge-cloud systems
    Zheming YANG
    Wen JI
    Zhi WANG
    ScienceChina(InformationSciences), 2024, 67 (04) : 335 - 336
  • [42] Adaptive joint configuration optimization for collaborative inference in edge-cloud systems
    Yang, Zheming
    Ji, Wen
    Wang, Zhi
    SCIENCE CHINA-INFORMATION SCIENCES, 2024, 67 (04)
  • [43] Edge-cloud collaborative transfer of process knowledge for digital manufacturing monitoring
    Cao X.
    Yao B.
    He W.
    Chen B.
    Qing T.
    Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2022, 49 (06): : 152 - 163
  • [44] Attacking and Protecting Data Privacy in Edge-Cloud Collaborative Inference Systems
    He, Zecheng
    Zhang, Tianwei
    Lee, Ruby B.
    IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (12) : 9706 - 9716
  • [45] An Adaptive Neural Architecture Search Design for Collaborative Edge-Cloud Computing
    Lu, Haodong
    Du, Miao
    He, Xiaoming
    Qian, Kai
    Chen, Jianli
    Sun, Yanfei
    Wang, Kun
    IEEE NETWORK, 2021, 35 (05): : 83 - 89
  • [46] Edge-Cloud Collaborative Defense against Backdoor Attacks in Federated Learning
    Yang, Jie
    Zheng, Jun
    Wang, Haochen
    Li, Jiaxing
    Sun, Haipeng
    Han, Weifeng
    Jiang, Nan
    Tan, Yu-An
    SENSORS, 2023, 23 (03)
  • [47] Online data caching in edge-cloud collaborative system with the data center
    Xinxin Han
    Sijia Dai
    Guichen Gao
    Yang Wang
    Yong Zhang
    Journal of Combinatorial Optimization, 2022, 44 : 3351 - 3363
  • [48] An edge-cloud integrated framework for flexible and dynamic stream analytics
    Wang, Xin
    Khan, Azim
    Wang, Jianwu
    Gangopadhyay, Aryya
    Busart, Carl
    Freeman, Jade
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2022, 137 : 323 - 335
  • [49] Preference aware participant selection strategy for edge-cloud collaborative crowdsensing
    Wang, Ruyan
    Liu, Jia
    He, Peng
    Cui, Yaping
    Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2022, 49 (01): : 142 - 151
  • [50] Efficient Resource Management and Expansion Scheme for Collaborative Edge-Cloud Computing
    Wang, Wei
    Zhang, Yongmin
    Huang, Rui
    Ren, Ju
    Lyu, Feng
    Zhang, Yaoxue
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (04) : 2731 - 2747