Auto-Split: A General Framework of Collaborative Edge-Cloud AI

被引:47
|
作者
Banitalebi-Dehkordi, Amin [1 ]
Vedula, Naveen [1 ]
Pei, Jian [2 ]
Xia, Fei [3 ]
Wang, Lanjun [1 ]
Zhang, Yong [1 ]
机构
[1] Huawei Technol Canada Co Ltd, Vancouver, BC, Canada
[2] Simon Fraser Univ, Sch Comp Sci, Vancouver, BC, Canada
[3] Huawei Technol, Shenzhen, Peoples R China
关键词
Edge-Cloud Collaboration; Network Splitting; Neural Networks; Mixed Precision; Collaborative Intelligence; Distributed Inference;
D O I
10.1145/3447548.3467078
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In many industry scale applications, large and resource consuming machine learning models reside in powerful cloud servers. At the same time, large amounts of input data are collected at the edge of cloud. The inference results are also communicated to users or passed to downstream tasks at the edge. The edge often consists of a large number of low-power devices. It is a big challenge to design industry products to support sophisticated deep model deployment and conduct model inference in an efficient manner so that the model accuracy remains high and the end-to-end latency is kept low. This paper describes the techniques and engineering practice behind AUTO-SPLIT, an edge-cloud collaborative prototype of Huawei Cloud. This patented technology is already validated on selected applications, is on its way for broader systematic edge cloud application integration, and is being made available for public use as an automated pipeline service for end-to-end cloud-edge collaborative intelligence deployment. To the best of our knowledge, there is no existing industry product that provides the capability of Deep Neural Network (DNN) splitting.
引用
收藏
页码:2543 / 2553
页数:11
相关论文
共 50 条
  • [21] Split Edge-Cloud Neural Networks for Better Adversarial Robustness
    Douch, Salmane
    Abid, Mohamed Riduan
    Zine-Dine, Khalid
    Bouzidi, Driss
    Benhaddou, Driss
    IEEE ACCESS, 2024, 12 : 158854 - 158865
  • [22] Towards Edge-Cloud Collaborative Machine Learning: A Quality-aware Task Partition Framework
    Zheng, Zimu
    Li, Yunzhe
    Song, Han
    Wang, Lanjun
    Xia, Fei
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 3705 - 3714
  • [23] An Intelligent Edge-Cloud Collaborative Framework for Communication Security in Distributed Cyber-Physical Systems
    Chen, Cen
    Li, Yangfan
    Wang, Qinyu
    Yang, Xulei
    Wang, Xiaokang
    Yang, Laurence T.
    IEEE NETWORK, 2024, 38 (01): : 172 - 179
  • [24] Patra ModelCards: AI/ML Accountability in the Edge-Cloud Continuum
    Withana, Sachith
    Plale, Beth
    2024 IEEE 20TH INTERNATIONAL CONFERENCE ON E-SCIENCE, E-SCIENCE 2024, 2024,
  • [25] An Experimental Implementation of an Edge-based AI Engine with Edge-Cloud Coordination
    Yamakami, Toshihiko
    2018 18TH INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES (ISCIT), 2018, : 442 - 446
  • [26] Collaborative Optimization of Edge-Cloud Computation Offloading in Internet of Vehicles
    Li, Yureng
    Xu, Shouzhi
    30TH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATIONS AND NETWORKS (ICCCN 2021), 2021,
  • [27] Using Collaborative Edge-Cloud Cache for Search in Internet of Things
    Tang, Jine
    Zhou, Zhangbing
    Xue, Xiao
    Wang, Gongwen
    IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (02) : 922 - 936
  • [28] Optimizing Face Recognition Inference with a Collaborative Edge-Cloud Network
    Oroceo, Paul P.
    Kim, Jeong-In
    Caliwag, Ej Miguel Francisco
    Kim, Sang-Ho
    Lim, Wansu
    SENSORS, 2022, 22 (21)
  • [29] MPCSM: Microservice Placement for Edge-Cloud Collaborative Smart Manufacturing
    Wang, Yimeng
    Zhao, Cong
    Yang, Shusen
    Ren, Xuebin
    Wang, Luhui
    Zhao, Peng
    Yang, Xinyu
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (09) : 5898 - 5908
  • [30] Task Offloading and Resource Allocation for Edge-Cloud Collaborative Computing
    Wang, Yaxing
    Hao, Jia
    Xu, Gang
    Huang, Baoqi
    Zhang, Feng
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2023, PT V, 2024, 14491 : 361 - 372