Auto-Split: A General Framework of Collaborative Edge-Cloud AI

Cited by: 47
Authors
Banitalebi-Dehkordi, Amin [1]
Vedula, Naveen [1]
Pei, Jian [2]
Xia, Fei [3]
Wang, Lanjun [1]
Zhang, Yong [1]
Affiliations
[1] Huawei Technol Canada Co Ltd, Vancouver, BC, Canada
[2] Simon Fraser Univ, Sch Comp Sci, Vancouver, BC, Canada
[3] Huawei Technol, Shenzhen, Peoples R China
Keywords
Edge-Cloud Collaboration; Network Splitting; Neural Networks; Mixed Precision; Collaborative Intelligence; Distributed Inference;
DOI
10.1145/3447548.3467078
Chinese Library Classification number
TP18 [Artificial Intelligence Theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In many industry-scale applications, large and resource-consuming machine learning models reside in powerful cloud servers. At the same time, large amounts of input data are collected at the edge of the cloud. The inference results are also communicated to users or passed to downstream tasks at the edge. The edge often consists of a large number of low-power devices. It is a big challenge to design industry products that support sophisticated deep model deployment and conduct model inference efficiently, so that model accuracy remains high and end-to-end latency is kept low. This paper describes the techniques and engineering practice behind AUTO-SPLIT, an edge-cloud collaborative prototype of Huawei Cloud. This patented technology has already been validated on selected applications, is on its way to broader systematic edge-cloud application integration, and is being made available for public use as an automated pipeline service for end-to-end cloud-edge collaborative intelligence deployment. To the best of our knowledge, there is no existing industry product that provides the capability of Deep Neural Network (DNN) splitting.
Pages: 2543 - 2553
Number of pages: 11
Related papers
50 in total
  • [1] Nebula: An Edge-Cloud Collaborative Learning Framework for Dynamic Edge Environments
    Zhuang, Yan
    Zheng, Zhenzhe
    Shao, Yunfeng
    Li, Bingshuai
    Wu, Fan
    Chen, Guihai
    53RD INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2024, 2024, : 782 - 791
  • [2] Collaborative Edge-Cloud AI for IoT Driven Secure Healthcare System
    Gupta, Lay
    2023 IEEE INTERNATIONAL SYSTEMS CONFERENCE, SYSCON, 2023,
  • [3] Two-Phase Split Computing Framework in Edge-Cloud Continuum
    Ko, Haneul
    Kim, Bokyeong
    Kim, Yumi
    Pack, Sangheon
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (12): : 21741 - 21749
  • [4] ECOMA: Edge-Cloud Collaborative Framework for Multi-Task Applications
    Zhang, Zhipeng
    Ma, Wenting
    Xu, Qinqing
    Tang, Renjie
    Wang, Jinlang
    Chen, Wai
    2020 IEEE INTL SYMP ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, INTL CONF ON BIG DATA & CLOUD COMPUTING, INTL SYMP SOCIAL COMPUTING & NETWORKING, INTL CONF ON SUSTAINABLE COMPUTING & COMMUNICATIONS (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2020), 2020, : 992 - 997
  • [5] Collaborative Edge-Cloud and Edge-Edge Video Analytics
    Gazzaz, Samaa
    Nawab, Faisal
    PROCEEDINGS OF THE 2019 TENTH ACM SYMPOSIUM ON CLOUD COMPUTING (SOCC '19), 2019, : 484 - 484
  • [6] Efficient AI Applications in Edge-Cloud Environments
    Ko, In-Young
    Mrissa, Michael
    Murillo, Juan Manuel
    Srivastava, Abhishek
    JOURNAL OF WEB ENGINEERING, 2023, 22 (06): : V - VII
  • [7] Adaptive Edge-Cloud Environments for Rural AI
    Almurshed, Osama
    Patros, Panos
    Huang, Victoria
    Mayo, Michael
    Ooi, Melanie
    Chard, Ryan
    Chard, Kyle
    Rana, Omer
    Nagra, Harshaan
    Baughman, Matt
    Foster, Ian
    2022 IEEE INTERNATIONAL CONFERENCE ON SERVICES COMPUTING (IEEE SCC 2022), 2022, : 74 - 83
  • [8] EC5: Edge-cloud collaborative computing framework with compressive communication
    Tan, Jingwei
    Liu, Fagui
    Wang, Bin
    Wu, Qingbo
    Chen, C. L. Philip
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2025, 166
  • [9] A DNN partitioning framework with controlled lossy mechanisms for edge-cloud collaborative intelligence
    Kim, Hyochan
    Choi, Ji Sub
    Kim, Jungrae
    Ko, Jong Hwan
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 154 : 426 - 439
  • [10] An Edge-Cloud Collaborative Object Detection System
    Xu, Lei
    Yang, Dingkun
    UBIQUITOUS SECURITY, 2022, 1557 : 371 - 378