HIDL: High-Throughput Deep Learning Inference at the Hybrid Mobile Edge

被引:31
|
作者
Wu, Jing [1 ]
Wang, Lin [2 ,3 ]
Pei, Qiangyu [1 ]
Cui, Xingqi [1 ]
Liu, Fangming [1 ]
Yang, Tingting [4 ]
机构
[1] Huazhong Univ Sci & Technol, Natl Engn Res Ctr Big Data Technol & Syst, Serv Comp Technol & Syst Lab, Cluster & Grid Comp Lab,Sch Comp Sci & Technol, Wuhan 430074, Peoples R China
[2] Vrije Univ Amsterdam, NL-1081 HV Amsterdam, Netherlands
[3] Tech Univ Darmstadt, D-64289 Darmstadt, Germany
[4] Peng Cheng Lab, Shenzhen 518066, Peoples R China
关键词
Deep learning inference; edge computing; resource allocation; systems for machine learning;
D O I
10.1109/TPDS.2022.3195664
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Deep neural networks (DNNs) have become a critical component for inference in modem mobile applications, but the efficient provisioning of DNNs is non-trivial. Existing mobile- and server-based approaches compromise either the inference accuracy or latency. Instead, a hybrid approach can reap the benefits of the two by splitting the DNN at an appropriate layer and running the two parts separately on the mobile and the server respectively. Nevertheless, the DNN throughput in the hybrid approach has not been carefully examined, which is particularly important for edge servers where limited compute resources are shared among multiple DNNs. This article presents HiTDL, a runtime framework for managing multiple DNNs provisioned following the hybrid approach at the edge. HiTDL's mission is to improve edge resource efficiency by optimizing the combined throughput of all co-located DNNs, while still guaranteeing their SLAB. To this end, HiTDL first builds comprehensive performance models for DNN inference latency and throughout with respect to multiple factors including resource availability, DNN partition plan, and cross-DNN interference. HiTDL then uses these models to generate a set of candidate partition plans with SLA guarantees for each DNN. Finally, HiTDL makes global throughput-optimal resource allocation decisions by selecting partition plans from the candidate set for each DNN via solving a fairness-aware multiple-choice knapsack problem. Experimental results based on a prototype implementation show that HiTDL improves the overall throughput of the edge by 4.3x compared with the state-of-the-art.
引用
收藏
页码:4499 / 4514
页数:16
相关论文
共 50 条
  • [41] High-Throughput Precision Phenotyping of Left Ventricular Hypertrophy With Cardiovascular Deep Learning
    Duffy, Grant
    Cheng, Paul P.
    Yuan, Neal
    He, Bryan
    Kwan, Alan C.
    Shun-Shin, Matthew J.
    Alexander, Kevin M.
    Ebinger, Joseph
    Lungren, Matthew P.
    Rader, Florian
    Liang, David H.
    Schnittger, Ingela
    Ashley, Euan A.
    Zou, James Y.
    Patel, Jignesh
    Witteles, Ronald
    Cheng, Susan
    Ouyang, David
    JAMA CARDIOLOGY, 2022, 7 (04) : 386 - 395
  • [42] Deep learning for cell-specific high-throughput quantification of oligodendrocyte ensheathment
    Xu, Y. K.
    Chitsaz, D.
    Cui, Q. L.
    Brown, R. A.
    Dabarno, M. A.
    Antel, J. P.
    Kennedy, T. E.
    MULTIPLE SCLEROSIS JOURNAL, 2018, 24 : 946 - 946
  • [43] PathFlowAI: A High-Throughput Workflow for Preprocessing, Deep Learning and Interpretation in Digital Pathology
    Levy, Joshua J.
    Salas, Lucas A.
    Christensen, Brock C.
    Sriharan, Aravindhan
    Vaickus, Louis J.
    PACIFIC SYMPOSIUM ON BIOCOMPUTING 2020, 2020, : 403 - 414
  • [44] High-throughput classification of S. cerevisiae tetrads using deep learning
    Szucs, Balint
    Selvan, Raghavendra
    Lisby, Michael
    YEAST, 2024, 41 (07) : 423 - 436
  • [45] Reaching for the Sky: Maximizing Deep Learning Inference Throughput on Edge Devices with AI Multi-Tenancy
    Hao, Jianwei
    Subedi, Piyush
    Ramaswamy, Lakshmish
    Kim, In Kee
    ACM TRANSACTIONS ON INTERNET TECHNOLOGY, 2023, 23 (01)
  • [46] Accelerating Deep Learning Inference on Mobile Systems
    Frajberg, Darian
    Bernaschina, Carlo
    Marone, Christian
    Fraternali, Piero
    ARTIFICIAL INTELLIGENCE AND MOBILE SERVICES - AIMS 2019, 2019, 11516 : 118 - 134
  • [47] Collaborative Inference for Mobile Deep Learning Applications
    Yang, Qinglin
    Luo, Xiaofei
    Li, Peng
    Miyazaki, Toshiaki
    2ND INTERNATIONAL CONFERENCE ON 5G FOR UBIQUITOUS CONNECTIVITY, 5GU 2018, 2020, : 1 - 12
  • [48] Mobile Microscopy and Machine Learning Provide Accurate and High-throughput Monitoring of Air Quality
    Wu, Yichen
    Shiledar, Ashutosh
    Li, Yicheng
    Wong, Jeffrey
    Feng, Steve
    Chen, Xuan
    Chen, Christine
    Jin, Kevin
    Janamian, Saba
    Yang, Zhe
    Ballard, Zach
    Gorocs, Zoltan
    Feizi, Alborz
    Ozcan, Aydogan
    2017 CONFERENCE ON LASERS AND ELECTRO-OPTICS (CLEO), 2017,
  • [49] High-throughput two-hybrid analysis
    Fields, S
    FEBS JOURNAL, 2005, 272 (21) : 5391 - 5399
  • [50] Deep learning based high-throughput phenotyping of chalkiness in rice exposed to high night temperature
    Chaoxin Wang
    Doina Caragea
    Nisarga Kodadinne Narayana
    Nathan T. Hein
    Raju Bheemanahalli
    Impa M. Somayanda
    S. V. Krishna Jagadish
    Plant Methods, 18