HiTDL: High-Throughput Deep Learning Inference at the Hybrid Mobile Edge

Cited by: 31
Authors
Wu, Jing [1]
Wang, Lin [2, 3]
Pei, Qiangyu [1]
Cui, Xingqi [1]
Liu, Fangming [1]
Yang, Tingting [4]
Affiliations
[1] Huazhong Univ Sci & Technol, Natl Engn Res Ctr Big Data Technol & Syst, Serv Comp Technol & Syst Lab, Cluster & Grid Comp Lab, Sch Comp Sci & Technol, Wuhan 430074, Peoples R China
[2] Vrije Univ Amsterdam, NL-1081 HV Amsterdam, Netherlands
[3] Tech Univ Darmstadt, D-64289 Darmstadt, Germany
[4] Peng Cheng Lab, Shenzhen 518066, Peoples R China
Keywords
Deep learning inference; edge computing; resource allocation; systems for machine learning
DOI
10.1109/TPDS.2022.3195664
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Deep neural networks (DNNs) have become a critical component for inference in modern mobile applications, but the efficient provisioning of DNNs is non-trivial. Existing mobile- and server-based approaches compromise either the inference accuracy or latency. Instead, a hybrid approach can reap the benefits of the two by splitting the DNN at an appropriate layer and running the two parts on the mobile device and the server, respectively. Nevertheless, the DNN throughput in the hybrid approach has not been carefully examined, which is particularly important for edge servers where limited compute resources are shared among multiple DNNs. This article presents HiTDL, a runtime framework for managing multiple DNNs provisioned following the hybrid approach at the edge. HiTDL's mission is to improve edge resource efficiency by optimizing the combined throughput of all co-located DNNs, while still guaranteeing their SLAs. To this end, HiTDL first builds comprehensive performance models for DNN inference latency and throughput with respect to multiple factors including resource availability, DNN partition plan, and cross-DNN interference. HiTDL then uses these models to generate a set of candidate partition plans with SLA guarantees for each DNN. Finally, HiTDL makes global throughput-optimal resource allocation decisions by selecting partition plans from the candidate set for each DNN by solving a fairness-aware multiple-choice knapsack problem. Experimental results based on a prototype implementation show that HiTDL improves the overall throughput of the edge by 4.3x compared with the state-of-the-art.
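The plan-selection step described at the end of the abstract can be illustrated with a plain multiple-choice knapsack. The sketch below is not HiTDL's implementation: it assumes each candidate plan is reduced to a hypothetical pair (integer edge-resource units, throughput), that every candidate already satisfies its SLA, and it leaves out the fairness term the abstract mentions. The function name `select_plans` and the sample numbers are invented for illustration.

```python
from typing import List, Optional, Tuple

# Hypothetical plan representation: (edge resource units needed, throughput).
Plan = Tuple[int, float]


def select_plans(
    candidates: List[List[Plan]], capacity: int
) -> Optional[Tuple[float, List[int]]]:
    """Pick exactly one plan per DNN so that the total resource use stays
    within `capacity` and the summed throughput is maximal.

    Returns (best total throughput, chosen plan index per DNN), or None
    if no combination fits. Fairness is NOT modeled in this sketch.
    """
    neg_inf = float("-inf")
    # best[c]: max throughput of the DNNs processed so far with budget c.
    best = [0.0] * (capacity + 1)
    # picks[i][c]: plan index chosen for DNN i when the budget is c.
    picks: List[List[int]] = []

    for plans in candidates:
        new_best = [neg_inf] * (capacity + 1)
        pick = [-1] * (capacity + 1)
        for c in range(capacity + 1):
            for j, (cost, tput) in enumerate(plans):
                if cost <= c and best[c - cost] != neg_inf:
                    value = best[c - cost] + tput
                    if value > new_best[c]:
                        new_best[c] = value
                        pick[c] = j
        best = new_best
        picks.append(pick)

    if best[capacity] == neg_inf:
        return None

    # Walk backwards through the DNNs to recover the chosen plans.
    chosen: List[int] = []
    c = capacity
    for i in range(len(candidates) - 1, -1, -1):
        j = picks[i][c]
        chosen.append(j)
        c -= candidates[i][j][0]
    chosen.reverse()
    return best[capacity], chosen


# Made-up candidate sets for three co-located DNNs.
example = [
    [(1, 10.0), (2, 18.0), (3, 24.0)],
    [(1, 8.0), (2, 20.0)],
    [(2, 15.0), (3, 16.0)],
]
result = select_plans(example, capacity=6)
print(result)
```

The dynamic program runs in time proportional to the capacity times the total number of candidate plans, which is one reason for discretizing resources into coarse integer units in a sketch like this.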
Pages: 4499-4514
Page count: 16
Related papers
50 in total
  • [21] Deep learning accelerated high-throughput screening of organic solar cells
    Zhang, Wenlin
    Zou, Yurong
    Wang, Xin
    Chen, Junxian
    Xu, Dingguo
    JOURNAL OF MATERIALS CHEMISTRY C, 2025, 13 (10) : 5295 - 5306
  • [22] High-throughput ovarian follicle counting by an innovative deep learning approach
    Sonigo, Charlotte
    Jankowski, Stephane
    Yoo, Olivier
    Trassard, Olivier
    Bousquet, Nicolas
    Grynberg, Michael
    Beau, Isabelle
    Binart, Nadine
    SCIENTIFIC REPORTS, 2018, 8
  • [23] Deep learning approaches for detecting high-throughput screening false positives
    Matlock, Matthew
    Hughes, Tyler
    Swamidass, S. Joshua
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2018, 255
  • [24] High-throughput deep learning variant effect prediction with Sequence UNET
    Alistair S. Dunham
    Pedro Beltrao
    Mohammed AlQuraishi
    Genome Biology, 24
  • [25] A deep learning model for detection and tracking in high-throughput images of organoid
    Bian, Xuesheng
    Li, Gang
    Wang, Cheng
    Liu, Weiquan
    Lin, Xiuhong
    Chen, Zexin
    Cheung, Mancheung
    Luo, Xiongbiao
    COMPUTERS IN BIOLOGY AND MEDICINE, 2021, 134
  • [26] Autodidactic Neurosurgeon: Collaborative Deep Inference for Mobile Edge Intelligence via Online Learning
    Zhang, Letian
    Chen, Lixing
    Xu, Jie
    PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2021 (WWW 2021), 2021, : 3111 - 3123
  • [27] Deep Reinforcement Learning Based Admission Control for Throughput Maximization in Mobile Edge Computing
    Zhou, Yitong
    Ye, Qiang
    Huang, Hui
    Du, Hongwei
    2021 IEEE 94TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2021-FALL), 2021,
  • [28] High-Throughput Synchronous Deep RL
    Liu, Iou-Jen
    Yeh, Raymond A.
    Schwing, Alexander G.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [29] HiPR: High-throughput probabilistic RNA structure inference
    Kuksa, Pavel P.
    Li, Fan
    Kannan, Sampath
    Gregory, Brian D.
    Leung, Yuk Yee
    Wang, Li-San
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2020, 18 : 1539 - 1547
  • [30] Mobile Deep Learning Processors on the Edge
    Yoo, Hoi-Jun
    2019 IEEE CUSTOM INTEGRATED CIRCUITS CONFERENCE (CICC), 2019,