Multi-Stream Scheduling of Inference Pipelines on Edge Devices - a DRL Approach

被引:0
|
作者
Pereira, Danny [1 ]
Ghosh, Sumana [2 ]
Dey, Soumyajit [1 ]
机构
[1] Indian Inst Technol Kharagpur, Comp Sci & Engn, Kharagpur, West Bengal, India
[2] Indian Stat Inst, Comp & Commun Sci Div, Kolkata, West Bengal, India
关键词
Convolutional neural network; edge device; GPU; deep reinforcement learning; real-time scheduling;
D O I
10.1145/3677378
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Low-power edge devices equipped with Graphics Processing Units (GPUs) are a popular target platform for real-time scheduling of inference pipelines. Such application-architecture combinations are popular in Advanced Driver-assistance Systems for aiding in the real-time decision-making of automotive controllers. However, the real-time throughput sustainable by such inference pipelines is limited by resource constraints of the target edge devices. Modern GPUs, both in edge devices and workstation variants, support the facility of concurrent execution of computation kernels and data transfers using the primitive of streams, also allowing for the assignment of priority to these streams. This opens up the possibility of executing computation layers of inference pipelines within a multi-priority, multi-stream environment on the GPU. However, manually co-scheduling such applications while satisfying their throughput requirement and platform memory budget may require an unmanageable number of profiling runs. In this work, we propose a Deep Reinforcement Learning (DRL)-based method for deciding the start time of various operations in each pipeline layer while optimizing the latency of execution of inference pipelines as well as memory consumption. Experimental results demonstrate the promising efficacy of the proposed DRL approach in comparison with the baseline methods, particularly in terms of real-time performance enhancements, schedulability ratio, and memory savings. We have additionally assessed the effectiveness of the proposed DRL approach using a real-time traffic simulation tool IPG CarMaker.
引用
收藏
页数:36
相关论文
共 50 条
  • [31] Alternative design approach for multipass and multi-stream plate heat exchangers for use in heat recovery systems
    Picon-Nunez, M.
    Martinez-Rodriguez, G.
    Lopez-Robles, J. L.
    HEAT TRANSFER ENGINEERING, 2006, 27 (06) : 12 - 21
  • [32] Real-Time Sound Source Localization for Low-Power IoT Devices Based on Multi-Stream CNN
    Ko, Jungbeom
    Kim, Hyunchul
    Kim, Jungsuk
    SENSORS, 2022, 22 (12)
  • [33] Hand gesture recognition using sEMG signals with a multi-stream time-varying feature enhancement approach
    Shin, Jungpil
    Miah, Abu Saleh Musa
    Konnai, Sota
    Takahashi, Itsuki
    Hirooka, Koki
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [34] BCEdge: SLO-Aware DNN Inference Services With Adaptive Batch-Concurrent Scheduling on Edge Devices
    Zhang, Ziyang
    Zhao, Yang
    Li, Huan
    Liu, Jie
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2024, 21 (04): : 4131 - 4145
  • [35] MOBILE EDGE COMPUTING ORIENTED MULTI-AGENT COOPERATIVE ROUTING ALGORITHM: A DRL-BASED APPROACH
    Lv, Jianhui
    Zhao, Shen
    Yi, Bo
    Li, Qing
    FRACTALS-COMPLEX GEOMETRY PATTERNS AND SCALING IN NATURE AND SOCIETY, 2023, 31 (06)
  • [36] PD-Net: Multi-Stream Hybrid Healthcare System for Parkinson's Disease Detection using Multi Learning Trick Approach
    Khan, Mustaqeem
    Khan, Ufaq
    Othmani, Alice
    2023 IEEE 36TH INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS, CBMS, 2023, : 382 - 385
  • [37] Leveraging Multi-Stream Information Fusion for Trajectory Prediction in Low-Illumination Scenarios: A Multi-Channel Graph Convolutional Approach
    Gong, Hailong
    Li, Zirui
    Lu, Chao
    Du, Guodong
    Gong, Jianwei
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (05) : 3854 - 3869
  • [38] Partitioning DNNs for Optimizing Distributed Inference Performance on Cooperative Edge Devices: A Genetic Algorithm Approach
    Na, Jun
    Zhang, Handuo
    Lian, Jiaxin
    Zhang, Bin
    APPLIED SCIENCES-BASEL, 2022, 12 (20):
  • [39] Taking a wider view A formative multi-stream social marketing approach to understanding human trafficking as a social issue in Nigeria
    Badejo, Foluke Abigail
    Rundle-Thiele, Sharyn
    Kubacki, Krzysztof
    JOURNAL OF SOCIAL MARKETING, 2019, 9 (04) : 467 - 484
  • [40] Optimal resource scheduling of multi-functional edge computing devices in digital distribution networks
    Yu, Hao
    Huang, Chaoming
    Song, Guanyu
    Ji, Haoran
    Zheng, Zhe
    Cui, Wenpeng
    AIN SHAMS ENGINEERING JOURNAL, 2024, 15 (09)