Multi-Stream Scheduling of Inference Pipelines on Edge Devices - a DRL Approach

被引:0
|
作者
Pereira, Danny [1 ]
Ghosh, Sumana [2 ]
Dey, Soumyajit [1 ]
机构
[1] Indian Inst Technol Kharagpur, Comp Sci & Engn, Kharagpur, West Bengal, India
[2] Indian Stat Inst, Comp & Commun Sci Div, Kolkata, West Bengal, India
关键词
Convolutional neural network; edge device; GPU; deep reinforcement learning; real-time scheduling;
D O I
10.1145/3677378
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Low-power edge devices equipped with Graphics Processing Units (GPUs) are a popular target platform for real-time scheduling of inference pipelines. Such application-architecture combinations are popular in Advanced Driver-assistance Systems for aiding in the real-time decision-making of automotive controllers. However, the real-time throughput sustainable by such inference pipelines is limited by resource constraints of the target edge devices. Modern GPUs, both in edge devices and workstation variants, support the facility of concurrent execution of computation kernels and data transfers using the primitive of streams, also allowing for the assignment of priority to these streams. This opens up the possibility of executing computation layers of inference pipelines within a multi-priority, multi-stream environment on the GPU. However, manually co-scheduling such applications while satisfying their throughput requirement and platform memory budget may require an unmanageable number of profiling runs. In this work, we propose a Deep Reinforcement Learning (DRL)-based method for deciding the start time of various operations in each pipeline layer while optimizing the latency of execution of inference pipelines as well as memory consumption. Experimental results demonstrate the promising efficacy of the proposed DRL approach in comparison with the baseline methods, particularly in terms of real-time performance enhancements, schedulability ratio, and memory savings. We have additionally assessed the effectiveness of the proposed DRL approach using a real-time traffic simulation tool IPG CarMaker.
引用
收藏
页数:36
相关论文
共 50 条
  • [1] DRL-based Multi-Stream Scheduling of Inference Pipelines on Edge Devices
    Pereria, Danny
    Ghosh, Sumana
    Dey, Soumyajit
    PROCEEDINGS OF THE 37TH INTERNATIONAL CONFERENCE ON VLSI DESIGN, VLSID 2024 AND 23RD INTERNATIONAL CONFERENCE ON EMBEDDED SYSTEMS, ES 2024, 2024, : 324 - 329
  • [2] DEVICES FOR PAINTING BY MULTI-STREAM SPRINKLING
    BOKSZCZA.W
    MECHANIK MIESIECZNIK NAUKOWO-TECHNICZNY, 1971, 44 (10): : 568 - &
  • [3] A Multi-Stream Approach for Video Understanding
    Kunam, Lutharsanen
    Rossetto, Luca
    Bernstein, Abraham
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 7003 - 7007
  • [4] Multi-Stream LDPC Decoder on GPU of Mobile Devices
    Amiri, Roohollah
    Mehrpouyan, Hani
    2019 IEEE 9TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2019, : 1004 - 1009
  • [5] BiSwift: Bandwidth Orchestrator for Multi-Stream Video Analytics on Edge
    Su, Lin
    Wang, Weijun
    Yuan, Tingting
    Li, Liang
    Dai, Aipeng
    Liu, Yunxin
    Fu, Xiaoming
    IEEE INFOCOM 2024-IEEE CONFERENCE ON COMPUTER COMMUNICATIONS, 2024, : 1181 - 1190
  • [6] PACKETGAME: Multi-Stream Packet Gating for Concurrent Video Inference at Scale
    Yuan, Mu
    Zhang, Lan
    You, Xuanke
    Li, Xiang-Yang
    PROCEEDINGS OF THE 2023 ACM SIGCOMM 2023 CONFERENCE, SIGCOMM 2023, 2023, : 724 - 737
  • [7] A Multi-Stream Approach for Seizure Classification with Knowledge Distillation
    Hou, Jen-Cheng
    McGonigal, Aileen
    Bartolomei, Fabrice
    Thonnat, Monique
    2021 17TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS 2021), 2021,
  • [8] A GAME THEORETIC APPROACH TO MULTI-STREAM QOS ROUTING
    Man, Hong
    Li, Yang
    GLOBECOM 2006 - 2006 IEEE GLOBAL TELECOMMUNICATIONS CONFERENCE, 2006,
  • [9] A multi-stream approach to audiovisual automatic speech recognition
    Hasegawa-Johnson, Mark
    2007 IEEE NINTH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 2007, : 328 - 331
  • [10] A Multi-Stream Feature Fusion Approach for Traffic Prediction
    Li, Zhishuai
    Xiong, Gang
    Tian, Yonglin
    Lv, Yisheng
    Chen, Yuanyuan
    Hui, Pan
    Su, Xiang
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (02) : 1456 - 1466