Deep Reinforcement Learning with Importance Weighted A3C for QoE enhancement in Video Delivery Services

被引:3
|
作者
Naresh, Mandan [1 ]
Saxena, Paresh [1 ]
Gupta, Manik [1 ]
机构
[1] Birla Inst Technol & Sci Pilani, Comp Sci & Informat Syst, Hyderabad, India
关键词
Deep Reinforcement Learning; Video Delivery; Quality of Experience (QoE); Adaptive Bit Rates (ABR); Actorcritic methods;
D O I
10.1109/WoWMoM57956.2023.00024
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Adaptive bitrate (ABR) algorithms are used to adapt the video bitrate based on the network conditions to improve the overall video quality of experience (QoE). Recently, reinforcement learning (RL) and asynchronous advantage actor-critic (A3C) methods have been used to generate adaptive bit rate algorithms and they have been shown to improve the overall QoE as compared to fixed rule ABR algorithms. However, a common issue in the A3C methods is the lag between behaviour policy and target policy. As a result, the behaviour and the target policies are no longer synchronized which results in suboptimal updates. In this work, we present ALISA: An Actor-Learner Architecture with Importance Sampling for efficient learning in ABR algorithms. ALISA incorporates importance sampling weights to give more weightage to relevant experience to address the lag issues with the existing A3C methods. We present the design and implementation of ALISA, and compare its performance to state-of-the-art video rate adaptation algorithms including vanilla A3C implemented in the Pensieve framework and other fixed-rule schedulers like BB, BOLA, and RB. Our results show that ALISA improves average QoE by up to 25%-48% higher average QoE than Pensieve, and even more when compared to fixed-rule schedulers.
引用
收藏
页码:97 / 106
页数:10
相关论文
共 29 条
  • [1] A3C Deep Reinforcement Learning Model Compression and Knowledge Extraction
    Zhang J.
    Wang Z.
    Ren Y.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2023, 60 (06): : 1373 - 1384
  • [2] Air Combat Maneuver Decision Method Based on A3C Deep Reinforcement Learning
    Fan, Zihao
    Xu, Yang
    Kang, Yuhang
    Luo, Delin
    MACHINES, 2022, 10 (11)
  • [3] Improved Video QoE in Wireless Networks using Deep Reinforcement Learning
    Moura, Henrique D.
    Oliveira, Junia Maisa
    Soares, Daniel
    Macedo, Daniel F.
    Vieira, Marcos A. M.
    2023 19TH INTERNATIONAL CONFERENCE ON NETWORK AND SERVICE MANAGEMENT, CNSM, 2023,
  • [4] Deeplive: QoE Optimization for Live Video Streaming through Deep Reinforcement Learning
    Tian, Zhao
    Zhao, Laiping
    Nie, Lihai
    Chen, Peiqi
    Chen, Shuyu
    2019 IEEE 25TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2019, : 827 - 831
  • [5] Resource Pricing and Allocation in MEC Enabled Blockchain Systems: An A3C Deep Reinforcement Learning Approach
    Du, Jianbo
    Cheng, Wenjie
    Lu, Guangyue
    Cao, Haotong
    Chu, Xiaoli
    Zhang, Zhicai
    Wang, Junxuan
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2022, 9 (01): : 33 - 44
  • [6] Mask-Attention A3C: Visual Explanation of Action-State Value in Deep Reinforcement Learning
    Itaya, Hidenori
    Hirakawa, Tsubasa
    Yamashita, Takayoshi
    Fujiyoshi, Hironobu
    Sugiura, Komei
    IEEE ACCESS, 2024, 12 : 86553 - 86571
  • [7] Improving Search Through A3C Reinforcement Learning Based Conversational Agent
    Aggarwal, Milan
    Arora, Aarushi
    Sodhani, Shagun
    Krishnamurthy, Balaji
    COMPUTATIONAL SCIENCE - ICCS 2018, PT II, 2018, 10861 : 273 - 286
  • [8] QoE Estimation of DASH-Based Mobile Video Application Using Deep Reinforcement Learning
    Hou, Biao
    Zhang, Junxing
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2020, PT II, 2020, 12453 : 633 - 645
  • [9] Deep Reinforcement Learning-Based Resource Allocation for QoE Enhancement in Wireless VR Communications
    Kougioumtzidis, Georgios
    Poulkov, Vladimir K.
    Lazaridis, Pavlos I.
    Zaharis, Zaharias D.
    IEEE ACCESS, 2025, 13 : 25045 - 25058
  • [10] DeePref: Deep Reinforcement Learning For Video Prefetching In Content Delivery Networks
    Alkassab, Nawras
    Huang, Chin-Tser
    Botran, Tania Lorido
    2024 33RD INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATIONS AND NETWORKS, ICCCN 2024, 2024,