Deep Reinforcement Learning with Importance Weighted A3C for QoE enhancement in Video Delivery Services

被引：3

作者：

Naresh, Mandan ^{[1
]}

Saxena, Paresh ^{[1
]}

Gupta, Manik ^{[1
]}

机构：

[1] Birla Inst Technol & Sci Pilani, Comp Sci & Informat Syst, Hyderabad, India

来源：

2023 IEEE 24TH INTERNATIONAL SYMPOSIUM ON A WORLD OF WIRELESS, MOBILE AND MULTIMEDIA NETWORKS, WOWMOM | 2023年

关键词：

Deep Reinforcement Learning; Video Delivery; Quality of Experience (QoE); Adaptive Bit Rates (ABR); Actorcritic methods;

D O I：

10.1109/WoWMoM57956.2023.00024

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Adaptive bitrate (ABR) algorithms are used to adapt the video bitrate based on the network conditions to improve the overall video quality of experience (QoE). Recently, reinforcement learning (RL) and asynchronous advantage actor-critic (A3C) methods have been used to generate adaptive bit rate algorithms and they have been shown to improve the overall QoE as compared to fixed rule ABR algorithms. However, a common issue in the A3C methods is the lag between behaviour policy and target policy. As a result, the behaviour and the target policies are no longer synchronized which results in suboptimal updates. In this work, we present ALISA: An Actor-Learner Architecture with Importance Sampling for efficient learning in ABR algorithms. ALISA incorporates importance sampling weights to give more weightage to relevant experience to address the lag issues with the existing A3C methods. We present the design and implementation of ALISA, and compare its performance to state-of-the-art video rate adaptation algorithms including vanilla A3C implemented in the Pensieve framework and other fixed-rule schedulers like BB, BOLA, and RB. Our results show that ALISA improves average QoE by up to 25%-48% higher average QoE than Pensieve, and even more when compared to fixed-rule schedulers.

引用

页码：97 / 106

页数：10

共 29 条

[1] A3C Deep Reinforcement Learning Model Compression and Knowledge Extraction
Zhang J.
Wang Z.
Ren Y.
Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2023, 60 (06): : 1373 - 1384
[2] Air Combat Maneuver Decision Method Based on A3C Deep Reinforcement Learning
Fan, Zihao
Xu, Yang
Kang, Yuhang
Luo, Delin
MACHINES, 2022, 10 (11)
[3] Improved Video QoE in Wireless Networks using Deep Reinforcement Learning
Moura, Henrique D.
Oliveira, Junia Maisa
Soares, Daniel
Macedo, Daniel F.
Vieira, Marcos A. M.
2023 19TH INTERNATIONAL CONFERENCE ON NETWORK AND SERVICE MANAGEMENT, CNSM, 2023,
[4] Deeplive: QoE Optimization for Live Video Streaming through Deep Reinforcement Learning
Tian, Zhao
Zhao, Laiping
Nie, Lihai
Chen, Peiqi
Chen, Shuyu
2019 IEEE 25TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2019, : 827 - 831
[5] Resource Pricing and Allocation in MEC Enabled Blockchain Systems: An A3C Deep Reinforcement Learning Approach
Du, Jianbo
Cheng, Wenjie
Lu, Guangyue
Cao, Haotong
Chu, Xiaoli
Zhang, Zhicai
Wang, Junxuan
IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2022, 9 (01): : 33 - 44
[6] Mask-Attention A3C: Visual Explanation of Action-State Value in Deep Reinforcement Learning
Itaya, Hidenori
Hirakawa, Tsubasa
Yamashita, Takayoshi
Fujiyoshi, Hironobu
Sugiura, Komei
IEEE ACCESS, 2024, 12 : 86553 - 86571
[7] Improving Search Through A3C Reinforcement Learning Based Conversational Agent
Aggarwal, Milan
Arora, Aarushi
Sodhani, Shagun
Krishnamurthy, Balaji
COMPUTATIONAL SCIENCE - ICCS 2018, PT II, 2018, 10861 : 273 - 286
[8] QoE Estimation of DASH-Based Mobile Video Application Using Deep Reinforcement Learning
Hou, Biao
Zhang, Junxing
ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2020, PT II, 2020, 12453 : 633 - 645
[9] Deep Reinforcement Learning-Based Resource Allocation for QoE Enhancement in Wireless VR Communications
Kougioumtzidis, Georgios
Poulkov, Vladimir K.
Lazaridis, Pavlos I.
Zaharis, Zaharias D.
IEEE ACCESS, 2025, 13 : 25045 - 25058
[10] DeePref: Deep Reinforcement Learning For Video Prefetching In Content Delivery Networks
Alkassab, Nawras
Huang, Chin-Tser
Botran, Tania Lorido
2024 33RD INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATIONS AND NETWORKS, ICCCN 2024, 2024,

← 1 2 3 →