Q-learning for Waiting Time Control in CDN/V2V Live streaming

Cited by: 0

Authors:

Ma, Zhejiayu [1]

Rouibia, Soufiane [1]

Giroire, Frederic [2]

Urvoy-Keller, Guillaume [2]

Affiliations:

[1] EasyBroadcast, Nantes, France

[2] Univ Cote Azur, CNRS, Sophia Antipolis, France

Source:

2023 IFIP NETWORKING CONFERENCE, IFIP NETWORKING | 2023

Keywords:

hybrid P2P; live streaming; q-learning; machine learning;

DOI:

10.23919/IFIPNetworking57963.2023.10186429

Chinese Library Classification (CLC) number:

TP39 [Computer applications];

Discipline classification code:

081203 ; 0835 ;

Abstract:

HTTP-based streaming has become the dominant streaming technology thanks to the widespread adoption of HTTP. Many streaming providers combine a Content Delivery Network (CDN) with Viewer-to-Viewer (V2V) technology, known as Hybrid CDN/V2V live streaming, for both efficiency and cost-effectiveness. V2V technology offloads streaming traffic from the CDN and reduces operational costs, and WebRTC facilitates direct V2V transfer, as it is natively supported by all browsers. In a WebRTC-based V2V network, some viewers cache video chunks on their devices, while others wait and fetch chunks from their neighbors. A common strategy for deciding when a viewer should stop waiting for chunk delivery and revert to the CDN is Random Waiting Time Control (RWC). However, due to the complex dynamics of the V2V system, RWC is far from optimal. In this work, we formulate the choice of waiting time as a reinforcement learning problem and propose a Q-learning-based Waiting Time Control (QWC) solution. We conducted offline experiments on the Grid5000 [1] testbed and validated our results through a 14-day A/B test in the wild. Our findings show that QWC improves overall streaming Quality-of-Experience (QoE) in rebuffering (29% fewer events), video quality (17% higher), and buffer length (5% longer), with a slightly improved V2V ratio (5% higher).

Pages: 9
