Q-learning for Waiting Time Control in CDN/V2V Live streaming

被引:0
|
作者
Ma, Zhejiayu [1 ]
Roubia, Soufiane [1 ]
Giroire, Frederic [2 ]
Urvoy-Keller, Guillaume [2 ]
机构
[1] EasyBroadcast, Nantes, France
[2] Univ Cote Azur, CNRS, Sophia Antipolis, France
关键词
hybrid P2P; live streaming; q-learning; machine learning;
D O I
10.23919/IFIPNetworking57963.2023.10186429
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
HTTP-based streaming has become the dominant technology for streaming due to the widespread adoption of the HTTP protocol. Many streaming providers use a combination of Content Delivery Network (CDN) and Viewer-to-Viewer (V2V) technology, known as Hybrid CDN/V2V live streaming, for both efficiency and cost-effectiveness. V2V technology allows for offloading streaming traffic from the CDN and reducing operational costs, and WebRTC technology facilitates direct V2V transfer, as it is natively supported by all browsers. In a WebRTC-based V2V network, some viewers cache the video chunks on their devices, while others wait and fetch chunks from their neighbors. A common strategy used to determine when a viewer should stop waiting for chunk delivery and revert to the CDN is called Random Waiting Time Control (RWC). However, due to the complex dynamics in the V2V system, RWC is far from optimal. In this work, we have formulated the Waiting Time Control determination problem as a reinforcement learning problem and proposed a Q-learning-based Waiting Time Control (QWC) solution. We conducted offline experiments in the Grid5000 [1] testbed and validated our results through a 14-day A/B testing in the wild. Our findings showed that QWC improves overall streaming Quality-of-Experience (QoE) in rebuffering (-29% fewer events), video quality (+17% higher), and buffer length (+5% longer), with a slightly improved V2V ratio (+5% more).
引用
收藏
页数:9
相关论文
共 50 条
  • [1] A Q-learning based adaptive congestion control for V2V communication in VANET
    Liu, Xiaofeng
    Amour, Ben St.
    Jaekel, Arunita
    2022 INTERNATIONAL WIRELESS COMMUNICATIONS AND MOBILE COMPUTING, IWCMC, 2022, : 847 - 852
  • [2] V2V Routing in VANET Based on Heuristic Q-Learning
    Yang, X. Y.
    Zhang, W. L.
    Lu, H. M.
    Zhao, L.
    INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL, 2020, 15 (05) : 1 - 17
  • [3] A Hybrid Routing Algorithm for V2V Communication in VANETs Based on Blocked Q-Learning
    Bi, Xiang
    Huang, Huang
    Zhang, Benhong
    Wei, Xing
    IEICE TRANSACTIONS ON COMMUNICATIONS, 2023, E106B (01) : 1 - 17
  • [4] Neighbor Selection Strategies in the Wild for CDN/V2V WebRTC Live Streaming: Can we learn what a good neighbor is?
    Ma, Zhejiayu
    Rouibia, Soufiane
    Giroire, Frederic
    Urvoy-Keller, Guillaume
    PROCEEDINGS OF THE 2022 47TH IEEE CONFERENCE ON LOCAL COMPUTER NETWORKS (LCN 2022), 2022, : 295 - 298
  • [5] A Data-Driven Analysis and Tuning of a Live Hybrid CDN/V2V Video Distribution System
    Sarkar, Ishani
    Roubia, Soufiane
    Lopez-Pacheco, Dino Martin
    Urvoy-Keller, Guillaume
    PASSIVE AND ACTIVE MEASUREMENT, PAM 2021, 2021, 12671 : 128 - 140
  • [6] Making Vehicles Transparent Through V2V Video Streaming
    Gomes, Pedro
    Olaverri-Monreal, Cristina
    Ferreira, Michel
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2012, 13 (02) : 930 - 938
  • [7] Q-LEARNING BASED CONTROL ALGORITHM FOR HTTP ADAPTIVE STREAMING
    Martin, Virginia
    Cabrera, Julian
    Garcia, Narciso
    2015 VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2015,
  • [8] A Reinforcement Learning-Based Congestion Control Approach for V2V Communication in VANET
    Liu, Xiaofeng
    St Amour, Ben
    Jaekel, Arunita
    APPLIED SCIENCES-BASEL, 2023, 13 (06):
  • [9] Active Roll Preview Control with V2V Communication
    Seongjin Yim
    International Journal of Automotive Technology, 2019, 20 : 169 - 175
  • [10] Active Roll Preview Control with V2V Communication
    Yim, Seongjin
    INTERNATIONAL JOURNAL OF AUTOMOTIVE TECHNOLOGY, 2019, 20 (01) : 169 - 175