Q-learning for Waiting Time Control in CDN/V2V Live streaming

被引:0
|
作者
Ma, Zhejiayu [1 ]
Roubia, Soufiane [1 ]
Giroire, Frederic [2 ]
Urvoy-Keller, Guillaume [2 ]
机构
[1] EasyBroadcast, Nantes, France
[2] Univ Cote Azur, CNRS, Sophia Antipolis, France
关键词
hybrid P2P; live streaming; q-learning; machine learning;
D O I
10.23919/IFIPNetworking57963.2023.10186429
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
HTTP-based streaming has become the dominant technology for streaming due to the widespread adoption of the HTTP protocol. Many streaming providers use a combination of Content Delivery Network (CDN) and Viewer-to-Viewer (V2V) technology, known as Hybrid CDN/V2V live streaming, for both efficiency and cost-effectiveness. V2V technology allows for offloading streaming traffic from the CDN and reducing operational costs, and WebRTC technology facilitates direct V2V transfer, as it is natively supported by all browsers. In a WebRTC-based V2V network, some viewers cache the video chunks on their devices, while others wait and fetch chunks from their neighbors. A common strategy used to determine when a viewer should stop waiting for chunk delivery and revert to the CDN is called Random Waiting Time Control (RWC). However, due to the complex dynamics in the V2V system, RWC is far from optimal. In this work, we have formulated the Waiting Time Control determination problem as a reinforcement learning problem and proposed a Q-learning-based Waiting Time Control (QWC) solution. We conducted offline experiments in the Grid5000 [1] testbed and validated our results through a 14-day A/B testing in the wild. Our findings showed that QWC improves overall streaming Quality-of-Experience (QoE) in rebuffering (-29% fewer events), video quality (+17% higher), and buffer length (+5% longer), with a slightly improved V2V ratio (+5% more).
引用
收藏
页数:9
相关论文
共 50 条
  • [21] Coexistence of decentralized congestion control algorithms for V2V communication
    Math, Chetan Belagal
    Li, Hong
    Abanto-Leon, Luis F.
    de Groot, Sonia Heemstra
    Niemegeers, Ignas
    2018 IEEE 87TH VEHICULAR TECHNOLOGY CONFERENCE (VTC SPRING), 2018,
  • [22] Deep Reinforcement Learning Based Resource Allocation for V2V Communications
    Ye, Hao
    Li, Geoffrey Ye
    Juang, Biing-Hwang Fred
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, 68 (04) : 3163 - 3173
  • [23] V2V Routing in VANET Based on Fuzzy Logic and Reinforcement Learning
    Zhang, W. L.
    Yang, X. Y.
    Song, Q. X.
    Zhao, L.
    INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL, 2021, 16 (01) : 1 - 19
  • [24] A Real-Time Millimeter Wave V2V Channel Sounder
    Chopra, Aditya
    Thornburg, Andrew
    Kanhere, Ojas
    Ghassemzadeh, Saeed S.
    Majmundar, Milap
    Rappaport, Theodore S.
    2022 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2022, : 2607 - 2612
  • [25] V2V Cooperative Sensing using Reinforcement Learning with Action Branching
    Abdel-Aziz, Mohamed K.
    Perfecto, Cristina
    Samarakoon, Sumudu
    Bennis, Mehdi
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2021), 2021,
  • [26] Acceleration control strategy for Battery Electric Vehicle based on Deep Reinforcement Learning in V2V driving
    Acquarone, Matteo
    Borneo, Angelo
    Misul, Daniela Anna
    2022 IEEE/AIAA TRANSPORTATION ELECTRIFICATION CONFERENCE AND ELECTRIC AIRCRAFT TECHNOLOGIES SYMPOSIUM (ITEC+EATS 2022), 2022, : 202 - 207
  • [27] Congestion Control in V2V Safety Communication: Problem, Analysis, Approaches
    Liu, Xiaofeng
    Jaekel, Arunita
    ELECTRONICS, 2019, 8 (05):
  • [28] Optimal Time Partitioning in V2V Integrated Sensing and Communication Systems
    Gaur, Abhilash
    Balakrishnan, Ashutosh
    Srirangarajan, Seshan
    De, Swades
    Tseng, Po-Hsuan
    Feng, Kai-Ten
    IEEE WIRELESS COMMUNICATIONS LETTERS, 2024, 13 (12) : 3390 - 3394
  • [29] Cooperative Finite-time Control for autonomous vehicles platoons with nonuniform V2V communication delays
    Caiazzo, Bianca
    Lui, Dario Giuseppe
    Petrillo, Alberto
    Santini, Stefania
    IFAC PAPERSONLINE, 2022, 55 (36): : 145 - 150
  • [30] Decentralized Congestion Control Protocols for V2V Communications and Their Challenging Issues
    Song, Heecheol
    Lee, Hwang Soo
    2014 INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY CONVERGENCE (ICTC), 2014, : 505 - 509