Improving Mobile Interactive Video QoE via Two-Level Online Cooperative Learning

被引:8
|
作者
Zhang, Huanhuan [1 ]
Zhou, Anfu [1 ]
Ma, Huadong [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Beijing Key Lab Intelligent Telecommun Software &, Beijing 100876, Peoples R China
基金
国家重点研发计划;
关键词
Interactive video; online reinforcement learning; cooperative learning; learning aggregation; EFFICIENCY;
D O I
10.1109/TMC.2022.3179782
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Machine learning models, particularly reinforcement learning (RL), have demonstrated great potential in optimizing video streaming applications. However, the state-of-the-art solutions are limited to an "offline learning" paradigm, i.e., the RL models are trained in simulators and then are operated in real networks. As a result, they inevitably suffer from the simulation-to-reality gap, showing far less satisfactory performance under real conditions compared with simulated environment. In this article, we close the gap by proposing Legato, an online RL framework for real-time mobile interactive video systems. Legato puts many individual RL agents directly into the video system, which make video bitrate decisions in real-time and evolve their models over time. Legato then employs a two-level cooperative learning mechanism to enhance video QoE. First, Legato proposes a score-based robust learning algorithm to eliminate risks of quality degradation caused by the RL model's exploration attempts. Then, Legato adaptively aggregates agents following a network condition-aware manner to form its corresponding high-level RL model that can help each individual to react to unseen network conditions. We implement Legato on an interactive real-time video system. Based on the exhaustive evaluations, we find that Legato outperforms the state-of-the-art algorithms significantly across a wide range of QoE metrics.
引用
收藏
页码:5900 / 5917
页数:18
相关论文
共 50 条
  • [21] Interactive Two-Level WEBSOM for Organizational Exploration
    Honkela, Timo
    Knapek, Michael
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2013, 2013, 8131 : 579 - 585
  • [22] Ethernet-based interactive video-on-demand system with two-level gateway model
    Chang, SH
    Lee, MH
    Huang, YM
    Wu, JN
    Chang, BG
    MULTIMEDIA SYSTEMS AND APPLICATIONS-BOOK, 1999, 3528 : 181 - 190
  • [23] Improving GPU Performance via Large Warps and Two-Level Warp Scheduling
    Narasiman, Veynu
    Shebanow, Michael
    Lee, Chang Joo
    Miftakhutdinov, Rustam
    Mutlu, Onur
    Patt, Yale N.
    PROCEEDINGS OF THE 2011 44TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO 44), 2011, : 308 - 317
  • [24] Two-Phase Rate Adaptation Strategy for Improving Real-Time Video QoE in Mobile Networks
    Xiao, Ailing
    Liu, Jie
    Li, Yizhe
    Song, Qiwei
    Ge, Ning
    CHINA COMMUNICATIONS, 2018, 15 (10) : 12 - 24
  • [25] Two-Phase Rate Adaptation Strategy for Improving Real-Time Video QoE in Mobile Networks
    Ailing Xiao
    Jie Liu
    Yizhe Li
    Qiwei Song
    Ning Ge
    中国通信, 2018, 15 (10) : 12 - 24
  • [26] An approach to generating two-level video abstraction
    Cheng, WG
    Xu, D
    2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, : 2896 - 2900
  • [27] Characterizing, Modeling, and Improving the QoE of Mobile Devices with Low Battery Level
    Yan, Kaige
    Zhang, Xingyao
    Fu, Xin
    PROCEEDINGS OF THE 48TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO-48), 2015, : 713 - 724
  • [28] A two-level patching scheme for video multicast
    Guan, DL
    Yu, SY
    ICON 2003: 11TH IEEE INTERNATIONAL CONFERENCE ON NETWORKS, 2003, : 289 - 293
  • [29] Two-level workload characterization of online auctions
    Akula, Vasudeva
    Menasce, Daniel A.
    ELECTRONIC COMMERCE RESEARCH AND APPLICATIONS, 2007, 6 (02) : 192 - 208
  • [30] A Two-Level Online Parameter Identification Approach
    Chen, Runze
    Wu, Wenchuan
    Sun, Hongbin
    Zhang, Boming
    2013 IEEE POWER AND ENERGY SOCIETY GENERAL MEETING (PES), 2013,