User Navigation Modeling, Rate-Distortion Analysis, and End-to-End Optimization for Viewport-Driven 360° Video Streaming

Cited by: 4
Authors
Chakareski, Jacob [1]
Corbillon, Xavier [2]
Simon, Gwendal [3]
Swaminathan, Viswanathan [4]
Affiliations
[1] New Jersey Inst Technol, Coll Comp, Newark, NJ 07103 USA
[2] Tiledmedia, Rotterdam, Zuid Holland, Netherlands
[3] Synamedia, Networking, Rennes, France
[4] Adobe, Adobe Res, San Jose, CA USA
Funding
National Institutes of Health (USA); National Science Foundation (USA)
Keywords
Omnidirectional video; quality of experience; viewport-adaptive 360° video streaming; rate-distortion analysis and optimization; user navigation modeling
DOI
10.1109/TMM.2022.3201397
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
The emerging technologies of Virtual Reality (VR) and 360° video introduce new challenges for state-of-the-art video communication systems. Enormous data volume and spatial user navigation are unique characteristics of 360° videos that necessitate a space-time effective allocation of the available network streaming bandwidth over the 360° video content to maximize the Quality of Experience (QoE) delivered to the user. Towards this objective, we investigate a framework for viewport-driven rate-distortion optimized 360° video streaming that integrates the user view navigation patterns and the spatiotemporal rate-distortion characteristics of the 360° video content to maximize the delivered user viewport video quality, for the given network/system resources. The framework comprises a methodology for assigning dynamic navigation likelihoods over the 360° video spatiotemporal panorama, induced by the user navigation patterns; an analysis and characterization of the 360° video panorama's spatiotemporal rate-distortion characteristics that leverage preprocessed spatial tiling of the content; and an optimization problem formulation and solution that capture and aim to maximize the delivered expected viewport video quality, given a user's navigation patterns, the 360° video encoding/streaming decisions, and the available system/network resources. We formulate a Markov model to capture the navigation patterns of a user over the 360° video panorama and simultaneously extend our actual navigation datasets by synthesizing additional realistic navigation data. Moreover, we investigate the impact of using two different tile sizes for equirectangular tiling of the 360° video panorama. Our experimental results demonstrate the advantages of our framework over the conventional approach of streaming a monolithic uniformly encoded 360° video and a state-of-the-art navigation-speed-based reference method. Considerable average and instantaneous viewport video quality gains of up to 5 dB are demonstrated in the case of five popular 4K 360° videos. In addition, we explore the impact of two different popular 360° video quality metrics applied to evaluate the streaming performance of our system framework and the two reference methods. Finally, we demonstrate that by exploiting the unequal rate-distortion characteristics of the different spatial sectors of the 360° video panorama, we can enable spatially more uniform and temporally higher 360° video viewport quality delivered to the user, relative to monolithic streaming.
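The abstract mentions a Markov model that captures a user's navigation over the panorama and is also used to synthesize additional navigation data. The abstract does not give the model's details, so the following is only a minimal sketch under stated assumptions: the state is the index of the tile at the viewport center, the chain is first-order, and transitions are estimated by counting with additive smoothing. All function names, parameters, and the toy traces are hypothetical and are not taken from the paper.

```python
import random


def estimate_transition_matrix(traces, num_tiles, smoothing=1e-3):
    """Estimate P[i][j] ~ Pr(next tile = j | current tile = i) by counting.

    Assumption: each trace is a list of tile indices, one per time step.
    The small additive `smoothing` keeps every row a valid distribution.
    """
    counts = [[smoothing] * num_tiles for _ in range(num_tiles)]
    for trace in traces:
        for cur, nxt in zip(trace, trace[1:]):
            counts[cur][nxt] += 1.0
    return [[c / sum(row) for c in row] for row in counts]


def navigation_likelihoods(P, start_tile, horizon):
    """Propagate the tile distribution forward for `horizon` steps."""
    n = len(P)
    dist = [0.0] * n
    dist[start_tile] = 1.0
    steps = []
    for _ in range(horizon):
        dist = [sum(dist[i] * P[i][j] for i in range(n)) for j in range(n)]
        steps.append(dist)
    return steps


def synthesize_trace(P, start_tile, length, rng):
    """Sample a synthetic navigation trace from the estimated chain."""
    trace = [start_tile]
    for _ in range(length - 1):
        nxt = rng.choices(range(len(P)), weights=P[trace[-1]])[0]
        trace.append(nxt)
    return trace


# Toy data: three hypothetical traces over a panorama split into 4 tiles.
traces = [[0, 1, 1, 2, 3], [0, 0, 1, 2, 2], [3, 2, 1, 1, 0]]
P = estimate_transition_matrix(traces, num_tiles=4)
likelihoods = navigation_likelihoods(P, start_tile=1, horizon=3)
synthetic = synthesize_trace(P, start_tile=0, length=6, rng=random.Random(0))
```

In a sketch like this, the per-step likelihoods could serve as weights on each tile's distortion when forming an expected viewport quality, which is the quantity the abstract says the optimization aims to maximize.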
Pages: 5941-5956
Page count: 16