User Navigation Modeling, Rate-Distortion Analysis, and End-to-End Optimization for Viewport-Driven 360° Video Streaming

被引:4
|
作者
Chakareski, Jacob [1 ]
Corbillon, Xavier [2 ]
Simon, Gwendal [3 ]
Swaminathan, Viswanathan [4 ]
机构
[1] New Jersey Inst Technol, Coll Comp, Newark, NJ 07103 USA
[2] Tiledmedia, Rotterdam, Zuid Holland, Netherlands
[3] Synmedia, Networking, Rennes, France
[4] Adobe, Adobe Res, San Jose, CA USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
Omnidirectional video; quality of experience; viewport-adaptive 360 degrees video streaming; rate-distortion analysis and optimization; user navigation modeling;
D O I
10.1109/TMM.2022.3201397
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The emerging technologies of Virtual Reality (VR) and 360 degrees video introduce new challenges for state-of-the-art video communication systems. Enormous data volume and spatial user navigation are unique characteristics of 360 degrees videos that necessitate a space-time effective allocation of the available network streaming bandwidth over the 360 degrees video content to maximize the Quality of Experience (QoE) delivered to the user. Towards this objective, we investigate a framework for viewport-driven rate-distortion optimized 360 degrees video streaming that integrates the user view navigation patterns and the spatiotemporal ratedistortion characteristics of the 360 degrees video content to maximize the delivered user viewport video quality, for the given network/system resources. The framework comprises a methodology for assigning dynamic navigation likelihoods over the 360 degrees video spatiotemporal panorama, induced by the user navigation patterns, an analysis and characterization of the 360 degrees video panorama's spatiotemporal rate-distortion characteristics that leverage preprocessed spatial tilling of the content, and an optimization problem formulation and solution that capture and aim to maximize the delivered expected viewport video quality, given a user's navigation patterns, the 360 degrees video encoding/streaming decisions, and the available system/network resources. We formulate a Markov model to capture the navigation patterns of a user over the 360 degrees video panorama and simultaneously extend our actual navigation datasets by synthesizing additional realistic navigation data. Moreover, we investigate the impact of using two different tile sizes for equirectangular tiling of the 360 degrees video panorama. Our experimental results demonstrate the advantages of our framework over the conventional approach of streaming a monolithic uniformly-encoded 360 degrees video and a state-of-the-art navigation-speed based reference method. Considerable average and instantaneous viewport video quality gains of up to 5 dB are demonstrated in the case of five popular 4 K 360 degrees videos. In addition, we explore the impact of two different popular 360 degrees video quality metrics applied to evaluate the streaming performance of our system framework and the two reference methods. Finally, we demonstrate that by exploiting the unequal rate-distortion characteristics of the different spatial sectors of the 360 degrees video panorama, we can enable spatially more uniform and temporally higher 360 degrees video viewport quality delivered to the user, relative to monolithic streaming.
引用
收藏
页码:5941 / 5956
页数:16
相关论文
共 50 条
  • [1] Viewport-Driven Rate-Distortion Optimized 360° Video Streaming
    Chakareski, Jacob
    Aksu, Ridvan
    Corbillon, Xavier
    Simon, Gwendal
    Swaminathan, Viswanathan
    2018 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2018,
  • [2] Viewport-Driven Rate-Distortion Optimized Scalable Live 360° Video Network Multicast
    Aksu, Ridvan
    Chakareski, Jacob
    Swaminathan, Viswanathan
    2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW 2018), 2018,
  • [3] End-to-End Rate-Distortion Optimization for Bi-Directional Learned Video Compression
    Yilmaz, M. Akin
    Tekalp, A. Murat
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1311 - 1315
  • [4] End-to-end rate-distortion optimized motion estimation
    Wan, Shuai
    Izquierdo, Ebroul
    Yang, Fuzheng
    Chang, Yilin
    2006 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP 2006, PROCEEDINGS, 2006, : 809 - +
  • [5] End-to-end rate-distortion optimized mode selection for multiple description video coding
    Heng, BA
    Apostolopoulos, JG
    Lim, AS
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 905 - 908
  • [6] Error-Resilient Multi-view Video Coding Based on End-to-End Rate-Distortion Optimization
    Gao Pan
    Peng Qiang
    Wang Qionghua
    CHINESE JOURNAL OF ELECTRONICS, 2016, 25 (02) : 277 - 283
  • [7] Error-Resilient Multi-view Video Coding Based on End-to-End Rate-Distortion Optimization
    GAO Pan
    PENG Qiang
    WANG Qionghua
    Chinese Journal of Electronics, 2016, 25 (02) : 277 - 283
  • [8] Stereoscopic Video Streaming with End-to-End Modeling
    Tan, A. Serdar
    Aksay, Anil
    Akar, Goezde Bozdagi
    Arikan, Erdal
    2008 IEEE 16TH SIGNAL PROCESSING, COMMUNICATION AND APPLICATIONS CONFERENCE, VOLS 1 AND 2, 2008, : 541 - +
  • [9] Ensemble Learning-Based Rate-Distortion Optimization for End-to-End Image Compression
    Wang, Yefei
    Liu, Dong
    Ma, Siwei
    Wu, Feng
    Gao, Wen
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (03) : 1193 - 1207
  • [10] Models and Analysis of Video Streaming End-to-End Distortion over LTE Network
    Fu, Huayong
    Yuan, Hui
    Li, Mengyu
    Sun, Zhenzhen
    Li, Fengrong
    PROCEEDINGS OF THE 2016 IEEE 11TH CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA), 2016, : 516 - 521