User Navigation Modeling, Rate-Distortion Analysis, and End-to-End Optimization for Viewport-Driven 360° Video Streaming

Cited by: 4

Authors:

Chakareski, Jacob [1]

Corbillon, Xavier [2]

Simon, Gwendal [3]

Swaminathan, Viswanathan [4]

Affiliations:

[1] New Jersey Inst Technol, Coll Comp, Newark, NJ 07103 USA

[2] Tiledmedia, Rotterdam, Zuid Holland, Netherlands

[3] Synamedia, Networking, Rennes, France

[4] Adobe, Adobe Res, San Jose, CA USA

Source:

IEEE TRANSACTIONS ON MULTIMEDIA | 2023 / Vol. 25

Funding:

National Institutes of Health (USA); National Science Foundation (USA);

Keywords:

Omnidirectional video; quality of experience; viewport-adaptive 360° video streaming; rate-distortion analysis and optimization; user navigation modeling;

DOI:

10.1109/TMM.2022.3201397

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

The emerging technologies of Virtual Reality (VR) and 360° video introduce new challenges for state-of-the-art video communication systems. Enormous data volume and spatial user navigation are unique characteristics of 360° videos that necessitate a space-time effective allocation of the available network streaming bandwidth over the 360° video content to maximize the Quality of Experience (QoE) delivered to the user. Towards this objective, we investigate a framework for viewport-driven rate-distortion optimized 360° video streaming that integrates the user view navigation patterns and the spatiotemporal rate-distortion characteristics of the 360° video content to maximize the delivered user viewport video quality, for the given network/system resources. The framework comprises a methodology for assigning dynamic navigation likelihoods over the 360° video spatiotemporal panorama, induced by the user navigation patterns, an analysis and characterization of the 360° video panorama's spatiotemporal rate-distortion characteristics that leverage preprocessed spatial tiling of the content, and an optimization problem formulation and solution that capture and aim to maximize the delivered expected viewport video quality, given a user's navigation patterns, the 360° video encoding/streaming decisions, and the available system/network resources. We formulate a Markov model to capture the navigation patterns of a user over the 360° video panorama and simultaneously extend our actual navigation datasets by synthesizing additional realistic navigation data. Moreover, we investigate the impact of using two different tile sizes for equirectangular tiling of the 360° video panorama. Our experimental results demonstrate the advantages of our framework over the conventional approach of streaming a monolithic uniformly-encoded 360° video and a state-of-the-art navigation-speed-based reference method. Considerable average and instantaneous viewport video quality gains of up to 5 dB are demonstrated in the case of five popular 4K 360° videos. In addition, we explore the impact of two different popular 360° video quality metrics applied to evaluate the streaming performance of our system framework and the two reference methods. Finally, we demonstrate that by exploiting the unequal rate-distortion characteristics of the different spatial sectors of the 360° video panorama, we can enable spatially more uniform and temporally higher 360° video viewport quality delivered to the user, relative to monolithic streaming.


Pages: 5941-5956

Page count: 16

50 items in total

[1] Viewport-Driven Rate-Distortion Optimized 360° Video Streaming
Chakareski, Jacob
Aksu, Ridvan
Corbillon, Xavier
Simon, Gwendal
Swaminathan, Viswanathan
2018 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2018
[2] Viewport-Driven Rate-Distortion Optimized Scalable Live 360° Video Network Multicast
Aksu, Ridvan
Chakareski, Jacob
Swaminathan, Viswanathan
2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW 2018), 2018
[3] End-to-End Rate-Distortion Optimization for Bi-Directional Learned Video Compression
Yilmaz, M. Akin
Tekalp, A. Murat
2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1311 - 1315
[4] End-to-end rate-distortion optimized motion estimation
Wan, Shuai
Izquierdo, Ebroul
Yang, Fuzheng
Chang, Yilin
2006 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP 2006, PROCEEDINGS, 2006, : 809 - +
[5] End-to-end rate-distortion optimized mode selection for multiple description video coding
Heng, BA
Apostolopoulos, JG
Lim, AS
2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 905 - 908
[6] Error-Resilient Multi-view Video Coding Based on End-to-End Rate-Distortion Optimization
Gao Pan
Peng Qiang
Wang Qionghua
CHINESE JOURNAL OF ELECTRONICS, 2016, 25 (02) : 277 - 283
[7] Error-Resilient Multi-view Video Coding Based on End-to-End Rate-Distortion Optimization
GAO Pan
PENG Qiang
WANG Qionghua
Chinese Journal of Electronics, 2016, 25 (02) : 277 - 283
[8] Stereoscopic Video Streaming with End-to-End Modeling
Tan, A. Serdar
Aksay, Anil
Akar, Goezde Bozdagi
Arikan, Erdal
2008 IEEE 16TH SIGNAL PROCESSING, COMMUNICATION AND APPLICATIONS CONFERENCE, VOLS 1 AND 2, 2008, : 541 - +
[9] Ensemble Learning-Based Rate-Distortion Optimization for End-to-End Image Compression
Wang, Yefei
Liu, Dong
Ma, Siwei
Wu, Feng
Gao, Wen
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (03) : 1193 - 1207
[10] Models and Analysis of Video Streaming End-to-End Distortion over LTE Network
Fu, Huayong
Yuan, Hui
Li, Mengyu
Sun, Zhenzhen
Li, Fengrong
PROCEEDINGS OF THE 2016 IEEE 11TH CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA), 2016, : 516 - 521
