Efficient representation of disoccluded regions in 3D video coding

被引：1

作者：

Farid, Muhammad Shahid ^{[1
]}

Babar, Badi uz Zaman ^{[1
]}

Khan, Muhammad Hassan ^{[1
]}

机构：

[1] Univ Punjab, Dept Comp Sci, Lahore 54590, Pakistan

来源：

ANNALS OF TELECOMMUNICATIONS | 2025年 / 80卷 / 1-2期

关键词：

Multiview video plus depth; 3D video coding; Free-viewpoint TV; Autostereoscopy; VIEW SYNTHESIS; MULTIVIEW; COMPRESSION; EXTENSIONS;

D O I：

10.1007/s12243-024-01019-3

中图分类号：

TN [电子技术、通信技术];

学科分类号：

0809 ;

摘要：

Three-dimensional (3D) video technology has gained immense admiration in recent times due to its numerous applications, particularly in the television and cinema industry. Three-dimensional television (3DTV) and free-viewpoint television (FTV) are two well-known applications that provide the end-user with a real-world and high-quality 3D display. In both applications, multiple views captured from different viewpoints are rendered simultaneously to offer depth sensation to the viewer. A large number of views are needed to enable FTV. However, transmitting this massive amount of data is challenging due to bandwidth limitations. Multiview video-plus-depth (MVD) is the most popular format where in addition to color images, corresponding depth information is also available which represents the scene geometry. The MVD format with the help of depth image-based rendering (DIBR) enables the generation of views at novel viewpoints. In this paper, we introduce a panorama-based representation of MVD data with an efficient keyframe-based disocclusions handling technique. The panorama view for a stereo pair with depth is constructed from the left view and the novel appearing region of the right view which is not visible from the left viewpoint. The disocclusions that appear in the right view when obtained from the DIBR of the left view are collected in a special frame named as keyframe. On the decoder side, the left view is available with a simple crop of panorama view. The right view is obtained through DIBR of the left view combined with the appearing region from the panorama view. The disocclusions in this warped view are filled from the keyframe. The panorama view with additional keyframes and the corresponding depth map are compressed using the standard HEVC codec. The experimental evaluations performed on standard MVD sequences showed that the proposed scheme achieves excellent video quality while saving considerable bit rate compared to HEVC simulcast.

引用

页码：123 / 137

页数：15

共 50 条

[1] 3D VIDEO COMPRESSION BY CODING OF DISOCCLUDED REGIONS
Domanski, Marek
Konieczny, Jacek
Kurc, Maciej
Ratajczak, Robert
Siast, Jakub
Stankiewicz, Olgierd
Stankowski, Jakub
Wegner, Krzysztof
2012 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2012), 2012, : 1317 - 1320
[2] An Overview of 3D Video Representation and Coding
Jiang, Lianlian
He, Jiangqian
Zhang, Nan
Huang, Tiejun
3D RESEARCH, 2010, 1 (01) : 43 - 47
[3] Video coding using streamed 3D representation
Galpin, F
Morin, L
2000 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL III, PROCEEDINGS, 2000, : 636 - 639
[4] NONLINEAR DEPTH REPRESENTATION FOR 3D VIDEO CODING
Stankiewicz, Olgierd
Wegner, Krzysztof
Domanski, Marek
2013 20TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2013), 2013, : 1752 - 1756
[5] A Compact Stereoscopic Video Representation for 3D Video Generation and Coding
Zhang, Zhebin
Wang, Ronggang
Zhou, Chen
Wang, Yizhou
Gao, Wen
2012 DATA COMPRESSION CONFERENCE (DCC), 2012, : 189 - 198
[6] 3D models coding and morphing for efficient video compression
Galpin, F
Balter, R
Morin, L
Deguchi, K
PROCEEDINGS OF THE 2004 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, 2004, : 334 - 341
[7] Efficient depth coding in 3D video to minimize coding bitrate and complexity
Liquan Shen
Zhaoyang Zhang
Multimedia Tools and Applications, 2014, 72 : 1639 - 1652
[8] Efficient depth coding in 3D video to minimize coding bitrate and complexity
Shen, Liquan
Zhang, Zhaoyang
MULTIMEDIA TOOLS AND APPLICATIONS, 2014, 72 (02) : 1639 - 1652
[9] ASYMMETRIC 3D VIDEO CODING USING REGIONS OF PERCEPTUAL RELEVANCE
Pinto, Luis
Assuncao, Pedro
2012 International Conference on 3D Imaging (IC3D), 2012,
[10] Scalable and efficient coding of 3D model extracted from a video
Balter, R
Gioia, P
Morin, L
Galpin, F
2ND INTERNATIONAL SYMPOSIUM ON 3D DATA PROCESSING, VISUALIZATION, AND TRANSMISSION, PROCEEDINGS, 2004, : 836 - 843

← 1 2 3 4 5 →