Efficient representation of disoccluded regions in 3D video coding

Cited by: 1
Authors
Farid, Muhammad Shahid [1]
Babar, Badi uz Zaman [1]
Khan, Muhammad Hassan [1]
Affiliations
[1] Univ Punjab, Dept Comp Sci, Lahore 54590, Pakistan
Keywords
Multiview video plus depth; 3D video coding; Free-viewpoint TV; Autostereoscopy; View synthesis; Multiview; Compression; Extensions
DOI
10.1007/s12243-024-01019-3
Chinese Library Classification number
TN [Electronic technology, communication technology]
Discipline classification code
0809
Abstract
Three-dimensional (3D) video technology has gained wide popularity in recent years owing to its many applications, particularly in the television and cinema industries. Three-dimensional television (3DTV) and free-viewpoint television (FTV) are two well-known applications that provide the end user with a realistic, high-quality 3D display. In both applications, multiple views captured from different viewpoints are rendered simultaneously to give the viewer a sensation of depth. FTV requires a large number of views, but transmitting this massive amount of data is challenging because of bandwidth limitations. Multiview video-plus-depth (MVD) is the most popular format; in addition to the color images, it provides the corresponding depth information, which represents the scene geometry. Together with depth image-based rendering (DIBR), the MVD format enables the generation of views at novel viewpoints. In this paper, we introduce a panorama-based representation of MVD data with an efficient keyframe-based disocclusion-handling technique. The panorama view for a stereo pair with depth is constructed from the left view and the newly appearing region of the right view, which is not visible from the left viewpoint. The disocclusions that appear in the right view when it is obtained by DIBR of the left view are collected in a special frame called a keyframe. On the decoder side, the left view is recovered by simply cropping the panorama view. The right view is obtained through DIBR of the left view combined with the appearing region from the panorama view, and the disocclusions in this warped view are filled from the keyframe. The panorama view with the additional keyframes and the corresponding depth map are compressed using the standard HEVC codec. Experimental evaluations on standard MVD sequences show that the proposed scheme achieves excellent video quality while saving considerable bit rate compared with HEVC simulcast.
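The decoder-side steps described in the abstract (crop the left view, warp it to the right viewpoint, paste the appearing region, fill the remaining holes from the keyframe) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes rectified cameras, an integer horizontal disparity map in place of the depth map, an appearing region stored as a strip at the right edge of the panorama, and a keyframe that is pixel-aligned with the right view. The function names `warp_left_to_right` and `decode_stereo_pair` are hypothetical.

```python
import numpy as np

def warp_left_to_right(left, disparity):
    """Forward-warp the left view to the right viewpoint (simplified DIBR).

    Assumption: rectified views, so a left pixel at column x lands at
    column x - d in the right view, where d is its integer disparity.
    Returns the warped image and a boolean mask of unfilled pixels (holes).
    """
    h, w = disparity.shape
    warped = np.zeros_like(left)
    # Largest disparity written so far per target pixel (nearest object wins).
    best = np.full((h, w), -1, dtype=np.int64)
    for y in range(h):
        for x in range(w):
            d = int(disparity[y, x])
            xr = x - d
            if 0 <= xr < w and d > best[y, xr]:
                warped[y, xr] = left[y, x]
                best[y, xr] = d
    return warped, best < 0

def decode_stereo_pair(panorama, disparity, keyframe, width):
    """Recover the left and right views from a panorama and a keyframe.

    Assumptions (illustrative only): the panorama is the left view followed
    by the appearing strip, and the keyframe is aligned with the right view.
    """
    left = panorama[:, :width]    # left view: a simple crop
    strip = panorama[:, width:]   # appearing region of the right view
    right, holes = warp_left_to_right(left, disparity)

    s = strip.shape[1]
    if s > 0:
        # Paste the appearing region into the border holes of the right view.
        border = right[:, width - s:]
        border_holes = holes[:, width - s:]
        border[border_holes] = strip[border_holes]
        holes[:, width - s:] = False

    # Fill the remaining disocclusions from the keyframe.
    right[holes] = keyframe[holes]
    return left, right

# Toy single-row example: a foreground object (disparity 3) over background (disparity 1).
panorama = np.array([[10, 20, 30, 40, 50, 60, 70, 80, 90]], dtype=np.uint8)
disparity = np.array([[1, 1, 1, 3, 3, 1, 1, 1]], dtype=np.int64)
keyframe = np.array([[0, 0, 33, 44, 0, 0, 0, 0]], dtype=np.uint8)
left, right = decode_stereo_pair(panorama, disparity, keyframe, width=8)
```

In the toy example the foreground object shifts left in the warped view, which uncovers two background pixels to its right. Those are the disocclusions that the keyframe supplies, while the strip supplies the border column that is outside the left view.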
Pages: 123-137
Number of pages: 15
Related papers
50 in total
  • [1] 3D VIDEO COMPRESSION BY CODING OF DISOCCLUDED REGIONS
    Domanski, Marek
    Konieczny, Jacek
    Kurc, Maciej
    Ratajczak, Robert
    Siast, Jakub
    Stankiewicz, Olgierd
    Stankowski, Jakub
    Wegner, Krzysztof
    2012 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2012), 2012, : 1317 - 1320
  • [2] An Overview of 3D Video Representation and Coding
    Jiang, Lianlian
    He, Jiangqian
    Zhang, Nan
    Huang, Tiejun
    3D RESEARCH, 2010, 1 (01) : 43 - 47
  • [3] Video coding using streamed 3D representation
    Galpin, F
    Morin, L
    2000 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL III, PROCEEDINGS, 2000, : 636 - 639
  • [4] NONLINEAR DEPTH REPRESENTATION FOR 3D VIDEO CODING
    Stankiewicz, Olgierd
    Wegner, Krzysztof
    Domanski, Marek
    2013 20TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2013), 2013, : 1752 - 1756
  • [5] A Compact Stereoscopic Video Representation for 3D Video Generation and Coding
    Zhang, Zhebin
    Wang, Ronggang
    Zhou, Chen
    Wang, Yizhou
    Gao, Wen
    2012 DATA COMPRESSION CONFERENCE (DCC), 2012, : 189 - 198
  • [6] 3D models coding and morphing for efficient video compression
    Galpin, F
    Balter, R
    Morin, L
    Deguchi, K
    PROCEEDINGS OF THE 2004 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, 2004, : 334 - 341
  • [7] Efficient depth coding in 3D video to minimize coding bitrate and complexity
    Liquan Shen
    Zhaoyang Zhang
    Multimedia Tools and Applications, 2014, 72 : 1639 - 1652
  • [8] Efficient depth coding in 3D video to minimize coding bitrate and complexity
    Shen, Liquan
    Zhang, Zhaoyang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2014, 72 (02) : 1639 - 1652
  • [9] ASYMMETRIC 3D VIDEO CODING USING REGIONS OF PERCEPTUAL RELEVANCE
    Pinto, Luis
    Assuncao, Pedro
    2012 International Conference on 3D Imaging (IC3D), 2012,
  • [10] Scalable and efficient coding of 3D model extracted from a video
    Balter, R
    Gioia, P
    Morin, L
    Galpin, F
    2ND INTERNATIONAL SYMPOSIUM ON 3D DATA PROCESSING, VISUALIZATION, AND TRANSMISSION, PROCEEDINGS, 2004, : 836 - 843