End-to-End Variable-Rate Learning-Based Depth Compression Guided by Deep Correlation Features

被引:0
|
作者
Sebai, Dorsaf [1 ]
Sehli, Maryem [1 ]
Ghorbel, Faouzi [1 ]
机构
[1] Natl Sch Comp Sci ENSI, Cristal Lab, Manouba, Tunisia
关键词
Depth maps; Learning-based compression; Wedgelets; Learnt deep correlation features; Variable-rate compression; MULTIVIEW;
D O I
10.1007/s11265-023-01906-3
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The progress in the field of 3D video, particularly depth maps, is leading to the emergence of various technologies such as augmented, virtual, and mixed reality that have a wide range of applications in smart cities, intelligent transportation, AI-enabled farms, healthcare, education, industry, and more. Additionally, the future development of the Internet of Things (IoT) heavily depends on incorporating 3D vision and depth perception into machines like autonomous cars, robots, and drones, so that they effectively perceive their surroundings similar to how humans do. However, traditional compression methods that focus only on texture are not suitable for efficiently handle the large volume of depth maps due to the distinct features between texture and depth. To tackle this challenge, we aim to propose a model for compressing depth maps. Our approach utilizes a learning variable-rate method combined with a conditional quality-controllable autoencoder. The model consists of an encoder that automatically extracts features from depth maps using an optimized Convolutional Neural Network. This latter consists of an initial layer that uses predetermined wedgelet filters, succeeded by a VGG19 model. Additionally, we utilize a technique for classifying image styles based on Learnt Deep Correlation Features in order to learn deep features that distinguish depth maps from texture images. Our model objective is to optimize a loss function with multiple terms, which maintains the accuracy of depth discontinuities in the reconstructed output while also ensuring high-quality synthesis. By capturing and preserving deep features specific to depth maps, our end-to-end network achieves better R/D compression performances compared to related methods and depth-oriented 3D-HEVC standard.
引用
收藏
页码:81 / 97
页数:17
相关论文
共 50 条
  • [41] Deep Learning-Based End-to-End Language Development Screening for Children Using Linguistic Knowledge
    Oh, Byoung-Doo
    Lee, Yoon-Kyoung
    Kim, Jong-Dae
    Park, Chan-Young
    Kim, Yu-Seop
    APPLIED SCIENCES-BASEL, 2022, 12 (09):
  • [42] End-to-end lossless compression of high precision depth maps guided by pseudo-residual
    Wu, Yuyang
    Gao, Wei
    DCC 2022: 2022 DATA COMPRESSION CONFERENCE (DCC), 2022, : 489 - 489
  • [43] Low Rank Based End-to-End Deep Neural Network Compression
    Jain, Swayambhoo
    Hamidi-Rad, Shahab
    Racape, Fabien
    2021 DATA COMPRESSION CONFERENCE (DCC 2021), 2021, : 233 - 242
  • [44] END-TO-END DEPTH MAP COMPRESSION FRAMEWORK VIA RGB-TO-DEPTH STRUCTURE PRIORS LEARNING
    Chen, Minghui
    Zhang, Pingping
    Chen, Zhuo
    Zhang, Yun
    Wang, Xu
    Kwong, Sam
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 3206 - 3210
  • [45] Video Multi-Scale-Based End-to-End Rate Control in Deep Contextual Video Compression
    Wei, Lili
    Yang, Zhenglong
    Zhang, Hua
    Liu, Xinyu
    Deng, Weihao
    Zhang, Youchao
    APPLIED SCIENCES-BASEL, 2024, 14 (13):
  • [46] A Comprehensive Review on Deep Learning-Based Motion Planning and End-to-End Learning for Self-Driving Vehicle
    Ganesan, Manikandan
    Kandhasamy, Sivanathan
    Chokkalingam, Bharatiraja
    Mihet-Popa, Lucian
    IEEE ACCESS, 2024, 12 : 66031 - 66067
  • [47] Learning True Rate-Distortion-Optimization for End-To-End Image Compression
    Brand, Fabian
    Fischer, Kristian
    Kopte, Alexander
    Kaup, Andre
    DCC 2022: 2022 DATA COMPRESSION CONFERENCE (DCC), 2022, : 443 - 443
  • [48] End-to-end representation learning for Correlation Filter based tracking
    Valmadre, Jack
    Bertinetto, Luca
    Henriques, Joao
    Vedaldi, Andrea
    Torr, Philip H. S.
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 5000 - 5008
  • [49] End-to-end sound field reproduction based on deep learning
    Hong, Xi
    Du, Bokai
    Yang, Shuang
    Lei, Menghui
    Zeng, Xiangyang
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2023, 153 (05): : 3055 - 3064
  • [50] A deep learning network based end-to-end image composition
    Zhu, Xiaoyu
    Wang, Haodi
    Zhang, Zhiyi
    Wu, Xiuping
    Guo, Junqi
    Wu, Hao
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2022, 101