End-to-End Variable-Rate Learning-Based Depth Compression Guided by Deep Correlation Features

被引：0

作者：

Sebai, Dorsaf ^{[1
]}

Sehli, Maryem ^{[1
]}

Ghorbel, Faouzi ^{[1
]}

机构：

[1] Natl Sch Comp Sci ENSI, Cristal Lab, Manouba, Tunisia

来源：

JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY | 2024年 / 96卷 / 01期

关键词：

Depth maps; Learning-based compression; Wedgelets; Learnt deep correlation features; Variable-rate compression; MULTIVIEW;

D O I：

10.1007/s11265-023-01906-3

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The progress in the field of 3D video, particularly depth maps, is leading to the emergence of various technologies such as augmented, virtual, and mixed reality that have a wide range of applications in smart cities, intelligent transportation, AI-enabled farms, healthcare, education, industry, and more. Additionally, the future development of the Internet of Things (IoT) heavily depends on incorporating 3D vision and depth perception into machines like autonomous cars, robots, and drones, so that they effectively perceive their surroundings similar to how humans do. However, traditional compression methods that focus only on texture are not suitable for efficiently handle the large volume of depth maps due to the distinct features between texture and depth. To tackle this challenge, we aim to propose a model for compressing depth maps. Our approach utilizes a learning variable-rate method combined with a conditional quality-controllable autoencoder. The model consists of an encoder that automatically extracts features from depth maps using an optimized Convolutional Neural Network. This latter consists of an initial layer that uses predetermined wedgelet filters, succeeded by a VGG19 model. Additionally, we utilize a technique for classifying image styles based on Learnt Deep Correlation Features in order to learn deep features that distinguish depth maps from texture images. Our model objective is to optimize a loss function with multiple terms, which maintains the accuracy of depth discontinuities in the reconstructed output while also ensuring high-quality synthesis. By capturing and preserving deep features specific to depth maps, our end-to-end network achieves better R/D compression performances compared to related methods and depth-oriented 3D-HEVC standard.

引用

页码：81 / 97

页数：17

共 50 条

[41] Deep Learning-Based End-to-End Language Development Screening for Children Using Linguistic Knowledge
Oh, Byoung-Doo
Lee, Yoon-Kyoung
Kim, Jong-Dae
Park, Chan-Young
Kim, Yu-Seop
APPLIED SCIENCES-BASEL, 2022, 12 (09):
[42] End-to-end lossless compression of high precision depth maps guided by pseudo-residual
Wu, Yuyang
Gao, Wei
DCC 2022: 2022 DATA COMPRESSION CONFERENCE (DCC), 2022, : 489 - 489
[43] Low Rank Based End-to-End Deep Neural Network Compression
Jain, Swayambhoo
Hamidi-Rad, Shahab
Racape, Fabien
2021 DATA COMPRESSION CONFERENCE (DCC 2021), 2021, : 233 - 242
[44] END-TO-END DEPTH MAP COMPRESSION FRAMEWORK VIA RGB-TO-DEPTH STRUCTURE PRIORS LEARNING
Chen, Minghui
Zhang, Pingping
Chen, Zhuo
Zhang, Yun
Wang, Xu
Kwong, Sam
2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 3206 - 3210
[45] Video Multi-Scale-Based End-to-End Rate Control in Deep Contextual Video Compression
Wei, Lili
Yang, Zhenglong
Zhang, Hua
Liu, Xinyu
Deng, Weihao
Zhang, Youchao
APPLIED SCIENCES-BASEL, 2024, 14 (13):
[46] A Comprehensive Review on Deep Learning-Based Motion Planning and End-to-End Learning for Self-Driving Vehicle
Ganesan, Manikandan
Kandhasamy, Sivanathan
Chokkalingam, Bharatiraja
Mihet-Popa, Lucian
IEEE ACCESS, 2024, 12 : 66031 - 66067
[47] Learning True Rate-Distortion-Optimization for End-To-End Image Compression
Brand, Fabian
Fischer, Kristian
Kopte, Alexander
Kaup, Andre
DCC 2022: 2022 DATA COMPRESSION CONFERENCE (DCC), 2022, : 443 - 443
[48] End-to-end representation learning for Correlation Filter based tracking
Valmadre, Jack
Bertinetto, Luca
Henriques, Joao
Vedaldi, Andrea
Torr, Philip H. S.
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 5000 - 5008
[49] End-to-end sound field reproduction based on deep learning
Hong, Xi
Du, Bokai
Yang, Shuang
Lei, Menghui
Zeng, Xiangyang
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2023, 153 (05): : 3055 - 3064
[50] A deep learning network based end-to-end image composition
Zhu, Xiaoyu
Wang, Haodi
Zhang, Zhiyi
Wu, Xiuping
Guo, Junqi
Wu, Hao
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2022, 101

← 1 2 3 4 5 →