End-to-End Variable-Rate Learning-Based Depth Compression Guided by Deep Correlation Features

Cited by: 0

Authors:

Sebai, Dorsaf [1]

Sehli, Maryem [1]

Ghorbel, Faouzi [1]

Affiliations:

[1] Natl Sch Comp Sci ENSI, Cristal Lab, Manouba, Tunisia

Source:

JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY | 2024, Vol. 96, No. 1

Keywords:

Depth maps; Learning-based compression; Wedgelets; Learnt deep correlation features; Variable-rate compression; MULTIVIEW;

DOI:

10.1007/s11265-023-01906-3

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

Progress in 3D video, particularly in depth maps, is driving the emergence of technologies such as augmented, virtual, and mixed reality, which have a wide range of applications in smart cities, intelligent transportation, AI-enabled farms, healthcare, education, industry, and more. Additionally, the future development of the Internet of Things (IoT) depends heavily on incorporating 3D vision and depth perception into machines such as autonomous cars, robots, and drones, so that they perceive their surroundings much as humans do. However, traditional compression methods that focus only on texture are not suited to efficiently handling the large volume of depth maps, because texture and depth have distinct features. To tackle this challenge, we propose a model for compressing depth maps. Our approach uses a learned variable-rate method combined with a conditional quality-controllable autoencoder. The model consists of an encoder that automatically extracts features from depth maps using an optimized Convolutional Neural Network. This network consists of an initial layer that uses predetermined wedgelet filters, followed by a VGG19 model. Additionally, we use a technique for classifying image styles based on Learnt Deep Correlation Features in order to learn deep features that distinguish depth maps from texture images. The objective of our model is to optimize a multi-term loss function that maintains the accuracy of depth discontinuities in the reconstructed output while also ensuring high-quality synthesis. By capturing and preserving deep features specific to depth maps, our end-to-end network achieves better rate-distortion (R/D) compression performance than related methods and the depth-oriented 3D-HEVC standard.
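
The abstract does not give implementation details, so the following is only an illustrative sketch of two ingredients it names, not the authors' method. It assumes a wedgelet filter can be modeled as a square block split by a straight line into two constant regions, and that correlation features can be approximated by a Gram matrix over the channels of a feature map. The function names `wedgelet` and `gram_matrix`, the 4x4 block size, and the choice of 8 orientations are all hypothetical.

```python
import numpy as np


def wedgelet(size, angle, offset=0.0):
    """Return a size x size block split by a straight line into +1 / -1 regions.

    Assumption: this piecewise-constant pattern stands in for one
    predetermined wedgelet filter; the paper's exact filters are not given.
    """
    coords = np.arange(size) - (size - 1) / 2.0
    yy, xx = np.meshgrid(coords, coords, indexing="ij")
    # Signed distance from each pixel centre to the line with normal (cos, sin).
    dist = xx * np.cos(angle) + yy * np.sin(angle) - offset
    return np.where(dist >= 0, 1.0, -1.0)


def gram_matrix(features):
    """Channel-by-channel correlations of a (C, H, W) feature map.

    Assumption: a Gram matrix is used here as a simple stand-in for
    deep correlation features.
    """
    c, h, w = features.shape
    flat = features.reshape(c, h * w)
    return flat @ flat.T / (h * w)


# A small hypothetical bank of 8 wedgelet orientations on 4x4 blocks.
angles = np.linspace(0.0, np.pi, 8, endpoint=False)
filters = np.stack([wedgelet(4, a) for a in angles])

# Correlations between the filters themselves, treated as an 8-channel map.
correlations = gram_matrix(filters)
```

In a learned pipeline such a filter bank would typically be loaded as fixed weights of the first convolutional layer, with the correlation matrix computed on the activations of later layers rather than on the filters themselves.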


Pages: 81-97

Page count: 17
