End-to-End Variable-Rate Learning-Based Depth Compression Guided by Deep Correlation Features

Cited by: 0

Authors:

Sebai, Dorsaf [1]

Sehli, Maryem [1]

Ghorbel, Faouzi [1]

Affiliation:

[1] Natl Sch Comp Sci ENSI, Cristal Lab, Manouba, Tunisia

Source:

JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY | 2024, Vol. 96, No. 01

Keywords:

Depth maps; Learning-based compression; Wedgelets; Learnt deep correlation features; Variable-rate compression; MULTIVIEW;

DOI:

10.1007/s11265-023-01906-3

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

Progress in 3D video, and in depth maps in particular, is driving the emergence of technologies such as augmented, virtual, and mixed reality, which have a wide range of applications in smart cities, intelligent transportation, AI-enabled farms, healthcare, education, industry, and more. Additionally, the future development of the Internet of Things (IoT) depends heavily on incorporating 3D vision and depth perception into machines such as autonomous cars, robots, and drones, so that they perceive their surroundings much as humans do. However, traditional compression methods that focus only on texture are not suited to efficiently handling the large volume of depth maps, because texture and depth have distinct features. To tackle this challenge, we propose a model for compressing depth maps. Our approach uses a learning-based variable-rate method combined with a conditional quality-controllable autoencoder. The model includes an encoder that automatically extracts features from depth maps using an optimized Convolutional Neural Network. This network consists of an initial layer that uses predetermined wedgelet filters, followed by a VGG19 model. Additionally, we use a technique for classifying image styles based on Learnt Deep Correlation Features in order to learn deep features that distinguish depth maps from texture images. The model's objective is to optimize a loss function with multiple terms, which maintains the accuracy of depth discontinuities in the reconstructed output while also ensuring high-quality synthesis. By capturing and preserving deep features specific to depth maps, our end-to-end network achieves better rate-distortion (R/D) compression performance than related methods and the depth-oriented 3D-HEVC standard.
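The encoder front end described in the abstract (a layer of predetermined wedgelet filters, with correlation features computed on the resulting feature maps) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names `wedgelet_filter`, `wedgelet_responses`, and `gram_correlation`, the 4x4 filter size, the four angles, and the +1/-1 filter values are all assumptions made for illustration, the VGG19 stage is omitted, and the Gram matrix is used here only as one common way to express correlations between feature channels.

```python
import numpy as np


def wedgelet_filter(size, angle_deg, offset=0.0):
    """Hypothetical wedgelet: a square block split into two constant
    regions (+1 / -1) by a straight line at the given angle."""
    half = (size - 1) / 2.0
    ys, xs = np.mgrid[0:size, 0:size]
    theta = np.deg2rad(angle_deg)
    # Signed distance of each pixel centre from the dividing line.
    dist = (xs - half) * np.cos(theta) + (ys - half) * np.sin(theta) - offset
    return np.where(dist >= 0, 1.0, -1.0)


def wedgelet_responses(depth, bank):
    """Correlate a 2D depth map with a bank of fixed filters.

    depth: (H, W) array, bank: (F, k, k) array.
    Returns (F, H-k+1, W-k+1) feature maps ('valid' positions only).
    """
    k = bank.shape[-1]
    windows = np.lib.stride_tricks.sliding_window_view(depth, (k, k))
    return np.einsum("ijkl,fkl->fij", windows, bank)


def gram_correlation(features):
    """Channel-by-channel correlation (Gram) matrix of (C, H, W) features."""
    c, h, w = features.shape
    flat = features.reshape(c, h * w)
    return flat @ flat.T / (h * w)


# Synthetic depth map with one sharp vertical discontinuity (assumed data).
depth = np.zeros((16, 16))
depth[:, 8:] = 1.0

# Assumed bank: four orientations of a 4x4 wedgelet.
bank = np.stack([wedgelet_filter(4, a) for a in (0.0, 45.0, 90.0, 135.0)])

features = wedgelet_responses(depth, bank)
correlations = gram_correlation(features)
print(features.shape, correlations.shape)
```

In this sketch the filters are fixed rather than learned, mirroring the abstract's "predetermined" wording; in a full model their responses would feed a deeper network rather than being correlated directly.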

Pages: 81-97

Page count: 17
