End-to-End Variable-Rate Learning-Based Depth Compression Guided by Deep Correlation Features

Authors
Sebai, Dorsaf [1]
Sehli, Maryem [1]
Ghorbel, Faouzi [1]
Affiliations
[1] Natl Sch Comp Sci ENSI, Cristal Lab, Manouba, Tunisia
Keywords
Depth maps; Learning-based compression; Wedgelets; Learnt deep correlation features; Variable-rate compression; MULTIVIEW;
DOI
10.1007/s11265-023-01906-3
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Progress in 3D video, and in depth maps in particular, is driving the emergence of technologies such as augmented, virtual, and mixed reality, which have a wide range of applications in smart cities, intelligent transportation, AI-enabled farms, healthcare, education, industry, and more. In addition, the future development of the Internet of Things (IoT) depends heavily on incorporating 3D vision and depth perception into machines such as autonomous cars, robots, and drones, so that they perceive their surroundings much as humans do. However, traditional compression methods that focus only on texture are not suited to efficiently handling the large volume of depth maps, because texture and depth have distinct characteristics. To tackle this challenge, we propose a model for compressing depth maps. Our approach uses a learning-based variable-rate method combined with a conditional quality-controllable autoencoder. The model includes an encoder that automatically extracts features from depth maps using an optimized Convolutional Neural Network. This network consists of an initial layer that uses predetermined wedgelet filters, followed by a VGG19 model. We also use an image style classification technique based on Learnt Deep Correlation Features to learn deep features that distinguish depth maps from texture images. The model is trained to optimize a multi-term loss function that preserves the accuracy of depth discontinuities in the reconstructed output while also ensuring high-quality synthesis. By capturing and preserving deep features specific to depth maps, our end-to-end network achieves better rate-distortion (R/D) performance than related methods and the depth-oriented 3D-HEVC standard.
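The two building blocks named in the abstract, predetermined wedgelet filters and correlation features, can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that a wedgelet filter is a square block split into two regions by a straight line, and that the correlation features are Gram-matrix style inter-channel correlations of the kind commonly used in style classification. The function names, the +1/-1 encoding, and the normalization are hypothetical choices made for illustration.

```python
from typing import List

Matrix = List[List[float]]


def wedgelet_filter(size: int, x0: float, y0: float, x1: float, y1: float) -> Matrix:
    """Build a size x size filter split by the line through (x0, y0) and (x1, y1).

    Assumption: pixels on one side of the line are set to +1 and the rest to -1.
    The abstract only says the filters are predetermined; this encoding is illustrative.
    """
    kernel: Matrix = []
    for row in range(size):
        values = []
        for col in range(size):
            # The sign of the 2D cross product tells which side of the line the pixel lies on.
            side = (x1 - x0) * (row - y0) - (y1 - y0) * (col - x0)
            values.append(1.0 if side >= 0 else -1.0)
        kernel.append(values)
    return kernel


def gram_matrix(features: Matrix) -> Matrix:
    """Inter-channel correlation (Gram) matrix of C feature maps.

    Each entry of `features` is one feature map flattened to N values.
    Assumption: correlations are normalized by N.
    """
    n = len(features[0])
    return [
        [sum(a * b for a, b in zip(f_i, f_j)) / n for f_j in features]
        for f_i in features
    ]


# Hypothetical usage: a 4 x 4 wedgelet split along the main diagonal,
# and the correlations between two tiny flattened feature maps.
diagonal = wedgelet_filter(4, 0, 0, 3, 3)
correlations = gram_matrix([[1.0, 2.0], [3.0, 4.0]])
```

In a full model, filters like `diagonal` would be fixed convolution kernels in the first encoder layer, and matrices like `correlations` would be computed on deeper feature maps such as those of VGG19; both steps are left out here to keep the sketch dependency-free.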
Pages: 81-97
Page count: 17