End-to-End Variable-Rate Learning-Based Depth Compression Guided by Deep Correlation Features

被引:0
|
作者
Sebai, Dorsaf [1 ]
Sehli, Maryem [1 ]
Ghorbel, Faouzi [1 ]
机构
[1] Natl Sch Comp Sci ENSI, Cristal Lab, Manouba, Tunisia
关键词
Depth maps; Learning-based compression; Wedgelets; Learnt deep correlation features; Variable-rate compression; MULTIVIEW;
D O I
10.1007/s11265-023-01906-3
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The progress in the field of 3D video, particularly depth maps, is leading to the emergence of various technologies such as augmented, virtual, and mixed reality that have a wide range of applications in smart cities, intelligent transportation, AI-enabled farms, healthcare, education, industry, and more. Additionally, the future development of the Internet of Things (IoT) heavily depends on incorporating 3D vision and depth perception into machines like autonomous cars, robots, and drones, so that they effectively perceive their surroundings similar to how humans do. However, traditional compression methods that focus only on texture are not suitable for efficiently handle the large volume of depth maps due to the distinct features between texture and depth. To tackle this challenge, we aim to propose a model for compressing depth maps. Our approach utilizes a learning variable-rate method combined with a conditional quality-controllable autoencoder. The model consists of an encoder that automatically extracts features from depth maps using an optimized Convolutional Neural Network. This latter consists of an initial layer that uses predetermined wedgelet filters, succeeded by a VGG19 model. Additionally, we utilize a technique for classifying image styles based on Learnt Deep Correlation Features in order to learn deep features that distinguish depth maps from texture images. Our model objective is to optimize a loss function with multiple terms, which maintains the accuracy of depth discontinuities in the reconstructed output while also ensuring high-quality synthesis. By capturing and preserving deep features specific to depth maps, our end-to-end network achieves better R/D compression performances compared to related methods and depth-oriented 3D-HEVC standard.
引用
收藏
页码:81 / 97
页数:17
相关论文
共 50 条
  • [31] Spectrum Monitoring Based on End-to-End Learning by Deep Learning
    Mahdiyeh Rahmani
    Reza Ghazizadeh
    International Journal of Wireless Information Networks, 2022, 29 : 180 - 192
  • [32] Spectrum Monitoring Based on End-to-End Learning by Deep Learning
    Rahmani, Mahdiyeh
    Ghazizadeh, Reza
    INTERNATIONAL JOURNAL OF WIRELESS INFORMATION NETWORKS, 2022, 29 (02) : 180 - 192
  • [33] Curriculum Learning-Based Approaches for End-to-End Gas Recognition
    Zhang, Chao
    Wang, Wen
    Pan, Yong
    Zhai, Shoupei
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
  • [34] Deep Learning-Based End-to-End Wireless Communication Systems With Conditional GANs as Unknown Channels
    Ye, Hao
    Liang, Le
    Li, Geoffrey Ye
    Juang, Biing-Hwang
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2020, 19 (05) : 3133 - 3143
  • [35] DeepSTEP - Deep Learning-Based Spatio-Temporal End-To-End Perception for Autonomous Vehicles
    Huch, Sebastian
    Sauerbeck, Florian
    Betz, Johannes
    2023 IEEE INTELLIGENT VEHICLES SYMPOSIUM, IV, 2023,
  • [36] Deep learning-based end-to-end automated stenosis classification and localization on catheter coronary angiography
    Cong, Chao
    Kato, Yoko
    De Vasconcellos, Henrique Doria
    Ostovaneh, Mohammad R.
    Lima, Joao A. C.
    Ambale-Venkatesh, Bharath
    FRONTIERS IN CARDIOVASCULAR MEDICINE, 2023, 10
  • [37] End-to-End Deep Learning-Based Human Activity Recognition Using Channel State Information
    Hsieh, Chaur-Heh
    Chen, Jen-Yang
    Kuo, Chung-Ming
    Wang, Ping
    JOURNAL OF INTERNET TECHNOLOGY, 2021, 22 (02): : 271 - 281
  • [38] End-to-end deep learning-based autonomous driving control for high-speed environment
    Kim, Cheol-jin
    Lee, Myung-jae
    Hwang, Kyu-hong
    Ha, Young-guk
    JOURNAL OF SUPERCOMPUTING, 2022, 78 (02): : 1961 - 1982
  • [39] Speech Vision: An End-to-End Deep Learning-Based Dysarthric Automatic Speech Recognition System
    Shahamiri, Seyed Reza
    IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2021, 29 : 852 - 861
  • [40] End-to-end deep learning-based autonomous driving control for high-speed environment
    Cheol-jin Kim
    Myung-jae Lee
    Kyu-hong Hwang
    Young-guk Ha
    The Journal of Supercomputing, 2022, 78 : 1961 - 1982