TSUDepth: Exploring temporal symmetry-based uncertainty for unsupervised monocular depth estimation

被引:0
|
作者
Zhu, Yufan [1 ]
Ren, Rui [1 ]
Dong, Weisheng [1 ]
Li, Xin [2 ]
Shi, Guangming [1 ]
机构
[1] Xidian Univ, Sch Artificial Intelligence, Xian 710071, Peoples R China
[2] SUNY Albany, Dept Comp Sci, Albany, NY 12222 USA
基金
中国国家自然科学基金;
关键词
Unsupervised depth estimation; Automatic uncertainty learning; Temporal symmetry; Cross-resolution distillation; FRAMEWORK;
D O I
10.1016/j.neucom.2024.128165
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
When faced with occlusions and non-rigid motions, machines often struggle with depth estimation, a task effortlessly performed by humans with just one eye. Continuous RGB images embody rich temporal features, such as symmetry and optical flow, which current deep-learning models fail to effectively leverage. In response to this limitation, we introduce an innovative framework known as Temporal Symmetry-based Uncertainty (TSU)-Depth, aimed at enhancing the accuracy of unsupervised monocular depth estimation. The Temporal Symmetry-based Occlusion Optimization (TSOO) component plays a pivotal role in robustly identifying occluded regions and comparable optimization across adjacent frames. Simultaneously, we propose Temporal Optical Flow Masking (TOFM) to effectively identify and exclude static pixels (such as out-of-range depths and non-rigid objects) between adjacent frames. Additionally, we introduce Cross-Resolution Distillation (CRED) to enhance depth estimation accuracy across various resolutions, especially in low input resolution scenarios. Furthermore, we designed a new depth estimation structure utilizing the DPT structure and incorporating a GRU module to enhance performance details. Through extensive experiments on benchmark datasets, including KITTI, Cityscapes, and Make3D, our TSUDepth framework has consistently demonstrated state-of-the-art performance. Code is available at https://github.com/BlueEg/TSUDepth/.
引用
收藏
页数:12
相关论文
共 50 条
  • [11] Uncertainty Estimation for Efficient Monocular Depth Perception
    Du, Hao
    Cheng, Guoan
    Matsune, Ai
    Zhu, Qiang
    Zhan, Shu
    2022 ASIA CONFERENCE ON ALGORITHMS, COMPUTING AND MACHINE LEARNING (CACML 2022), 2022, : 804 - 808
  • [12] AsiANet: Autoencoders in Autoencoder for Unsupervised Monocular Depth Estimation
    Yusiong, John Paul T.
    Naval, Prospero C., Jr.
    2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, : 443 - 451
  • [13] Unsupervised Monocular Depth Estimation With Channel and Spatial Attention
    Wang, Zhuping
    Dai, Xinke
    Guo, Zhanyu
    Huang, Chao
    Zhang, Hao
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (06) : 7860 - 7870
  • [14] Structured Adversarial Training for Unsupervised Monocular Depth Estimation
    Mehta, Ishit
    Sakurikar, Parikshit
    Narayanan, P. J.
    2018 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2018, : 314 - 323
  • [15] Dual CNN Models for Unsupervised Monocular Depth Estimation
    Repala, Vamshi Krishna
    Dubey, Shiv Ram
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2019, PT I, 2019, 11941 : 209 - 217
  • [16] Unsupervised Monocular Depth Estimation in Highly Complex Environments
    Zhao, Chaoqiang
    Tang, Yang
    Sun, Qiyu
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2022, 6 (05): : 1237 - 1246
  • [17] Unsupervised Monocular Depth Estimation for Autonomous Flight of Drones
    Zhao Shuanfeng
    Huang Tao
    Xu Qian
    Geng Longlong
    LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (02)
  • [18] Learning monocular depth estimation with unsupervised trinocular assumptions
    Poggi, Matteo
    Tosi, Fabio
    Mattoccia, Stefano
    2018 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2018, : 324 - 333
  • [19] An Adaptive Unsupervised Learning Framework for Monocular Depth Estimation
    Yang, Delong
    Zhong, Xunyu
    Lin, Lixiong
    Peng, Xiafu
    IEEE ACCESS, 2019, 7 : 148142 - 148151
  • [20] UNSUPERVISED MONOCULAR DEPTH ESTIMATION BASED ON DUAL ATTENTION MECHANISM AND DEPTH-AWARE LOSS
    Ye, Xinchen
    Zhang, Mingliang
    Xu, Rui
    Zhong, Wei
    Fan, Xin
    Liu, Zhu
    Zhang, Jiaao
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 169 - 174