TSUDepth: Exploring temporal symmetry-based uncertainty for unsupervised monocular depth estimation

被引:0
|
作者
Zhu, Yufan [1 ]
Ren, Rui [1 ]
Dong, Weisheng [1 ]
Li, Xin [2 ]
Shi, Guangming [1 ]
机构
[1] Xidian Univ, Sch Artificial Intelligence, Xian 710071, Peoples R China
[2] SUNY Albany, Dept Comp Sci, Albany, NY 12222 USA
基金
中国国家自然科学基金;
关键词
Unsupervised depth estimation; Automatic uncertainty learning; Temporal symmetry; Cross-resolution distillation; FRAMEWORK;
D O I
10.1016/j.neucom.2024.128165
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
When faced with occlusions and non-rigid motions, machines often struggle with depth estimation, a task effortlessly performed by humans with just one eye. Continuous RGB images embody rich temporal features, such as symmetry and optical flow, which current deep-learning models fail to effectively leverage. In response to this limitation, we introduce an innovative framework known as Temporal Symmetry-based Uncertainty (TSU)-Depth, aimed at enhancing the accuracy of unsupervised monocular depth estimation. The Temporal Symmetry-based Occlusion Optimization (TSOO) component plays a pivotal role in robustly identifying occluded regions and comparable optimization across adjacent frames. Simultaneously, we propose Temporal Optical Flow Masking (TOFM) to effectively identify and exclude static pixels (such as out-of-range depths and non-rigid objects) between adjacent frames. Additionally, we introduce Cross-Resolution Distillation (CRED) to enhance depth estimation accuracy across various resolutions, especially in low input resolution scenarios. Furthermore, we designed a new depth estimation structure utilizing the DPT structure and incorporating a GRU module to enhance performance details. Through extensive experiments on benchmark datasets, including KITTI, Cityscapes, and Make3D, our TSUDepth framework has consistently demonstrated state-of-the-art performance. Code is available at https://github.com/BlueEg/TSUDepth/.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] OptiDepthNet: A Real-Time Unsupervised Monocular Depth Estimation Network
    Wei, Feng
    Yin, XingHui
    Shen, Jie
    Wang, HuiBin
    WIRELESS PERSONAL COMMUNICATIONS, 2023, 128 (04) : 2831 - 2846
  • [42] Accurate unsupervised monocular depth estimation for ill-posed region
    Wang, Xiaofeng
    Sun, Jiameng
    Qin, Hao
    Yuan, Yuxing
    Yu, Jun
    Su, Yingying
    Sun, Zhiheng
    FRONTIERS IN PHYSICS, 2023, 10
  • [43] OptiDepthNet: A Real-Time Unsupervised Monocular Depth Estimation Network
    Feng Wei
    XingHui Yin
    Jie Shen
    HuiBin Wang
    Wireless Personal Communications, 2023, 128 : 2831 - 2846
  • [44] Unsupervised Monocular Training Method for Depth Estimation Using Statistical Masks
    Wang, Xiangtong
    Li, Wei
    Yang, Menglong
    Cheng, Peng
    Liang, Binbin
    IEEE ACCESS, 2020, 8 (191530-191541): : 191530 - 191541
  • [45] Unsupervised Ego-Motion and Dense Depth Estimation with Monocular Video
    Xu, Yufan
    Wang, Yan
    Guo, Lei
    2018 IEEE 18TH INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY (ICCT), 2018, : 1306 - 1310
  • [46] Structured Coupled Generative Adversarial Networks for Unsupervised Monocular Depth Estimation
    Puscas, Mihai Marian
    Xu, Dan
    Pilzer, Andrea
    Sebe, Niculae
    2019 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2019), 2019, : 18 - 26
  • [47] CASCADED DETAIL-AWARE NETWORK FOR UNSUPERVISED MONOCULAR DEPTH ESTIMATION
    Ye, Xinchen
    Zhang, Mingliang
    Fan, Xin
    Xu, Rui
    Pu, Juncheng
    Yan, Ruoke
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [48] Towards real-time unsupervised monocular depth estimation on CPU
    Poggi, Matteo
    Aleotti, Filippo
    Tosi, Fabio
    Mattoccia, Stefano
    2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2018, : 5848 - 5854
  • [49] Improving the robustness to the uncertainty of initial depth estimation in monocular SLAM
    Meng, Xujiong
    Jiang, Rongxin
    Chen, Yaowu
    2009 INTERNATIONAL SYMPOSIUM ON COMPUTER NETWORK AND MULTIMEDIA TECHNOLOGY (CNMT 2009), VOLUMES 1 AND 2, 2009, : 1028 - 1032
  • [50] Attention based multilayer feature fusion convolutional neural network for unsupervised monocular depth estimation
    Lei, Zeyu
    Wang, Yan
    Li, Zijian
    Yang, Junyao
    NEUROCOMPUTING, 2021, 423 : 343 - 352