TSUDepth: Exploring temporal symmetry-based uncertainty for unsupervised monocular depth estimation

Citations: 0

Authors:

Zhu, Yufan [1]

Ren, Rui [1]

Dong, Weisheng [1]

Li, Xin [2]

Shi, Guangming [1]

Affiliations:

[1] Xidian Univ, Sch Artificial Intelligence, Xian 710071, Peoples R China

[2] SUNY Albany, Dept Comp Sci, Albany, NY 12222 USA

Source:

NEUROCOMPUTING | 2024 / Vol. 600

Funding:

National Natural Science Foundation of China;

Keywords:

Unsupervised depth estimation; Automatic uncertainty learning; Temporal symmetry; Cross-resolution distillation; FRAMEWORK;

DOI:

10.1016/j.neucom.2024.128165

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

When faced with occlusions and non-rigid motions, machines often struggle with depth estimation, a task effortlessly performed by humans with just one eye. Continuous RGB images embody rich temporal features, such as symmetry and optical flow, which current deep-learning models fail to effectively leverage. In response to this limitation, we introduce an innovative framework known as Temporal Symmetry-based Uncertainty (TSU)-Depth, aimed at enhancing the accuracy of unsupervised monocular depth estimation. The Temporal Symmetry-based Occlusion Optimization (TSOO) component plays a pivotal role in robustly identifying occluded regions and comparable optimization across adjacent frames. Simultaneously, we propose Temporal Optical Flow Masking (TOFM) to effectively identify and exclude static pixels (such as out-of-range depths and non-rigid objects) between adjacent frames. Additionally, we introduce Cross-Resolution Distillation (CRED) to enhance depth estimation accuracy across various resolutions, especially in low input resolution scenarios. Furthermore, we designed a new depth estimation structure utilizing the DPT structure and incorporating a GRU module to enhance performance details. Through extensive experiments on benchmark datasets, including KITTI, Cityscapes, and Make3D, our TSUDepth framework has consistently demonstrated state-of-the-art performance. Code is available at https://github.com/BlueEg/TSUDepth/.

Pages: 12

