Learning Depth from Monocular Videos using Direct Methods

被引：397

作者：

Wang, Chaoyang ^{[1
]}

Miguel Buenaposada, Jose ^{[1
,2
]}

Zhu, Rui ^{[1
]}

Lucey, Simon ^{[1
]}

机构：

[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA

[2] Univ Rey Juan Carlos, Mostoles, Spain

来源：

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2018年

关键词：

D O I：

10.1109/CVPR.2018.00216

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The ability to predict depth from a single image - using recent advances in CNNs - is of increasing interest to the vision community. Unsupervised strategies to learning are particularly appealing as they can utilize much larger and varied monocular video datasets during learning without the need for ground truth depth or stereo. In previous works, separate pose and depth CNN predictors had to be determined such that their joint outputs minimized the photometric error. Inspired by recent advances in direct visual odometry (DVO), we argue that the depth CNN predictor can be learned without a pose CNN predictor. Further, we demonstrate empirically that incorporation of a differentiable implementation of DVO, along with a novel depth normalization strategy - substantially improves performance over state of the art that use monocular videos for training.

引用

页码：2022 / 2030

页数：9

共 50 条

[21] Monocular Depth Estimation Using Deep Learning: A Review
Masoumian, Armin
Rashwan, Hatem A.
Cristiano, Julian
Asif, M. Salman
Puig, Domenec
SENSORS, 2022, 22 (14)
[22] Depth Estimation from Monocular Images Using Dilated Convolution and Uncertainty Learning
Ma, Haojie
Ding, Yinzhang
Wang, Lianghao
Zhang, Ming
Li, Dongxiao
ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2018, PT II, 2018, 11165 : 13 - 23
[23] Monocular Depth Estimation by Learning from Heterogeneous Datasets
Gurram, Akhil
Urfalioglu, Onay
Halfaoui, Ibrahim
Bouzaraa, Fand
Lopez, Antonio M.
2018 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2018, : 2176 - 2181
[24] Leveraging Near-Field Lighting for Monocular Depth Estimation from Endoscopy Videos
Paruchuri, Akshay
Ehrenstein, Samuel
Wang, Shuxian
Fried, Inbar
Pizer, Stephen M.
Niethammer, Marc
Sengupta, Roni
COMPUTER VISION - ECCV 2024, PT XXXII, 2025, 15090 : 473 - 491
[25] MonoNeRF: Learning a Generalizable Dynamic Radiance Field from Monocular Videos
Tian, Fengrui
Du, Shaoyi
Duan, Yueqi
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 17857 - 17867
[26] Adaptive Self-supervised Depth Estimation in Monocular Videos
Mendoza, Julio
Pedrini, Helio
IMAGE AND GRAPHICS (ICIG 2021), PT III, 2021, 12890 : 687 - 699
[27] Learning Depth from Single Monocular Images Using Deep Convolutional Neural Fields
Liu, Fayao
Shen, Chunhua
Lin, Guosheng
Reid, Ian
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (10) : 2024 - 2039
[28] Monocular Depth Perception Using Image Processing and Machine Learning
Hombali, Apoorv
Gorde, Vaibhav
Deshpande, Abhishek
INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2011), 2011, 8285
[29] EVALUATING MONOCULAR DEPTH ESTIMATION METHODS
Padkan, N.
Trybala, P.
Battisti, R.
Remondino, F.
Bergeret, C.
2ND GEOBENCH WORKSHOP ON EVALUATION AND BENCHMARKING OF SENSORS, SYSTEMS AND GEOSPATIAL DATA IN PHOTOGRAMMETRY AND REMOTE SENSING, VOL. 48-1, 2023, : 137 - 144
[30] Learning Single-Image Depth from Videos using Quality Assessment Networks
Chen, Weifeng
Qian, Shengyi
Deng, Jia
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 5587 - 5596

← 1 2 3 4 5 →