Learning Depth from Monocular Videos using Direct Methods

被引：397

作者：

Wang, Chaoyang ^{[1
]}

Miguel Buenaposada, Jose ^{[1
,2
]}

Zhu, Rui ^{[1
]}

Lucey, Simon ^{[1
]}

机构：

[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA

[2] Univ Rey Juan Carlos, Mostoles, Spain

来源：

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2018年

关键词：

D O I：

10.1109/CVPR.2018.00216

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The ability to predict depth from a single image - using recent advances in CNNs - is of increasing interest to the vision community. Unsupervised strategies to learning are particularly appealing as they can utilize much larger and varied monocular video datasets during learning without the need for ground truth depth or stereo. In previous works, separate pose and depth CNN predictors had to be determined such that their joint outputs minimized the photometric error. Inspired by recent advances in direct visual odometry (DVO), we argue that the depth CNN predictor can be learned without a pose CNN predictor. Further, we demonstrate empirically that incorporation of a differentiable implementation of DVO, along with a novel depth normalization strategy - substantially improves performance over state of the art that use monocular videos for training.

引用

页码：2022 / 2030

页数：9

共 50 条

[31] Learning Depth from Monocular Sequence with Convolutional LSTM Network
Yeh, Chia-Hung
Huang, Yao-Pao
Lin, Chih-Yang
Lin, Min-Hui
ADVANCES IN NETWORKED-BASED INFORMATION SYSTEMS, NBIS-2019, 2020, 1036 : 502 - 507
[32] Dense Depth Estimation in Monocular Endoscopy With Self-Supervised Learning Methods
Liu, Xingtong
Sinha, Ayushi
Ishii, Masaru
Hager, Gregory D.
Reiter, Austin
Taylor, Russell H.
Unberath, Mathias
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2020, 39 (05) : 1438 - 1447
[33] Direct Estimation of Dense Scene Flow and Depth from a Monocular Sequence
Mathlouthi, Yosra
Mitiche, Amar
Ben Ayed, Ismail
ADVANCES IN VISUAL COMPUTING (ISVC 2014), PT 1, 2014, 8887 : 107 - 117
[34] Self-Supervised Monocular Depth Estimation From Videos via Adaptive Reconstruction Constraints
Ye, Xinchen
Ou, Yuxiang
Wu, Biao
Xu, Rui
Li, Haojie
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (03) : 2161 - 2172
[35] Depth Prediction for Monocular Direct Visual Odometry
Cheng, Ran
Agia, Christopher
Meger, David
Dudek, Gregory
2020 17TH CONFERENCE ON COMPUTER AND ROBOT VISION (CRV 2020), 2020, : 70 - 77
[36] Dense Depth Estimation from Stereo Endoscopy Videos Using Unsupervised Optical Flow Methods
Yang, Zixin
Simon, Richard
Li, Yangming
Linte, Cristian A.
MEDICAL IMAGE UNDERSTANDING AND ANALYSIS (MIUA 2021), 2021, 12722 : 337 - 349
[37] Self-supervised learning of monocular depth using quantized networks
Lu, Keyu
Zeng, Chengyi
Zeng, Yonghu
NEUROCOMPUTING, 2022, 488 : 634 - 646
[38] Pedestrian Segmentation From Uncalibrated Monocular Videos Using a Projection Map
Jo, Younggwan
Nam, Woonhyun
Han, Joon Hee
IEEE SIGNAL PROCESSING LETTERS, 2009, 16 (07) : 604 - 607
[39] Monocular and stereo methods for AAM learning from video
Saragih, Jason
Goecke, Roland
2007 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-8, 2007, : 680 - +
[40] DEPTH MAP ESTIMATION IN DIBR STEREOSCOPIC 3D VIDEOS USING A COMBINATION OF MONOCULAR CUES
Aabed, Mohammed
Temel, Dogancan
Solh, Mashhour
AlRegib, Ghassan
2012 CONFERENCE RECORD OF THE FORTY SIXTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS (ASILOMAR), 2012, : 729 - 733

← 1 2 3 4 5 →