Learning Depth from Monocular Videos using Direct Methods

Cited by: 397
Authors
Wang, Chaoyang [1]
Miguel Buenaposada, Jose [1, 2]
Zhu, Rui [1]
Lucey, Simon [1]
Affiliations
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[2] Univ Rey Juan Carlos, Mostoles, Spain
Source
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2018
DOI
10.1109/CVPR.2018.00216
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The ability to predict depth from a single image, using recent advances in CNNs, is of increasing interest to the vision community. Unsupervised learning strategies are particularly appealing because they can use much larger and more varied monocular video datasets during learning without the need for ground-truth depth or stereo. In previous works, separate pose and depth CNN predictors had to be determined such that their joint outputs minimized the photometric error. Inspired by recent advances in direct visual odometry (DVO), we argue that the depth CNN predictor can be learned without a pose CNN predictor. Further, we demonstrate empirically that incorporating a differentiable implementation of DVO, along with a novel depth normalization strategy, substantially improves performance over state-of-the-art methods that use monocular videos for training.
Pages: 2022-2030
Page count: 9
Related papers
50 in total
  • [31] Learning Depth from Monocular Sequence with Convolutional LSTM Network
    Yeh, Chia-Hung
    Huang, Yao-Pao
    Lin, Chih-Yang
    Lin, Min-Hui
    ADVANCES IN NETWORKED-BASED INFORMATION SYSTEMS, NBIS-2019, 2020, 1036 : 502 - 507
  • [32] Dense Depth Estimation in Monocular Endoscopy With Self-Supervised Learning Methods
    Liu, Xingtong
    Sinha, Ayushi
    Ishii, Masaru
    Hager, Gregory D.
    Reiter, Austin
    Taylor, Russell H.
    Unberath, Mathias
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2020, 39 (05) : 1438 - 1447
  • [33] Direct Estimation of Dense Scene Flow and Depth from a Monocular Sequence
    Mathlouthi, Yosra
    Mitiche, Amar
    Ben Ayed, Ismail
    ADVANCES IN VISUAL COMPUTING (ISVC 2014), PT 1, 2014, 8887 : 107 - 117
  • [34] Self-Supervised Monocular Depth Estimation From Videos via Adaptive Reconstruction Constraints
    Ye, Xinchen
    Ou, Yuxiang
    Wu, Biao
    Xu, Rui
    Li, Haojie
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (03) : 2161 - 2172
  • [35] Depth Prediction for Monocular Direct Visual Odometry
    Cheng, Ran
    Agia, Christopher
    Meger, David
    Dudek, Gregory
    2020 17TH CONFERENCE ON COMPUTER AND ROBOT VISION (CRV 2020), 2020, : 70 - 77
  • [36] Dense Depth Estimation from Stereo Endoscopy Videos Using Unsupervised Optical Flow Methods
    Yang, Zixin
    Simon, Richard
    Li, Yangming
    Linte, Cristian A.
    MEDICAL IMAGE UNDERSTANDING AND ANALYSIS (MIUA 2021), 2021, 12722 : 337 - 349
  • [37] Self-supervised learning of monocular depth using quantized networks
    Lu, Keyu
    Zeng, Chengyi
    Zeng, Yonghu
    NEUROCOMPUTING, 2022, 488 : 634 - 646
  • [38] Pedestrian Segmentation From Uncalibrated Monocular Videos Using a Projection Map
    Jo, Younggwan
    Nam, Woonhyun
    Han, Joon Hee
    IEEE SIGNAL PROCESSING LETTERS, 2009, 16 (07) : 604 - 607
  • [39] Monocular and stereo methods for AAM learning from video
    Saragih, Jason
    Goecke, Roland
    2007 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-8, 2007, : 680 - +
  • [40] DEPTH MAP ESTIMATION IN DIBR STEREOSCOPIC 3D VIDEOS USING A COMBINATION OF MONOCULAR CUES
    Aabed, Mohammed
    Temel, Dogancan
    Solh, Mashhour
    AlRegib, Ghassan
    2012 CONFERENCE RECORD OF THE FORTY SIXTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS (ASILOMAR), 2012, : 729 - 733