Learning Depth from Monocular Videos using Direct Methods

被引:397
|
作者
Wang, Chaoyang [1 ]
Miguel Buenaposada, Jose [1 ,2 ]
Zhu, Rui [1 ]
Lucey, Simon [1 ]
机构
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[2] Univ Rey Juan Carlos, Mostoles, Spain
来源
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2018年
关键词
D O I
10.1109/CVPR.2018.00216
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The ability to predict depth from a single image - using recent advances in CNNs - is of increasing interest to the vision community. Unsupervised strategies to learning are particularly appealing as they can utilize much larger and varied monocular video datasets during learning without the need for ground truth depth or stereo. In previous works, separate pose and depth CNN predictors had to be determined such that their joint outputs minimized the photometric error. Inspired by recent advances in direct visual odometry (DVO), we argue that the depth CNN predictor can be learned without a pose CNN predictor. Further, we demonstrate empirically that incorporation of a differentiable implementation of DVO, along with a novel depth normalization strategy - substantially improves performance over state of the art that use monocular videos for training.
引用
收藏
页码:2022 / 2030
页数:9
相关论文
共 50 条
  • [21] Monocular Depth Estimation Using Deep Learning: A Review
    Masoumian, Armin
    Rashwan, Hatem A.
    Cristiano, Julian
    Asif, M. Salman
    Puig, Domenec
    SENSORS, 2022, 22 (14)
  • [22] Depth Estimation from Monocular Images Using Dilated Convolution and Uncertainty Learning
    Ma, Haojie
    Ding, Yinzhang
    Wang, Lianghao
    Zhang, Ming
    Li, Dongxiao
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2018, PT II, 2018, 11165 : 13 - 23
  • [23] Monocular Depth Estimation by Learning from Heterogeneous Datasets
    Gurram, Akhil
    Urfalioglu, Onay
    Halfaoui, Ibrahim
    Bouzaraa, Fand
    Lopez, Antonio M.
    2018 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2018, : 2176 - 2181
  • [24] Leveraging Near-Field Lighting for Monocular Depth Estimation from Endoscopy Videos
    Paruchuri, Akshay
    Ehrenstein, Samuel
    Wang, Shuxian
    Fried, Inbar
    Pizer, Stephen M.
    Niethammer, Marc
    Sengupta, Roni
    COMPUTER VISION - ECCV 2024, PT XXXII, 2025, 15090 : 473 - 491
  • [25] MonoNeRF: Learning a Generalizable Dynamic Radiance Field from Monocular Videos
    Tian, Fengrui
    Du, Shaoyi
    Duan, Yueqi
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 17857 - 17867
  • [26] Adaptive Self-supervised Depth Estimation in Monocular Videos
    Mendoza, Julio
    Pedrini, Helio
    IMAGE AND GRAPHICS (ICIG 2021), PT III, 2021, 12890 : 687 - 699
  • [27] Learning Depth from Single Monocular Images Using Deep Convolutional Neural Fields
    Liu, Fayao
    Shen, Chunhua
    Lin, Guosheng
    Reid, Ian
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (10) : 2024 - 2039
  • [28] Monocular Depth Perception Using Image Processing and Machine Learning
    Hombali, Apoorv
    Gorde, Vaibhav
    Deshpande, Abhishek
    INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2011), 2011, 8285
  • [29] EVALUATING MONOCULAR DEPTH ESTIMATION METHODS
    Padkan, N.
    Trybala, P.
    Battisti, R.
    Remondino, F.
    Bergeret, C.
    2ND GEOBENCH WORKSHOP ON EVALUATION AND BENCHMARKING OF SENSORS, SYSTEMS AND GEOSPATIAL DATA IN PHOTOGRAMMETRY AND REMOTE SENSING, VOL. 48-1, 2023, : 137 - 144
  • [30] Learning Single-Image Depth from Videos using Quality Assessment Networks
    Chen, Weifeng
    Qian, Shengyi
    Deng, Jia
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 5587 - 5596