Self-supervised monocular depth estimation with direct methods

被引:0
|
作者
Wang H. [1 ]
Sun Y. [1 ]
Wu Q.M.J. [2 ]
Lu X. [1 ]
Wang X. [1 ]
Zhang Z. [1 ]
机构
[1] Robotics Research Center, College of Electrical Engineering and Automation, Shandong University of Science and Technology, Qingdao
[2] Department of Electrical and Computer Engineering, University of Windsor, Windsor, N9B-3P4, ON
基金
中国国家自然科学基金;
关键词
Auto-mask; Depth estimation; Monocular vision;
D O I
10.1016/j.neucom.2020.10.025
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Depth estimation is crucial to understanding the geometry of a scene in robotics and computer vision. Traditionally, depth estimators can be trained with various forms of self-supervised stereo data or supervised ground-truth data. In comparison to the methods that utilize stereo depth perception or ground-truth data from laser scans, determining depth relation using an unlabeled monocular camera proves considerably more challenging. Recent work has shown that CNN-based depth estimators can be learned using unlabeled monocular video. Without needing the stereo data or ground-truth depth data, learning with monocular self-supervised strategies can utilize much larger and more varied image datasets. Inspired by recent advances in depth estimation, in this paper, we propose a novel objective that replaces the use of explicit ground-truth depth or binocular stereo depth with unlabeled monocular video sequence data. No assumptions about scene geometry or pre-trained information are used in the proposed architecture. To enable a better pose prediction, we propose the use of an improved differentiable direct visual odometry (DDVO), which is fused with an appearance-matching loss. The auto-masking approach is introduced in the DDVO depth predictor to filter out the low-texture area or occlusion area, which can easily reduce matching error, from one frame to the subsequent frame in the monocular sequence. Additionally, we introduce a self-supervised loss function to fuse the auto-masking segment and the depth-prediction segment accordingly. Our method produces state-of-the-art results for monocular depth estimation on the KITTI driving dataset, even outperforming some supervised methods that have been trained with ground-truth depth. © 2020 Elsevier B.V.
引用
收藏
页码:340 / 348
页数:8
相关论文
共 50 条
  • [21] Self-supervised Learning for Dense Depth Estimation in Monocular Endoscopy
    Liu, Xingtong
    Sinha, Ayushi
    Unberath, Mathias
    Ishii, Masaru
    Hager, Gregory D.
    Taylor, Russell H.
    Reiter, Austin
    OR 2.0 CONTEXT-AWARE OPERATING THEATERS, COMPUTER ASSISTED ROBOTIC ENDOSCOPY, CLINICAL IMAGE-BASED PROCEDURES, AND SKIN IMAGE ANALYSIS, OR 2.0 2018, 2018, 11041 : 128 - 138
  • [22] Frequency-Aware Self-Supervised Monocular Depth Estimation
    Chen, Xingyu
    Li, Thomas H.
    Zhang, Ruonan
    Li, Ge
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 5797 - 5806
  • [23] MonoVAN: Visual Attention for Self-Supervised Monocular Depth Estimation
    Indyk, Ilia
    Makarov, Ilya
    2023 IEEE INTERNATIONAL SYMPOSIUM ON MIXED AND AUGMENTED REALITY, ISMAR, 2023, : 1211 - 1220
  • [24] Self-Supervised Scale Recovery for Monocular Depth and Egomotion Estimation
    Wagstaff, Brandon
    Kelly, Jonathan
    2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 2620 - 2627
  • [25] Graph semantic information for self-supervised monocular depth estimation
    Zhang, Dongdong
    Wang, Chunping
    Wang, Huiying
    Fu, Qiang
    PATTERN RECOGNITION, 2024, 156
  • [26] Image Masking for Robust Self-Supervised Monocular Depth Estimation
    Chawla, Hemang
    Jeeveswaran, Kishaan
    Arani, Elahe
    Zonooz, Bahram
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 10054 - 10060
  • [27] Exploring the vulnerability of self-supervised monocular depth estimation models
    Hou, Ruitao
    Mo, Kanghua
    Long, Yucheng
    Li, Ning
    Rao, Yuan
    INFORMATION SCIENCES, 2024, 677
  • [28] Self-Supervised Monocular Depth Estimation by Digging into Uncertainty Quantification
    Yuan-Zhen Li
    Sheng-Jie Zheng
    Zi-Xin Tan
    Tuo Cao
    Fei Luo
    Chun-Xia Xiao
    Journal of Computer Science and Technology, 2023, 38 : 510 - 525
  • [29] Deep Digging into the Generalization of Self-Supervised Monocular Depth Estimation
    Bae, Jinwoo
    Moon, Sungho
    Im, Sunghoon
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 1, 2023, : 187 - 196
  • [30] Self-Supervised Monocular Depth Estimation Based on Channel Attention
    Tao, Bo
    Chen, Xinbo
    Tong, Xiliang
    Jiang, Du
    Chen, Baojia
    PHOTONICS, 2022, 9 (06)