Self-Supervised Monocular Depth Estimation With Isometric-Self-Sample-Based Learning

被引:2
|
作者
Cha, Geonho [1 ]
Jang, Ho-Deok [1 ]
Wee, Dongyoon [1 ]
机构
[1] NAVER Corp, Clova AI, Seongnam 13561, South Korea
关键词
Training; Estimation; Vehicle dynamics; Optical flow; Cameras; Three-dimensional displays; Point cloud compression; Autonomous vehicle navigation; deep learning methods; RGB-D perception; vision-based navigation;
D O I
10.1109/LRA.2022.3221871
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Managing the dynamic regions in the photometric loss formulation has been a main issue for handling the self-supervised depth estimation problem. Most previous methods have alleviated this issue by removing the dynamic regions in the photometric loss formulation based on the masks estimated from another module, making it difficult to fully utilize the training images. In this letter, to handle this problem, we propose an isometric self-sample-based learning (ISSL) method to fully utilize the training images in a simple yet effective way. The proposed method provides additional supervision during training using self-generated images that comply with pure static scene assumption. Specifically, the isometric self-sample generator synthesizes self-samples for each training image by applying random rigid transformations on the estimated depth. Thus both the generated self-samples and the corresponding training image always follow the static scene relation. Our method can serve as a plug-and-play module for two existing models without any architectural modifications. It provides additional supervision during training phase only. Thus, there is no additional overhead on base model parameters and computation during inference phase. These properties fit well with models oriented to real-time applications. We show that plugging our ISSL module into two existing models consistently improves the performance by a large margin. In addition, it also boosts the depth accuracy over different types of scene, i.e., outdoor scenes (KITTI and Make3D) and indoor scene (NYUv2), validating its high effectiveness.
引用
收藏
页码:2173 / 2180
页数:8
相关论文
共 50 条
  • [41] Frequency-Aware Self-Supervised Monocular Depth Estimation
    Chen, Xingyu
    Li, Thomas H.
    Zhang, Ruonan
    Li, Ge
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 5797 - 5806
  • [42] MonoVAN: Visual Attention for Self-Supervised Monocular Depth Estimation
    Indyk, Ilia
    Makarov, Ilya
    2023 IEEE INTERNATIONAL SYMPOSIUM ON MIXED AND AUGMENTED REALITY, ISMAR, 2023, : 1211 - 1220
  • [43] Self-Supervised Scale Recovery for Monocular Depth and Egomotion Estimation
    Wagstaff, Brandon
    Kelly, Jonathan
    2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 2620 - 2627
  • [44] Graph semantic information for self-supervised monocular depth estimation
    Zhang, Dongdong
    Wang, Chunping
    Wang, Huiying
    Fu, Qiang
    PATTERN RECOGNITION, 2024, 156
  • [45] Image Masking for Robust Self-Supervised Monocular Depth Estimation
    Chawla, Hemang
    Jeeveswaran, Kishaan
    Arani, Elahe
    Zonooz, Bahram
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 10054 - 10060
  • [46] Exploring the vulnerability of self-supervised monocular depth estimation models
    Hou, Ruitao
    Mo, Kanghua
    Long, Yucheng
    Li, Ning
    Rao, Yuan
    INFORMATION SCIENCES, 2024, 677
  • [47] Self-Supervised Monocular Depth Estimation by Digging into Uncertainty Quantification
    Yuan-Zhen Li
    Sheng-Jie Zheng
    Zi-Xin Tan
    Tuo Cao
    Fei Luo
    Chun-Xia Xiao
    Journal of Computer Science and Technology, 2023, 38 : 510 - 525
  • [48] Deep Digging into the Generalization of Self-Supervised Monocular Depth Estimation
    Bae, Jinwoo
    Moon, Sungho
    Im, Sunghoon
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 1, 2023, : 187 - 196
  • [49] Self-Supervised Human Depth Estimation from Monocular Videos
    Tan, Feitong
    Zhu, Hao
    Cui, Zhaopeng
    Zhu, Siyu
    Pollefeys, Marc
    Tan, Ping
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 647 - 656
  • [50] Self-Supervised Monocular Depth Estimation with Multi-constraints
    Yang, Xinpeng
    Zhang, Sen
    Zhao, Baoyong
    2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 8422 - 8427