Unsupervised Monocular Depth and Camera Pose Estimation with Multiple Masks and Geometric Consistency Constraints

被引:1
|
作者
Zhang, Xudong [1 ]
Zhao, Baigan [2 ]
Yao, Jiannan [2 ]
Wu, Guoqing [1 ]
机构
[1] Nantong Univ, Sch Informat Sci & Technol, Nantong 226019, Peoples R China
[2] Nantong Univ, Sch Mech Engn, Nantong 226019, Peoples R China
基金
中国国家自然科学基金;
关键词
depth estimation; camera pose; visual odometry; unsupervised learning; VISUAL ODOMETRY;
D O I
10.3390/s23115329
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
This paper presents a novel unsupervised learning framework for estimating scene depth and camera pose from video sequences, fundamental to many high-level tasks such as 3D reconstruction, visual navigation, and augmented reality. Although existing unsupervised methods have achieved promising results, their performance suffers in challenging scenes such as those with dynamic objects and occluded regions. As a result, multiple mask technologies and geometric consistency constraints are adopted in this research to mitigate their negative impacts. Firstly, multiple mask technologies are used to identify numerous outliers in the scene, which are excluded from the loss computation. In addition, the identified outliers are employed as a supervised signal to train a mask estimation network. The estimated mask is then utilized to preprocess the input to the pose estimation network, mitigating the potential adverse effects of challenging scenes on pose estimation. Furthermore, we propose geometric consistency constraints to reduce the sensitivity of illumination changes, which act as additional supervised signals to train the network. Experimental results on the KITTI dataset demonstrate that our proposed strategies can effectively enhance the model's performance, outperforming other unsupervised methods.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] Geometric Pretraining for Monocular Depth Estimation
    Wang, Kaixuan
    Chen, Yao
    Guo, Hengkai
    Wen, Linfu
    Shen, Shaojie
    2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 4782 - 4788
  • [22] PLG-IN: Pluggable Geometric Consistency Loss with Wasserstein Distance in Monocular Depth Estimation
    Hirose, Noriaki
    Koide, Satoshi
    Kawano, Keisuke
    Kondo, Ruho
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 12868 - 12874
  • [23] Unsupervised Monocular Depth Estimation for Monocular Visual SLAM Systems
    Liu, Feng
    Huang, Ming
    Ge, Hongyu
    Tao, Dan
    Gao, Ruipeng
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 : 1 - 13
  • [24] Monocular Depth Estimation Based on Unsupervised Learning
    Liu, Wan
    Sun, Yan
    Wang, XuCheng
    Yang, Lin
    Zheng, Zhenrong
    OPTOELECTRONIC IMAGING AND MULTIMEDIA TECHNOLOGY VI, 2019, 11187
  • [25] Outside-in Monocular IR Camera based HMD Pose Estimation via Geometric Optimization
    Savkin, Pavel A.
    Saito, Shunsuke
    Vansteenberge, Jarich
    Fukusato, Tsukasa
    Wilson, Lochlainn
    Morishima, Shigeo
    VRST'17: PROCEEDINGS OF THE 23RD ACM SYMPOSIUM ON VIRTUAL REALITY SOFTWARE AND TECHNOLOGY, 2017,
  • [26] Self-supervised Monocular Depth Estimation Based on Semantic Assistance and Depth Temporal Consistency Constraints
    Ling, Chuanwu
    Chen, Hua
    Xu, Dayong
    Zhang, Xiaogang
    Hunan Daxue Xuebao/Journal of Hunan University Natural Sciences, 2024, 51 (08): : 1 - 12
  • [27] Self-supervised Learning with Geometric Constraints in Monocular Video Connecting Flow, Depth, and Camera
    Chen, Yuhua
    Schmid, Cordelia
    Sminchisescu, Cristian
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 7062 - 7071
  • [28] SoftPOSIT Enhancements for Monocular Camera Spacecraft Pose Estimation
    Shi, Jian-Feng
    Ulrich, Steve
    2016 21ST INTERNATIONAL CONFERENCE ON METHODS AND MODELS IN AUTOMATION AND ROBOTICS (MMAR), 2016, : 30 - 35
  • [29] Integrating Visual and Geometric Consistency for Pose Estimation
    Chen, Huiqin
    Aldea, Emanuel
    Le Hegarat-Mascle, Sylvie
    PROCEEDINGS OF MVA 2019 16TH INTERNATIONAL CONFERENCE ON MACHINE VISION APPLICATIONS (MVA), 2019,
  • [30] Self-supervised monocular depth estimation via multiple bilateral consistency
    Lu, Zhengyang
    Chen, Ying
    MULTIMEDIA SYSTEMS, 2025, 31 (02)