Unsupervised Monocular Depth and Camera Pose Estimation with Multiple Masks and Geometric Consistency Constraints

被引:1
|
作者
Zhang, Xudong [1 ]
Zhao, Baigan [2 ]
Yao, Jiannan [2 ]
Wu, Guoqing [1 ]
机构
[1] Nantong Univ, Sch Informat Sci & Technol, Nantong 226019, Peoples R China
[2] Nantong Univ, Sch Mech Engn, Nantong 226019, Peoples R China
基金
中国国家自然科学基金;
关键词
depth estimation; camera pose; visual odometry; unsupervised learning; VISUAL ODOMETRY;
D O I
10.3390/s23115329
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
This paper presents a novel unsupervised learning framework for estimating scene depth and camera pose from video sequences, fundamental to many high-level tasks such as 3D reconstruction, visual navigation, and augmented reality. Although existing unsupervised methods have achieved promising results, their performance suffers in challenging scenes such as those with dynamic objects and occluded regions. As a result, multiple mask technologies and geometric consistency constraints are adopted in this research to mitigate their negative impacts. Firstly, multiple mask technologies are used to identify numerous outliers in the scene, which are excluded from the loss computation. In addition, the identified outliers are employed as a supervised signal to train a mask estimation network. The estimated mask is then utilized to preprocess the input to the pose estimation network, mitigating the potential adverse effects of challenging scenes on pose estimation. Furthermore, we propose geometric consistency constraints to reduce the sensitivity of illumination changes, which act as additional supervised signals to train the network. Experimental results on the KITTI dataset demonstrate that our proposed strategies can effectively enhance the model's performance, outperforming other unsupervised methods.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Unsupervised Monocular Depth and Pose Estimation Using Multiple Masks Based on Photometric and Geometric Consistency
    Kong, Huifang
    Liu, Tiankuo
    Hu, Jie
    Fang, Yao
    Sun, Jixing
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 3558 - 3563
  • [2] Occlusion-Aware Unsupervised Learning of Monocular Depth, Optical Flow and Camera Pose with Geometric Constraints
    Teng, Qianru
    Chen, Yimin
    Huang, Chen
    FUTURE INTERNET, 2018, 10 (10)
  • [3] Unsupervised Monocular Estimation of Depth and Visual Odometry Using Attention and Depth-Pose Consistency Loss
    Song, Xiaogang
    Hu, Haoyue
    Liang, Li
    Shi, Weiwei
    Xie, Guo
    Lu, Xiaofeng
    Hei, Xinhong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 3517 - 3529
  • [4] Unsupervised monocular visual odometry with decoupled camera pose estimation
    Lin, Lili
    Wang, Weisheng
    Luo, Wan
    Song, Lesheng
    Zhou, Wenhui
    DIGITAL SIGNAL PROCESSING, 2021, 114
  • [5] Unsupervised Monocular Depth Estimation with Left-Right Consistency
    Godard, Clement
    Mac Aodha, Oisin
    Brostow, Gabriel J.
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6602 - 6611
  • [6] Unsupervised Monocular Training Method for Depth Estimation Using Statistical Masks
    Wang, Xiangtong
    Li, Wei
    Yang, Menglong
    Cheng, Peng
    Liang, Binbin
    IEEE ACCESS, 2020, 8 (191530-191541): : 191530 - 191541
  • [7] Enhancing Self-supervised Monocular Depth Estimation via Piece-Wise Pose Estimation and Geometric Constraints
    Shyam, Pranjay
    Okon, Alexandre
    Yoo, HyunJin
    2024 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS, WACVW 2024, 2024, : 221 - 231
  • [8] Do Planar Constraints Improve Camera Pose Estimation in Monocular SLAM?
    Arndt, Charlotte
    Sabzevari, Reza
    Civera, Javier
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 2213 - 2222
  • [9] Unsupervised Estimation of Monocular Depth and VO in Dynamic Environments via Hybrid Masks
    Sun, Qiyu
    Tang, Yang
    Zhang, Chongzhen
    Zhao, Chaoqiang
    Qian, Feng
    Kurths, Jurgen
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (05) : 2023 - 2033
  • [10] Unsupervised Learning of Depth and Pose Based on Monocular Camera and Inertial Measurement Unit (IMU)
    Wang, Yanbo
    Yang, Hanwen
    Cai, Jianwei
    Wang, Guangming
    Wang, Jingchuan
    Huang, Yi
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 10010 - 10017