Unsupervised Monocular Depth and Camera Pose Estimation with Multiple Masks and Geometric Consistency Constraints

被引:1
|
作者
Zhang, Xudong [1 ]
Zhao, Baigan [2 ]
Yao, Jiannan [2 ]
Wu, Guoqing [1 ]
机构
[1] Nantong Univ, Sch Informat Sci & Technol, Nantong 226019, Peoples R China
[2] Nantong Univ, Sch Mech Engn, Nantong 226019, Peoples R China
基金
中国国家自然科学基金;
关键词
depth estimation; camera pose; visual odometry; unsupervised learning; VISUAL ODOMETRY;
D O I
10.3390/s23115329
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
This paper presents a novel unsupervised learning framework for estimating scene depth and camera pose from video sequences, fundamental to many high-level tasks such as 3D reconstruction, visual navigation, and augmented reality. Although existing unsupervised methods have achieved promising results, their performance suffers in challenging scenes such as those with dynamic objects and occluded regions. As a result, multiple mask technologies and geometric consistency constraints are adopted in this research to mitigate their negative impacts. Firstly, multiple mask technologies are used to identify numerous outliers in the scene, which are excluded from the loss computation. In addition, the identified outliers are employed as a supervised signal to train a mask estimation network. The estimated mask is then utilized to preprocess the input to the pose estimation network, mitigating the potential adverse effects of challenging scenes on pose estimation. Furthermore, we propose geometric consistency constraints to reduce the sensitivity of illumination changes, which act as additional supervised signals to train the network. Experimental results on the KITTI dataset demonstrate that our proposed strategies can effectively enhance the model's performance, outperforming other unsupervised methods.
引用
收藏
页数:18
相关论文
共 50 条
  • [41] Unsupervised Learning of Depth and Camera Pose with Feature Map Warping
    Guo, Ente
    Chen, Zhifeng
    Zhou, Yanlin
    Wu, Dapeng Oliver
    SENSORS, 2021, 21 (03) : 1 - 15
  • [42] Multi-body Depth and Camera Pose Estimation from Multiple Views
    Dal Cin, Andrea Porfiri
    Boracchi, Giacomo
    Magri, Luca
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 17758 - 17768
  • [43] Unsupervised Monocular Depth Estimation in Highly Complex Environments
    Zhao, Chaoqiang
    Tang, Yang
    Sun, Qiyu
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2022, 6 (05): : 1237 - 1246
  • [44] Unsupervised Monocular Depth Estimation for Autonomous Flight of Drones
    Zhao Shuanfeng
    Huang Tao
    Xu Qian
    Geng Longlong
    LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (02)
  • [45] Unsupervised monocular depth estimation based on edge enhancement
    Qu Y.
    Chen Y.
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2024, 46 (01): : 71 - 79
  • [46] Learning monocular depth estimation with unsupervised trinocular assumptions
    Poggi, Matteo
    Tosi, Fabio
    Mattoccia, Stefano
    2018 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2018, : 324 - 333
  • [47] An Adaptive Unsupervised Learning Framework for Monocular Depth Estimation
    Yang, Delong
    Zhong, Xunyu
    Lin, Lixiong
    Peng, Xiafu
    IEEE ACCESS, 2019, 7 : 148142 - 148151
  • [48] Camera Pose Estimation Based on Angle Constraints
    Wang, Fei
    Jiang, Caigui
    Zheng, Nanning
    Guo, Yu
    ADVANCES IN VISUAL COMPUTING, PT I, 2010, 6453 : 96 - 107
  • [49] Unsupervised Learning of Monocular Depth and Ego-Motion with Optical Flow Features and Multiple Constraints
    Zhao, Baigan
    Huang, Yingping
    Ci, Wenyan
    Hu, Xing
    SENSORS, 2022, 22 (04)
  • [50] Unsupervised monocular depth estimation with omnidirectional camera for 3D reconstruction of grape berries in the wild
    Tamura, Yasuto
    Utsumi, Yuzuko
    Miwa, Yuka
    Iwamura, Masakazu
    Kise, Koichi
    PLOS ONE, 2025, 20 (02):