Unsupervised learning of depth and ego-motion with absolutely global scale recovery from visual and inertial data sequences

被引:0
|
作者
Meng Y. [1 ]
Sun Q. [1 ]
Zhang C. [1 ]
Tang Y. [1 ]
机构
[1] Key Laboratory of Advanced Control and Optimization for Chemical Processes of Ministry of Education, East China University of Science and Technology, Shanghai
基金
中国国家自然科学基金;
关键词
BiLSTM; depth; ego-motion; Monocular; scale recovery;
D O I
10.1080/23335777.2020.1811386
中图分类号
学科分类号
摘要
In this paper, we propose an unsupervised learning method for jointly estimating monocular depth and ego-motion, which is capable to recover the absolute scale of global camera trajectory. In order to solve the general problems of scale drift and scale ambiguity of monocular camera, we fuse geometric movement data from inertial measurement unit (IMU), and use Bi-directional Long Short-Term Memory (BiLSTM) to extract temporal features. Besides, we add a lightweight and efficient attention mechanism, Convolutional Block Attention Module (CBAM), to Convolutional Neural Networks (CNNs) to complete the extraction of image features. Considering the scenes with severe illumination changes, ambiguous structures, moving objects and occlusions, especially scenes with progressively-variant textures, the geometric features can provide adaptive estimation results in the case of the degeneration of visual features. Experiments on the KITTI driving dataset reveal that our scheme achieves promising results in the estimation of camera pose and depth. Moreover, the absolute scale recovery for the global camera trajectory is effective. © 2020 Informa UK Limited, trading as Taylor & Francis Group.
引用
收藏
页码:133 / 158
页数:25
相关论文
共 26 条
  • [21] Unsupervised Learning of Depth and Ego-Motion from Monocular Video Using 3D Geometric Constraints
    Mahjourian, Reza
    Wicke, Martin
    Angelova, Anelia
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 5667 - 5675
  • [22] Epipolar Geometry based Learning of Multi-view Depth and Ego-Motion from Monocular Sequences
    Prasad, Vignesh
    Das, Dipanjan
    Bhowmick, Brojeshwar
    ELEVENTH INDIAN CONFERENCE ON COMPUTER VISION, GRAPHICS AND IMAGE PROCESSING (ICVGIP 2018), 2018,
  • [23] Unsupervised Learning for Depth, Ego-Motion, and Optical Flow Estimation Using Coupled Consistency Conditions
    Mun, Ji-Hun
    Jeon, Moongu
    Lee, Byung-Geun
    SENSORS, 2019, 19 (11)
  • [24] Joint self-supervised learning of interest point, descriptor, depth, and ego-motion from monocular video
    Wang, Zhongyi
    Shen, Mengjiao
    Chen, Qijun
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (32) : 77529 - 77547
  • [25] Maximizing Self-Supervision From Thermal Image for Effective Self-Supervised Learning of Depth and Ego-Motion
    Shin, Ukcheol
    Lee, Kyunghyun
    Lee, Byeong-Uk
    Kweon, In So
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (03) : 7771 - 7778
  • [26] Self-Supervised Learning of Depth and Ego-Motion From Videos by Alternative Training and Geometric Constraints from 3-D to 2-D
    Fang, Jiaojiao
    Liu, Guizhong
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2023, 15 (01) : 223 - 233