Unsupervised learning of depth and ego-motion with absolutely global scale recovery from visual and inertial data sequences

Cited by: 0

Authors:

Meng Y. [1]

Sun Q. [1]

Zhang C. [1]

Tang Y. [1]

Affiliations:

[1] Key Laboratory of Advanced Control and Optimization for Chemical Processes of Ministry of Education, East China University of Science and Technology, Shanghai

Source:

Cyber-Physical Systems | 2021 / Vol. 7 / Issue 03

Funding:

National Natural Science Foundation of China;

Keywords:

BiLSTM; depth; ego-motion; Monocular; scale recovery;

DOI:

10.1080/23335777.2020.1811386

Abstract:

In this paper, we propose an unsupervised learning method for jointly estimating monocular depth and ego-motion that is capable of recovering the absolute scale of the global camera trajectory. To address the scale drift and scale ambiguity common to monocular cameras, we fuse geometric movement data from an inertial measurement unit (IMU) and use Bi-directional Long Short-Term Memory (BiLSTM) to extract temporal features. In addition, we add a lightweight and efficient attention mechanism, the Convolutional Block Attention Module (CBAM), to the Convolutional Neural Networks (CNNs) that extract image features. In scenes with severe illumination changes, ambiguous structures, moving objects and occlusions, and especially in scenes with progressively-variant textures, the geometric features can provide adaptive estimation results when the visual features degenerate. Experiments on the KITTI driving dataset show that our scheme achieves promising results in camera pose and depth estimation. Moreover, the absolute scale recovery for the global camera trajectory is effective. © 2020 Informa UK Limited, trading as Taylor & Francis Group.

Pages: 133-158

Page count: 25

26 entries in total

[21] Unsupervised Learning of Depth and Ego-Motion from Monocular Video Using 3D Geometric Constraints
Mahjourian, Reza
Wicke, Martin
Angelova, Anelia
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 5667 - 5675
[22] Epipolar Geometry based Learning of Multi-view Depth and Ego-Motion from Monocular Sequences
Prasad, Vignesh
Das, Dipanjan
Bhowmick, Brojeshwar
ELEVENTH INDIAN CONFERENCE ON COMPUTER VISION, GRAPHICS AND IMAGE PROCESSING (ICVGIP 2018), 2018,
[23] Unsupervised Learning for Depth, Ego-Motion, and Optical Flow Estimation Using Coupled Consistency Conditions
Mun, Ji-Hun
Jeon, Moongu
Lee, Byung-Geun
SENSORS, 2019, 19 (11)
[24] Joint self-supervised learning of interest point, descriptor, depth, and ego-motion from monocular video
Wang, Zhongyi
Shen, Mengjiao
Chen, Qijun
MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (32) : 77529 - 77547
[25] Maximizing Self-Supervision From Thermal Image for Effective Self-Supervised Learning of Depth and Ego-Motion
Shin, Ukcheol
Lee, Kyunghyun
Lee, Byeong-Uk
Kweon, In So
IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (03) : 7771 - 7778
[26] Self-Supervised Learning of Depth and Ego-Motion From Videos by Alternative Training and Geometric Constraints from 3-D to 2-D
Fang, Jiaojiao
Liu, Guizhong
IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2023, 15 (01) : 223 - 233
