From Local Understanding to Global Regression in Monocular Visual Odometry

被引：6

作者：

Esfahani, Mandi Abolfazli ^{[1
]}

Wu, Keyu ^{[1
]}

Yuan, Shenghai ^{[1
]}

Wang, Han ^{[1
]}

机构：

[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore, Singapore

来源：

INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE | 2020年 / 34卷 / 01期

关键词：

Visual odometry; deep learning; convolutional neural network (CNN); simultaneous localization and mapping (SLAM); classification; regression;

D O I：

10.1142/S0218001420550022

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The most significant part of any autonomous intelligent robot is the localization module that gives the robot knowledge about its position and orientation. This knowledge assists the robot to move to the location of its desired goal and complete its task. Visual Odometry (VO) measures the displacement of the robots' camera in consecutive frames which results in the estimation of the robot position and orientation. Deep Learning, nowadays, helps to learn rich and informative features for the problem of VO to estimate frame-by-frame camera movement. Recent Deep Learning-based VO methods train an end-by-end network to solve VO as a regression problem directly without visualizing and sensing the label of training data in the training procedure. In this paper, a new approach to train Convolutional Neural Networks (CNNs) for the regression problems, such as VO, is proposed. The proposed method first changes the problem to a classification problem to learn different subspaces with similar observations. After solving the classification problem, the problem converts to the original regression problem to solve using the knowledge achieved by solving the classification problem. This approach helps CNN to solve regression problem globally in a local domain learned in the classification step, and improves the performance of the regression module for approximately 10%.

引用

页数：16

共 50 条

[21] Resolving Scale Ambiguity for Monocular Visual Odometry
Choi, Sunglok
Park, Jaehyun
Yu, Wonpil
2013 10TH INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS AND AMBIENT INTELLIGENCE (URAI), 2013, : 604 - 608
[22] Deep Monocular Visual Odometry for Ground Vehicle
Wang, Xiangwei
Zhang, Hui
IEEE ACCESS, 2020, 8 : 175220 - 175229
[23] Multimodal Scale Estimation for Monocular Visual Odometry
Fanani, Nolang
Stuerck, Alina
Barnada, Marc
Mester, Rudolf
2017 28TH IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV 2017), 2017, : 1714 - 1721
[24] Monocular Visual Odometry for underground railway scenarios
Etxeberria-Garcia, Mikel
Labayen, Mikel
Eizaguirre, Fernando
Zamalloa, Maider
Arana-Arexolaleiba, Nestor
FIFTEENTH INTERNATIONAL CONFERENCE ON QUALITY CONTROL BY ARTIFICIAL VISION, 2021, 11794
[25] LIMO: Lidar-Monocular Visual Odometry
Graeter, Johannes
Wilczynski, Alexander
Lauer, Martin
2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2018, : 7872 - 7879
[26] Perceptual Enhancement for Unsupervised Monocular Visual Odometry
Wang, Zhongyi
Shen, Mengjiao
Liu, Chengju
Chen, Qijun
INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2025, 23 (01) : 346 - 357
[27] Milk: Monocular Visual Odometry with Motion Constraints
Choi, Sunglok
Yu, Wonpil
2012 9TH INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS AND AMBIENT INTELLIGENCE (URAL), 2012, : 199 - 199
[28] Robust Monocular Visual Odometry by Uncertainty Voting
Van Hamme, David
Veelaert, Peter
Philips, Wilfried
2011 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2011, : 643 - 647
[29] Local to Global: Efficient Visual Localization for a Monocular Camera
Lee, Sang Jun
Kim, Deokhwa
Hwang, Sung Soo
Lee, Donghwan
2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, : 2230 - 2239
[30] STMVO: biologically inspired monocular visual odometry
Yangming Li
Jian Zhang
Shuai Li
Neural Computing and Applications, 2018, 29 : 215 - 225

← 1 2 3 4 5 →