From Local Understanding to Global Regression in Monocular Visual Odometry

被引:6
|
作者
Esfahani, Mandi Abolfazli [1 ]
Wu, Keyu [1 ]
Yuan, Shenghai [1 ]
Wang, Han [1 ]
机构
[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore, Singapore
关键词
Visual odometry; deep learning; convolutional neural network (CNN); simultaneous localization and mapping (SLAM); classification; regression;
D O I
10.1142/S0218001420550022
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The most significant part of any autonomous intelligent robot is the localization module that gives the robot knowledge about its position and orientation. This knowledge assists the robot to move to the location of its desired goal and complete its task. Visual Odometry (VO) measures the displacement of the robots' camera in consecutive frames which results in the estimation of the robot position and orientation. Deep Learning, nowadays, helps to learn rich and informative features for the problem of VO to estimate frame-by-frame camera movement. Recent Deep Learning-based VO methods train an end-by-end network to solve VO as a regression problem directly without visualizing and sensing the label of training data in the training procedure. In this paper, a new approach to train Convolutional Neural Networks (CNNs) for the regression problems, such as VO, is proposed. The proposed method first changes the problem to a classification problem to learn different subspaces with similar observations. After solving the classification problem, the problem converts to the original regression problem to solve using the knowledge achieved by solving the classification problem. This approach helps CNN to solve regression problem globally in a local domain learned in the classification step, and improves the performance of the regression module for approximately 10%.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] Resolving Scale Ambiguity for Monocular Visual Odometry
    Choi, Sunglok
    Park, Jaehyun
    Yu, Wonpil
    2013 10TH INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS AND AMBIENT INTELLIGENCE (URAI), 2013, : 604 - 608
  • [22] Deep Monocular Visual Odometry for Ground Vehicle
    Wang, Xiangwei
    Zhang, Hui
    IEEE ACCESS, 2020, 8 : 175220 - 175229
  • [23] Multimodal Scale Estimation for Monocular Visual Odometry
    Fanani, Nolang
    Stuerck, Alina
    Barnada, Marc
    Mester, Rudolf
    2017 28TH IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV 2017), 2017, : 1714 - 1721
  • [24] Monocular Visual Odometry for underground railway scenarios
    Etxeberria-Garcia, Mikel
    Labayen, Mikel
    Eizaguirre, Fernando
    Zamalloa, Maider
    Arana-Arexolaleiba, Nestor
    FIFTEENTH INTERNATIONAL CONFERENCE ON QUALITY CONTROL BY ARTIFICIAL VISION, 2021, 11794
  • [25] LIMO: Lidar-Monocular Visual Odometry
    Graeter, Johannes
    Wilczynski, Alexander
    Lauer, Martin
    2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2018, : 7872 - 7879
  • [26] Perceptual Enhancement for Unsupervised Monocular Visual Odometry
    Wang, Zhongyi
    Shen, Mengjiao
    Liu, Chengju
    Chen, Qijun
    INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2025, 23 (01) : 346 - 357
  • [27] Milk: Monocular Visual Odometry with Motion Constraints
    Choi, Sunglok
    Yu, Wonpil
    2012 9TH INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS AND AMBIENT INTELLIGENCE (URAL), 2012, : 199 - 199
  • [28] Robust Monocular Visual Odometry by Uncertainty Voting
    Van Hamme, David
    Veelaert, Peter
    Philips, Wilfried
    2011 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2011, : 643 - 647
  • [29] Local to Global: Efficient Visual Localization for a Monocular Camera
    Lee, Sang Jun
    Kim, Deokhwa
    Hwang, Sung Soo
    Lee, Donghwan
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, : 2230 - 2239
  • [30] STMVO: biologically inspired monocular visual odometry
    Yangming Li
    Jian Zhang
    Shuai Li
    Neural Computing and Applications, 2018, 29 : 215 - 225