From Local Understanding to Global Regression in Monocular Visual Odometry

被引:6
|
作者
Esfahani, Mandi Abolfazli [1 ]
Wu, Keyu [1 ]
Yuan, Shenghai [1 ]
Wang, Han [1 ]
机构
[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore, Singapore
关键词
Visual odometry; deep learning; convolutional neural network (CNN); simultaneous localization and mapping (SLAM); classification; regression;
D O I
10.1142/S0218001420550022
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The most significant part of any autonomous intelligent robot is the localization module that gives the robot knowledge about its position and orientation. This knowledge assists the robot to move to the location of its desired goal and complete its task. Visual Odometry (VO) measures the displacement of the robots' camera in consecutive frames which results in the estimation of the robot position and orientation. Deep Learning, nowadays, helps to learn rich and informative features for the problem of VO to estimate frame-by-frame camera movement. Recent Deep Learning-based VO methods train an end-by-end network to solve VO as a regression problem directly without visualizing and sensing the label of training data in the training procedure. In this paper, a new approach to train Convolutional Neural Networks (CNNs) for the regression problems, such as VO, is proposed. The proposed method first changes the problem to a classification problem to learn different subspaces with similar observations. After solving the classification problem, the problem converts to the original regression problem to solve using the knowledge achieved by solving the classification problem. This approach helps CNN to solve regression problem globally in a local domain learned in the classification step, and improves the performance of the regression module for approximately 10%.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] DYNAMIC OBJECT-AWARE MONOCULAR VISUAL ODOMETRY WITH LOCAL AND GLOBAL INFORMATION AGGREGATION
    Wan, Yiming
    Gao, Wei
    Han, Sheng
    Wu, Yihong
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 603 - 607
  • [2] A review of monocular visual odometry
    He, Ming
    Zhu, Chaozheng
    Huang, Qian
    Ren, Baosen
    Liu, Jintao
    VISUAL COMPUTER, 2020, 36 (05): : 1053 - 1065
  • [3] A review of monocular visual odometry
    Ming He
    Chaozheng Zhu
    Qian Huang
    Baosen Ren
    Jintao Liu
    The Visual Computer, 2020, 36 : 1053 - 1065
  • [4] Monocular SLAM for visual odometry
    Munguia, Rodrigo
    Grau, Antoni
    2007 IEEE INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING, CONFERENCE PROCEEDINGS BOOK, 2007, : 443 - 448
  • [5] Fast and accurate visual odometry from a monocular camera
    Xin Yang
    Tangli Xue
    Hongcheng Luo
    Jiabin Guo
    Frontiers of Computer Science, 2019, 13 : 1326 - 1336
  • [6] Fast and accurate visual odometry from a monocular camera
    Yang, Xin
    Xue, Tangli
    Luo, Hongcheng
    Guo, Jiabin
    FRONTIERS OF COMPUTER SCIENCE, 2019, 13 (06) : 1326 - 1336
  • [7] A Global Pose and Relative Pose Fusion Network for Monocular Visual Odometry
    Su, Bo
    Zang, Tianxiang
    IEEE ACCESS, 2024, 12 : 108863 - 108875
  • [8] Robust Stereo Visual Odometry from Monocular Techniques
    Persson, Mikael
    Piccini, Tommaso
    Felsberg, Michael
    Mester, Rudolf
    2015 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2015, : 686 - 691
  • [9] The precision of triangulation in monocular visual odometry
    Fanani, Nolang
    Mester, Rudolf
    2018 IEEE SOUTHWEST SYMPOSIUM ON IMAGE ANALYSIS AND INTERPRETATION (SSIAI), 2018, : 73 - 76
  • [10] An Adaptive Strategy For Monocular Visual Odometry
    Xie, Shengguang
    Lin, Mengxiang
    Hang, Meng
    Ding, Rong
    PROCEEDINGS OF 2017 INTERNATIONAL CONFERENCE ON ROBOTICS AND ARTIFICIAL INTELLIGENCE (ICRAI 2017), 2015, : 46 - 50