A Global Pose and Relative Pose Fusion Network for Monocular Visual Odometry

被引:0
|
作者
Su, Bo [1 ]
Zang, Tianxiang [1 ]
机构
[1] Henan Polytech Univ, Sch Elect Engn & Automat, Jiaozuo 454003, Peoples R China
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Visual odometry; Transformers; Feature extraction; Visualization; Pose estimation; Computational modeling; Task analysis; Monocular visual odometry; trajectory drift; divided space-time attention; cross-frame attention; transformer; DEPTH; ATTENTION;
D O I
10.1109/ACCESS.2024.3439529
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Visual odometry (VO) systems typically rely on stacked convolutional layers and Long Short-Term Memory (LSTM) units to capture long-range dependencies within sequences. However, convolutional neural networks are relatively weak in modeling global context information, making them ill-suited to addressing trajectory drift issues that arise during long-term navigation. Furthermore, their inherent locality impedes the model from achieving better performance. Therefore, we propose a transformer and cross-frame-based visual odometry method, named TSCVO, to address these challenges. TSCVO consists of two main components: 1) A relative pose subnetwork based on the k-NN Timeformer model, which utilizes divided space-time attention to estimate relative poses between adjacent frames. It substitutes k-NN attention for the original fully connected self-attention in the Timeformer model to ignore irrelevant token information and reduce computational complexity. 2) A global pose subnetwork based on cross-frame attention, which utilizes cross-frame interaction mechanisms to learn long-range dependencies and internal correlations within image sequences. Additionally, we introduce robust - as the foundational loss function, supplemented by geometric consistency loss to further constrain structural similarity between adjacent frames and improve prediction accuracy. The evaluation results on the KITTI and EuRoC datasets indicate that TSCVO outperforms other learning-based methods.
引用
收藏
页码:108863 / 108875
页数:13
相关论文
共 50 条
  • [1] Pose Graph for Improved Monocular Visual Odometry
    Kieman, Pawel
    Narkiewicz, Janusz
    2014 19TH INTERNATIONAL CONFERENCE ON METHODS AND MODELS IN AUTOMATION AND ROBOTICS (MMAR), 2014, : 549 - 553
  • [2] Pose Graph Optimization for Unsupervised Monocular Visual Odometry
    Li, Yang
    Ushiku, Yoshitaka
    Harada, Tatsuya
    2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 5439 - 5445
  • [3] Unsupervised monocular visual odometry with decoupled camera pose estimation
    Lin, Lili
    Wang, Weisheng
    Luo, Wan
    Song, Lesheng
    Zhou, Wenhui
    DIGITAL SIGNAL PROCESSING, 2021, 114
  • [4] WPO-Net: Windowed Pose Optimization Network for Monocular Visual Odometry Estimation
    Gadipudi, Nivesh
    Elamvazuthi, Irraivan
    Lu, Cheng-Kai
    Paramasivam, Sivajothi
    Su, Steven
    SENSORS, 2021, 21 (23)
  • [5] DPDM: FEATURE-BASED POSE REFINEMENT WITH DEEP POSE AND DEEP MATCH FOR MONOCULAR VISUAL ODOMETRY
    Huang, Li-Yang
    Huang, Shao-Syuan
    Chien, Shao-Yi
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 1370 - 1374
  • [6] Robust and Efficient Estimation of Absolute Camera Pose for Monocular Visual Odometry
    Li, Haoang
    Chen, Wen
    Zhao, Ji
    Bazin, Jean-Charles
    Luo, Lei
    Liu, Zhe
    Liu, Yun-Hui
    2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 2675 - 2681
  • [7] Combining Points and Lines for Camera Pose Estimation and Optimization in Monocular Visual Odometry
    Li, Haoang
    Yao, Jian
    Lu, Xiaohu
    Wu, Junlin
    2017 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2017, : 1289 - 1296
  • [8] Pose Estimation Based on Monocular Visual Odometry and Lane Detection for Intelligent Vehicles
    Galarza, Juan
    Perez, Esteban
    Serrano, Esteban
    Tapia, Andres
    Aguilar, Wilbert G.
    AUGMENTED REALITY, VIRTUAL REALITY, AND COMPUTER GRAPHICS, AVR 2018, PT II, 2018, 10851 : 562 - 566
  • [9] Robust self-supervised monocular visual odometry based on prediction-update pose estimation network
    Xiu, Haixin
    Liang, Yiyou
    Zeng, Hui
    Li, Qing
    Liu, Hongmin
    Fan, Bin
    Li, Chen
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 116
  • [10] A 2-point Pose Estimation Algorithm for Monocular Visual Odometry of Ground Vehicles
    Gao, Yanyan
    Chen, Jian
    Zhang, Kaixiang
    Jia, Bingxi
    2017 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2017, : 1610 - 1615