Relative pose estimation from panoramic images using a hybrid neural network architecture

被引:0
|
作者
Offermann, Lars [1 ]
机构
[1] Bielefeld Univ, Fac Technol, D-33615 Bielefeld, Germany
来源
SCIENTIFIC REPORTS | 2024年 / 14卷 / 01期
关键词
VISUAL ODOMETRY; NAVIGATION;
D O I
10.1038/s41598-024-75124-7
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Camera-based relative pose estimation (RPE) localizes a mobile robot given a view at the current position and an image at a reference location. Matching the landmarks between views is critical to localization quality. Common challenges are appearance changes, for example due to differing illumination. Indirect RPE methods extract high-level features that provide invariance against appearance changes but neglect the remaining image data. This can lead to poor pose estimates in scenes with little detail. Direct RPE methods mitigate this issue by operating on the pixel level with only moderate preprocessing, but invariances have to be achieved by different means. We propose to attain illumination invariance for the direct RPE algorithm MinWarping by integrating it with a convolutional neural network for image preprocessing, creating a hybrid architecture. We optimize network parameters using a metric on RPE quality, backpropagating through MinWarping and the network. We focus on planar movement, panoramic images, and indoor scenes with varying illumination conditions; a novel dataset for this setup is recorded and used for analysis. Our method compares favourably against the previous best preprocessing method for MinWarping, edge filtering, and against a modern deep-learning-based indirect RPE pipeline. Analysis of the trained hybrid architecture indicates that neglecting landmarks in a direct RPE framework can improve estimation quality in scenes with occlusion and few details.
引用
收藏
页数:25
相关论文
共 50 条
  • [21] Head Pose Estimation for an Omnidirectional Camera using a Convolutional Neural Network
    Yamaura, Yusuke
    Tsuboshita, Yukihiro
    Onishi, Takeshi
    PROCEEDINGS 2018 IEEE 13TH IMAGE, VIDEO, AND MULTIDIMENSIONAL SIGNAL PROCESSING WORKSHOP (IVMSP), 2018,
  • [22] Segmentation of ultrasound images by using a hybrid neural network
    Dokur, Z
    Ölmez, T
    PATTERN RECOGNITION LETTERS, 2002, 23 (14) : 1825 - 1836
  • [23] Implementation experiments on convolutional neural network training using synthetic images for 3D pose estimation of an excavator on real images
    Mahmood, Bilawal
    Han, SangUk
    Seo, Jongwon
    AUTOMATION IN CONSTRUCTION, 2022, 133
  • [24] Hierarchical neural network for hand pose estimation
    Chen, Zheng
    Du, Kuo
    Sun, Yi
    Lin, Xiangbo
    Ma, Xiaohong
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2020, 87
  • [25] Novel Hybrid Neural Network for Dense Depth Estimation using On-Board Monocular Images
    Jia, Shaocheng
    Pei, Xin
    Yang, Zi
    Tian, Shan
    Yue, Yun
    TRANSPORTATION RESEARCH RECORD, 2020, 2674 (12) : 312 - 323
  • [26] Camera pose estimation by an artificial neural network
    Benton, Ryan G.
    Chu, Chee-hung Henry
    NEURAL INFORMATION PROCESSING, PT 2, PROCEEDINGS, 2006, 4233 : 604 - 611
  • [27] Preterm infants' limb-pose estimation from depth images using convolutional neural networks
    Moccia, Sara
    Migliorelli, Lucia
    Pietrini, Rocco
    Frontoni, Emanuele
    2019 16TH IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY - CIBCB 2019, 2019, : 1 - 7
  • [28] Joint Customer Pose and Orientation Estimation using Deep Neural Network from Surveillance Camera
    Liu, Jingwen
    Gu, Yanlei
    Kamijo, Shunsuke
    PROCEEDINGS OF 2016 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2016, : 216 - 221
  • [29] Extreme Relative Pose Network under Hybrid Representations
    Yang, Zhenpei
    Yan, Siming
    Huang, Qixing
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 2452 - 2461
  • [30] Depth estimation from single monocular images using deep hybrid network
    Aleksei Grigorev
    Feng Jiang
    Seungmin Rho
    Worku J. Sori
    Shaohui Liu
    Sergey Sai
    Multimedia Tools and Applications, 2017, 76 : 18585 - 18604