Relative pose estimation from panoramic images using a hybrid neural network architecture

被引：0

作者：

Offermann, Lars ^{[1
]}

机构：

[1] Bielefeld Univ, Fac Technol, D-33615 Bielefeld, Germany

来源：

SCIENTIFIC REPORTS | 2024年 / 14卷 / 01期

关键词：

VISUAL ODOMETRY; NAVIGATION;

D O I：

10.1038/s41598-024-75124-7

中图分类号：

O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Camera-based relative pose estimation (RPE) localizes a mobile robot given a view at the current position and an image at a reference location. Matching the landmarks between views is critical to localization quality. Common challenges are appearance changes, for example due to differing illumination. Indirect RPE methods extract high-level features that provide invariance against appearance changes but neglect the remaining image data. This can lead to poor pose estimates in scenes with little detail. Direct RPE methods mitigate this issue by operating on the pixel level with only moderate preprocessing, but invariances have to be achieved by different means. We propose to attain illumination invariance for the direct RPE algorithm MinWarping by integrating it with a convolutional neural network for image preprocessing, creating a hybrid architecture. We optimize network parameters using a metric on RPE quality, backpropagating through MinWarping and the network. We focus on planar movement, panoramic images, and indoor scenes with varying illumination conditions; a novel dataset for this setup is recorded and used for analysis. Our method compares favourably against the previous best preprocessing method for MinWarping, edge filtering, and against a modern deep-learning-based indirect RPE pipeline. Analysis of the trained hybrid architecture indicates that neglecting landmarks in a direct RPE framework can improve estimation quality in scenes with occlusion and few details.

引用

页数：25

共 50 条

[21] Head Pose Estimation for an Omnidirectional Camera using a Convolutional Neural Network
Yamaura, Yusuke
Tsuboshita, Yukihiro
Onishi, Takeshi
PROCEEDINGS 2018 IEEE 13TH IMAGE, VIDEO, AND MULTIDIMENSIONAL SIGNAL PROCESSING WORKSHOP (IVMSP), 2018,
[22] Segmentation of ultrasound images by using a hybrid neural network
Dokur, Z
Ölmez, T
PATTERN RECOGNITION LETTERS, 2002, 23 (14) : 1825 - 1836
[23] Implementation experiments on convolutional neural network training using synthetic images for 3D pose estimation of an excavator on real images
Mahmood, Bilawal
Han, SangUk
Seo, Jongwon
AUTOMATION IN CONSTRUCTION, 2022, 133
[24] Hierarchical neural network for hand pose estimation
Chen, Zheng
Du, Kuo
Sun, Yi
Lin, Xiangbo
Ma, Xiaohong
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2020, 87
[25] Novel Hybrid Neural Network for Dense Depth Estimation using On-Board Monocular Images
Jia, Shaocheng
Pei, Xin
Yang, Zi
Tian, Shan
Yue, Yun
TRANSPORTATION RESEARCH RECORD, 2020, 2674 (12) : 312 - 323
[26] Camera pose estimation by an artificial neural network
Benton, Ryan G.
Chu, Chee-hung Henry
NEURAL INFORMATION PROCESSING, PT 2, PROCEEDINGS, 2006, 4233 : 604 - 611
[27] Preterm infants' limb-pose estimation from depth images using convolutional neural networks
Moccia, Sara
Migliorelli, Lucia
Pietrini, Rocco
Frontoni, Emanuele
2019 16TH IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY - CIBCB 2019, 2019, : 1 - 7
[28] Joint Customer Pose and Orientation Estimation using Deep Neural Network from Surveillance Camera
Liu, Jingwen
Gu, Yanlei
Kamijo, Shunsuke
PROCEEDINGS OF 2016 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2016, : 216 - 221
[29] Extreme Relative Pose Network under Hybrid Representations
Yang, Zhenpei
Yan, Siming
Huang, Qixing
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 2452 - 2461
[30] Depth estimation from single monocular images using deep hybrid network
Aleksei Grigorev
Feng Jiang
Seungmin Rho
Worku J. Sori
Shaohui Liu
Sergey Sai
Multimedia Tools and Applications, 2017, 76 : 18585 - 18604

← 1 2 3 4 5 →