Camera pose estimation in multi-view environments: From virtual scenarios to the real world

被引：11

作者：

Charco, Jorge L. ^{[1
,2
]}

Sappa, Angel D. ^{[2
,3
]}

Vintimilla, Boris X. ^{[2
]}

Velesaca, Henry O. ^{[2
]}

机构：

[1] Univ Guayaquil, Delta & Kennedy Av,PB EC090514, Guayaquil, Ecuador

[2] Escuela Super Politecn Litoral, ESPOL, Campus Gustavo Galindo Km 30-5 Via Perimetral, Guayaquil, Ecuador

[3] Comp Vis Ctr, Edifici O,Campus UAB, Barcelona 08193, Spain

来源：

IMAGE AND VISION COMPUTING | 2021年 / 110卷

关键词：

Relative camera pose estimation; Domain adaptation; Siamese architecture; Synthetic data; Multi-view environments;

D O I：

10.1016/j.imavis.2021.104182

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper presents a domain adaptation strategy to efficiently train network architectures for estimating the relative camera pose in multi-view scenarios. The network architectures are fed by a pair of simultaneously acquired images, hence in order to improve the accuracy of the solutions, and due to the lack of large datasets with pairs of overlapped images, a domain adaptation strategy is proposed. The domain adaptation strategy consists on transferring the knowledge learned from synthetic images to real-world scenarios. For this, the networks are firstly trained using pairs of synthetic images, which are captured at the same time by a pair of cameras in a virtual environment; and then, the learned weights of the networks are transferred to the real-world case, where the networks are retrained with a few real images. Different virtual 3D scenarios are generated to evaluate the relationship between the accuracy on the result and the similarity between virtual and real scenarios & mdash;similarity on both geometry of the objects contained in the scene as well as relative pose between camera and objects in the scene. Experimental results and comparisons are provided showing that the accuracy of all the evaluated networks for estimating the camera pose improves when the proposed domain adaptation strategy is used, highlighting the importance on the similarity between virtual-real scenarios. (c) 2021 Published by Elsevier B.V.

引用

页数：12

共 50 条

[41] A Hierarchical Approach for Joint Multi-view Object Pose Estimation and Categorization
Ozay, Mete
Walas, Krzysztof
Leonardis, Ales
2014 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2014, : 5480 - 5487
[42] A view-based statistical system for multi-view face detection and pose estimation
Chen, Ju-Chin
Lien, Jenn-Jier James
IMAGE AND VISION COMPUTING, 2009, 27 (09) : 1252 - 1271
[43] 3D human pose estimation in multi-view operating room videos using differentiable camera projections
Gerats, Beerend G. A.
Wolterink, Jelmer M.
Broeders, Ivo A. M. J.
COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING-IMAGING AND VISUALIZATION, 2023, 11 (04): : 1197 - 1205
[44] Generalizing the virtual camera pose for view synthesis
Martín, EX
Martínez, AB
IMAGE ANALYSIS, PROCEEDINGS, 2003, 2749 : 701 - 708
[45] PPT: Token-Pruned Pose Transformer for Monocular and Multi-view Human Pose Estimation
Ma, Haoyu
Wang, Zhe
Chen, Yifei
Kong, Deying
Chen, Liangjian
Liu, Xingwei
Yan, Xiangyi
Tang, Hao
Xie, Xiaohui
COMPUTER VISION - ECCV 2022, PT V, 2022, 13665 : 424 - 442
[46] Direct Multi-view Multi-person 3D Pose Estimation
Wang, Tao
Zhang, Jianfeng
Cai, Yujun
Yan, Shuicheng
Feng, Jiashi
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[47] 3D Human Pose Estimation from multi-view thermal vision sensors
Lupion, Marcos
Polo-Rodriguez, Aurora
Medina-Quero, Javier
Sanjuan, Juan F.
Ortigosa, Pilar M.
INFORMATION FUSION, 2024, 104
[48] Learning Monocular 3D Human Pose Estimation from Multi-view Images
Rhodin, Helge
Sporri, Jorg
Katircioglu, Isinsu
Constantin, Victor
Meyer, Frederic
Mueller, Erich
Salzmann, Mathieu
Fua, Pascal
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 8437 - 8446
[49] Real-Time Multi-View Face Detection and Pose Estimation Based on Cost-Sensitive AdaBoost
马勇
丁晓青
TsinghuaScienceandTechnology, 2005, (02) : 152 - 157
[50] Flycon: Real-time Environment-independent Multi-view Human Pose Estimation with Aerial Vehicles
Nageli, Tobias
Oberholzer, Samuel
Pluss, Silvan
Alonso-Mora, Javier
Hilliges, Otmar
ACM TRANSACTIONS ON GRAPHICS, 2018, 37 (06):

← 1 2 3 4 5 →