Camera pose estimation in multi-view environments: From virtual scenarios to the real world

Cited by: 11
Authors
Charco, Jorge L. [1 ,2 ]
Sappa, Angel D. [2 ,3 ]
Vintimilla, Boris X. [2 ]
Velesaca, Henry O. [2 ]
Affiliations
[1] Univ Guayaquil, Delta & Kennedy Av,PB EC090514, Guayaquil, Ecuador
[2] Escuela Super Politecn Litoral, ESPOL, Campus Gustavo Galindo Km 30-5 Via Perimetral, Guayaquil, Ecuador
[3] Comp Vis Ctr, Edifici O,Campus UAB, Barcelona 08193, Spain
Keywords
Relative camera pose estimation; Domain adaptation; Siamese architecture; Synthetic data; Multi-view environments;
DOI
10.1016/j.imavis.2021.104182
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper presents a domain adaptation strategy to efficiently train network architectures for estimating the relative camera pose in multi-view scenarios. The networks take a pair of simultaneously acquired images as input. Because large datasets with pairs of overlapping images are lacking, a domain adaptation strategy is proposed to improve the accuracy of the solutions. The strategy consists of transferring the knowledge learned from synthetic images to real-world scenarios. The networks are first trained using pairs of synthetic images, captured at the same time by a pair of cameras in a virtual environment; the learned weights are then transferred to the real-world case, where the networks are retrained with a few real images. Different virtual 3D scenarios are generated to evaluate the relationship between the accuracy of the result and the similarity between virtual and real scenarios, in terms of both the geometry of the objects contained in the scene and the relative pose between the camera and the objects in the scene. Experimental results and comparisons show that the accuracy of all the evaluated networks for estimating the camera pose improves when the proposed domain adaptation strategy is used, highlighting the importance of the similarity between virtual and real scenarios. © 2021 Published by Elsevier B.V.
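The two-stage idea described in the abstract (pretrain on plentiful synthetic data, then transfer the weights and retrain on a few real samples) can be illustrated with a deliberately tiny sketch. This is not the authors' method or networks: a scalar linear model stands in for a pose network, and the data, the `train` and `mse` helpers, the learning rate, and the step counts are all assumptions made up for the example.

```python
# Toy illustration only: a scalar linear model stands in for a pose network.
# All data and hyperparameters below are hypothetical, chosen for the example.

def train(w, b, data, lr=0.1, steps=100):
    """Full-batch gradient descent on mean squared error for y = w*x + b."""
    n = len(data)
    for _ in range(steps):
        grad_w = sum(2 * (w * x + b - y) * x for x, y in data) / n
        grad_b = sum(2 * (w * x + b - y) for x, y in data) / n
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b


def mse(w, b, data):
    """Mean squared error of the model on a list of (x, y) pairs."""
    return sum((w * x + b - y) ** 2 for x, y in data) / len(data)


def real_fn(x):
    """Assumed "real-world" mapping, close to but not equal to the synthetic one."""
    return 2.2 * x + 0.8


# Large "synthetic" set: plentiful, drawn from a slightly different mapping.
synthetic = [(i / 10, 2.0 * (i / 10) + 1.0) for i in range(-10, 11)]
# Small "real" set: only three training samples, plus held-out test points.
real_train = [(x, real_fn(x)) for x in (-1.0, 0.0, 1.0)]
real_test = [(x, real_fn(x)) for x in (-0.5, 0.25, 0.5)]

# Stage 1: pretrain on synthetic data from a zero initialisation.
w_syn, b_syn = train(0.0, 0.0, synthetic, steps=500)
# Stage 2: transfer the weights and retrain briefly on the few real samples.
w_ft, b_ft = train(w_syn, b_syn, real_train, steps=5)
# Baseline: the same short training budget on real data, without pretraining.
w_sc, b_sc = train(0.0, 0.0, real_train, steps=5)

finetuned_err = mse(w_ft, b_ft, real_test)
scratch_err = mse(w_sc, b_sc, real_test)
print(f"fine-tuned: {finetuned_err:.4f}  from scratch: {scratch_err:.4f}")
```

In this toy setting the pretrained model starts near the real mapping, so the same short retraining budget leaves it with a lower held-out error than the model trained from scratch. The gap depends on how close the assumed synthetic mapping is to the real one, which loosely mirrors the abstract's point about virtual-real similarity.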
Pages: 12
Related papers
50 items in total
  • [41] A Hierarchical Approach for Joint Multi-view Object Pose Estimation and Categorization
    Ozay, Mete
    Walas, Krzysztof
    Leonardis, Ales
    2014 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2014, : 5480 - 5487
  • [42] A view-based statistical system for multi-view face detection and pose estimation
    Chen, Ju-Chin
    Lien, Jenn-Jier James
    IMAGE AND VISION COMPUTING, 2009, 27 (09) : 1252 - 1271
  • [43] 3D human pose estimation in multi-view operating room videos using differentiable camera projections
    Gerats, Beerend G. A.
    Wolterink, Jelmer M.
    Broeders, Ivo A. M. J.
    COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING-IMAGING AND VISUALIZATION, 2023, 11 (04): : 1197 - 1205
  • [44] Generalizing the virtual camera pose for view synthesis
    Martín, EX
    Martínez, AB
    IMAGE ANALYSIS, PROCEEDINGS, 2003, 2749 : 701 - 708
  • [45] PPT: Token-Pruned Pose Transformer for Monocular and Multi-view Human Pose Estimation
    Ma, Haoyu
    Wang, Zhe
    Chen, Yifei
    Kong, Deying
    Chen, Liangjian
    Liu, Xingwei
    Yan, Xiangyi
    Tang, Hao
    Xie, Xiaohui
    COMPUTER VISION - ECCV 2022, PT V, 2022, 13665 : 424 - 442
  • [46] Direct Multi-view Multi-person 3D Pose Estimation
    Wang, Tao
    Zhang, Jianfeng
    Cai, Yujun
    Yan, Shuicheng
    Feng, Jiashi
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [47] 3D Human Pose Estimation from multi-view thermal vision sensors
    Lupion, Marcos
    Polo-Rodriguez, Aurora
    Medina-Quero, Javier
    Sanjuan, Juan F.
    Ortigosa, Pilar M.
    INFORMATION FUSION, 2024, 104
  • [48] Learning Monocular 3D Human Pose Estimation from Multi-view Images
    Rhodin, Helge
    Sporri, Jorg
    Katircioglu, Isinsu
    Constantin, Victor
    Meyer, Frederic
    Mueller, Erich
    Salzmann, Mathieu
    Fua, Pascal
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 8437 - 8446
  • [49] Real-Time Multi-View Face Detection and Pose Estimation Based on Cost-Sensitive AdaBoost
    马勇
    丁晓青
    Tsinghua Science and Technology, 2005, (02) : 152 - 157
  • [50] Flycon: Real-time Environment-independent Multi-view Human Pose Estimation with Aerial Vehicles
    Nageli, Tobias
    Oberholzer, Samuel
    Pluss, Silvan
    Alonso-Mora, Javier
    Hilliges, Otmar
    ACM TRANSACTIONS ON GRAPHICS, 2018, 37 (06):