Camera pose estimation in multi-view environments: From virtual scenarios to the real world

被引:11
|
作者
Charco, Jorge L. [1 ,2 ]
Sappa, Angel D. [2 ,3 ]
Vintimilla, Boris X. [2 ]
Velesaca, Henry O. [2 ]
机构
[1] Univ Guayaquil, Delta & Kennedy Av,PB EC090514, Guayaquil, Ecuador
[2] Escuela Super Politecn Litoral, ESPOL, Campus Gustavo Galindo Km 30-5 Via Perimetral, Guayaquil, Ecuador
[3] Comp Vis Ctr, Edifici O,Campus UAB, Barcelona 08193, Spain
关键词
Relative camera pose estimation; Domain adaptation; Siamese architecture; Synthetic data; Multi-view environments;
D O I
10.1016/j.imavis.2021.104182
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a domain adaptation strategy to efficiently train network architectures for estimating the relative camera pose in multi-view scenarios. The network architectures are fed by a pair of simultaneously acquired images, hence in order to improve the accuracy of the solutions, and due to the lack of large datasets with pairs of overlapped images, a domain adaptation strategy is proposed. The domain adaptation strategy consists on transferring the knowledge learned from synthetic images to real-world scenarios. For this, the networks are firstly trained using pairs of synthetic images, which are captured at the same time by a pair of cameras in a virtual environment; and then, the learned weights of the networks are transferred to the real-world case, where the networks are retrained with a few real images. Different virtual 3D scenarios are generated to evaluate the relationship between the accuracy on the result and the similarity between virtual and real scenarios & mdash;similarity on both geometry of the objects contained in the scene as well as relative pose between camera and objects in the scene. Experimental results and comparisons are provided showing that the accuracy of all the evaluated networks for estimating the camera pose improves when the proposed domain adaptation strategy is used, highlighting the importance on the similarity between virtual-real scenarios. (c) 2021 Published by Elsevier B.V.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Multi-View Camera Pose Estimation for Robotic Arm Manipulation
    Ali, Ihtisham
    Suominen, Olli J.
    Morales, Emilio Ruiz
    Gotchev, Atanas
    IEEE ACCESS, 2020, 8 (08): : 174305 - 174316
  • [2] Multi-View Metal Parts Pose Estimation Based on a Single Camera
    Chen, Chen
    Jiang, Xin
    SENSORS, 2024, 24 (11)
  • [3] Deep learning based camera pose estimation in multi-view environment
    Charco, Jorge L.
    Vintimilla, Boris X.
    Sappa, Angel D.
    2018 14TH INTERNATIONAL CONFERENCE ON SIGNAL IMAGE TECHNOLOGY & INTERNET BASED SYSTEMS (SITIS), 2018, : 224 - 228
  • [4] Simultaneous Multi-View Camera Pose Estimation and Object Tracking With Squared Planar Markers
    Sarmadi, Hamid
    Munoz-Salinas, Rafael
    Berbis, M. A.
    Medina-Carnicer, R.
    IEEE ACCESS, 2019, 7 : 22927 - 22940
  • [5] Multi-view structure-from-motion for hybrid camera scenarios
    Bastanlar, Y.
    Temizel, A.
    Yardimci, Y.
    Sturm, P.
    IMAGE AND VISION COMPUTING, 2012, 30 (08) : 557 - 572
  • [6] Multi-person 3D pose estimation from multi-view without extrinsic camera parameters
    Xu, Daoliang
    Zheng, Tianyou
    Zhang, Yang
    Yang, Xiaodong
    Fu, Weiwei
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 266
  • [7] Single-Camera Multi-View 6DoF pose estimation for robotic grasping
    Yuan, Shuangjie
    Ge, Zhenpeng
    Yang, Lu
    FRONTIERS IN NEUROROBOTICS, 2023, 17
  • [8] Epipolar Transformer for Multi-view Human Pose Estimation
    He, Yihui
    Yan, Rui
    Fragkiadaki, Katerina
    Yu, Shoou-, I
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 4466 - 4471
  • [9] Real-time multi-view face detection and pose estimation in video stream
    Wang, Yan
    Liu, Yanghua
    Tao, Linmi
    Xu, Guangyou
    18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 4, PROCEEDINGS, 2006, : 354 - +
  • [10] Head pose estimation in single- and multi-view environments - Results on the CLEAR'07 benchmarks
    Voit, Michael
    Nickel, Kai
    Stiefelhagen, Rainer
    MULTIMODAL TECHNOLOGIES FOR PERCEPTION OF HUMANS, 2008, 4625 : 307 - 316