Visual Camera Re-Localization From RGB and RGB-D Images Using DSAC

被引:99
|
作者
Brachmann, Eric [1 ]
Rother, Carsten [2 ]
机构
[1] Niantic, San Francisco, CA 94104 USA
[2] Heidelberg Univ, Visual Learning Lab, D-69117 Heidelberg, Germany
基金
欧洲研究理事会;
关键词
Cameras; Training; Three-dimensional displays; Visualization; Optimization; Neural networks; Solid modeling; Camera re-localization; pose estimation; differentiable RANSAC; DSAC; differentiable argmax; differentiable PnP;
D O I
10.1109/TPAMI.2021.3070754
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We describe a learning-based system that estimates the camera position and orientation from a single input image relative to a known environment. The system is flexible w.r.t. the amount of information available at test and at training time, catering to different applications. Input images can be RGB-D or RGB, and a 3D model of the environment can be utilized for training but is not necessary. In the minimal case, our system requires only RGB images and ground truth poses at training time, and it requires only a single RGB image at test time. The framework consists of a deep neural network and fully differentiable pose optimization. The neural network predicts so called scene coordinates, i.e., dense correspondences between the input image and 3D scene space of the environment. The pose optimization implements robust fitting of pose parameters using differentiable RANSAC (DSAC) to facilitate end-to-end training. The system, an extension of DSAC++ and referred to as DSAC*, achieves state-of-the-art accuracy on various public datasets for RGB-based re-localization, and competitive accuracy for RGB-D based re-localization.
引用
收藏
页码:5847 / 5865
页数:19
相关论文
共 50 条
  • [41] Visual Saliency Detection for RGB-D Images with Generative Model
    Wang, Song-Tao
    Zhou, Zhen
    Qu, Han-Bing
    Li, Bin
    COMPUTER VISION - ACCV 2016, PT V, 2017, 10115 : 20 - 35
  • [42] RDBN: Visual relationship detection with inaccurate RGB-D images
    Liu, Xiaozhou
    Gan, Ming-Gang
    KNOWLEDGE-BASED SYSTEMS, 2020, 204
  • [43] Simultaneous localization and mapping using an RGB-D camera for autonomous mobile robot navigation
    Macias, Luis Rodolfo
    Orozco-Rosas, Ulises
    Picos, Kenia
    OPTICS AND PHOTONICS FOR INFORMATION PROCESSING XV, 2021, 11841
  • [44] Target Localization using RGB-D Camera and LiDAR Sensor Fusion for Relative Navigation
    Song, Ha-ryong
    Choi, Won-sub
    Lim, Seong-min
    Kim, Hae-dong
    2014 CACS INTERNATIONAL AUTOMATIC CONTROL CONFERENCE (CACS 2014), 2014, : 144 - 149
  • [45] Sparse Direct Robot Localization Method Based on RGB-D Camera
    Hou, Rongbo
    Wei, Wu
    Yao, Yeboah
    Huang, Ting
    PROCEEDINGS OF THE 2017 2ND INTERNATIONAL CONFERENCE ON ELECTRICAL, CONTROL AND AUTOMATION ENGINEERING (ECAE 2017), 2017, 140 : 178 - 186
  • [46] A Survey of the Simultaneous Localization and Mapping (Slam) Based on Rgb-D Camera
    Zhang, Zhifan
    Liu, Mengna
    Diao, Chen
    Chen, Shengyong
    2019 2ND INTERNATIONAL CONFERENCE ON MECHANICAL, ELECTRONIC AND ENGINEERING TECHNOLOGY (MEET 2019), 2019, : 48 - 58
  • [47] Efficient Scene Simulation for Robust Monte Carlo Localization using an RGB-D Camera
    Fallon, Maurice F.
    Johannsson, Hordur
    Leonard, John J.
    2012 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2012, : 1663 - 1670
  • [48] Visualization of Temperature Change Using RGB-D Camera and Thermal Camera
    Nakagawa, Wataru
    Matsumoto, Kazuki
    de Sorbier, Francois
    Sugimoto, Maki
    Saito, Hideo
    Senda, Shuji
    Shibata, Takashi
    Iketani, Akihiko
    COMPUTER VISION - ECCV 2014 WORKSHOPS, PT I, 2015, 8925 : 386 - 400
  • [49] Real-Time Visual Odometry from Dense RGB-D Images
    Steinbruecker, Frank
    Sturm, Juergen
    Cremers, Daniel
    2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCV WORKSHOPS), 2011,
  • [50] Scene Coordinate Regression Forests for Camera Relocalization in RGB-D Images
    Shotton, Jamie
    Glocker, Ben
    Zach, Christopher
    Izadi, Shahram
    Criminisi, Antonio
    Fitzgibbon, Andrew
    2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 2930 - 2937