Visual Camera Re-Localization From RGB and RGB-D Images Using DSAC

被引:99
|
作者
Brachmann, Eric [1 ]
Rother, Carsten [2 ]
机构
[1] Niantic, San Francisco, CA 94104 USA
[2] Heidelberg Univ, Visual Learning Lab, D-69117 Heidelberg, Germany
基金
欧洲研究理事会;
关键词
Cameras; Training; Three-dimensional displays; Visualization; Optimization; Neural networks; Solid modeling; Camera re-localization; pose estimation; differentiable RANSAC; DSAC; differentiable argmax; differentiable PnP;
D O I
10.1109/TPAMI.2021.3070754
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We describe a learning-based system that estimates the camera position and orientation from a single input image relative to a known environment. The system is flexible w.r.t. the amount of information available at test and at training time, catering to different applications. Input images can be RGB-D or RGB, and a 3D model of the environment can be utilized for training but is not necessary. In the minimal case, our system requires only RGB images and ground truth poses at training time, and it requires only a single RGB image at test time. The framework consists of a deep neural network and fully differentiable pose optimization. The neural network predicts so called scene coordinates, i.e., dense correspondences between the input image and 3D scene space of the environment. The pose optimization implements robust fitting of pose parameters using differentiable RANSAC (DSAC) to facilitate end-to-end training. The system, an extension of DSAC++ and referred to as DSAC*, achieves state-of-the-art accuracy on various public datasets for RGB-based re-localization, and competitive accuracy for RGB-D based re-localization.
引用
收藏
页码:5847 / 5865
页数:19
相关论文
共 50 条
  • [21] Modeling Hair from an RGB-D Camera
    Zhang, Meng
    Wu, Pan
    Wu, Hongzhi
    Weng, Yanlin
    Zheng, Youyi
    Zhou, Kun
    ACM TRANSACTIONS ON GRAPHICS, 2018, 37 (06):
  • [22] RGB-D Object Recognition Using the Knowledge Transferred from Relevant RGB Images
    Gao, Depeng
    Wu, Rui
    Liu, Jiafeng
    Huang, Qingcheng
    Tang, Xianglong
    Liu, Peng
    NEURAL INFORMATION PROCESSING (ICONIP 2017), PT VI, 2017, 10639 : 642 - 651
  • [23] Tomato segmentation and localization method based on RGB-D camera
    Malik, Muhammad Hammad
    Qiu, Ruicheng
    Gao, Yang
    Zhang, Man
    Li, Han
    Li, Minzan
    International Agricultural Engineering Journal, 2019, 28 (04): : 278 - 287
  • [24] Simultaneous Localization and Appearance Estimation with a Consumer RGB-D Camera
    Wu, Hongzhi
    Wang, Zhaotian
    Zhou, Kun
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2016, 22 (08) : 2012 - 2023
  • [25] A New VLC Localization System with the Assistance of RGB-D Camera
    Zheng, Xiaolong
    Yang, Chuanchuan
    Kou, Hong
    Wang, Ziyu
    PROCEEDINGS OF 5TH IEEE CONFERENCE ON UBIQUITOUS POSITIONING, INDOOR NAVIGATION AND LOCATION-BASED SERVICES (UPINLBS), 2018, : 266 - 270
  • [26] Graph-Based Visual SLAM and Visual Odometry Using an RGB-D Camera
    Kluessendorff, Jan Helge
    Hartmann, Jan
    Forouher, Dariush
    Maehle, Erik
    2013 9TH INTERNATIONAL WORKSHOP ON ROBOT MOTION AND CONTROL (ROMOCO), 2013, : 288 - 293
  • [27] Evaluation of Recent Approaches to Visual Odometry from RGB-D Images
    Alexandrov, Sergey
    Herpers, Rainer
    ROBOCUP 2013: ROBOT WORLD CUP XVII, 2014, 8371 : 444 - 455
  • [28] REFLECTION REMOVAL USING RGB-D IMAGES
    Shibata, Toshihiro
    Akai, Yuji
    Matsuoka, Ryo
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 1862 - 1866
  • [29] Edge and Intensity based Visual Odometry for RGB-D Camera
    Yao, Erliang
    Zhang, Hexin
    Zhang, Guoliang
    Xu, Hui
    2018 IEEE CSAA GUIDANCE, NAVIGATION AND CONTROL CONFERENCE (CGNCC), 2018,
  • [30] Continuous Direct Sparse Visual Odometry from RGB-D Images
    Ghaffari, Maani
    Clark, William
    Bloch, Anthony
    Eustice, Ryan M.
    Grizzle, Jessy W.
    ROBOTICS: SCIENCE AND SYSTEMS XV, 2019,