Visual Camera Re-Localization From RGB and RGB-D Images Using DSAC

Cited by: 99
Authors
Brachmann, Eric [1]
Rother, Carsten [2]
Affiliations
[1] Niantic, San Francisco, CA 94104 USA
[2] Heidelberg Univ, Visual Learning Lab, D-69117 Heidelberg, Germany
Funding
European Research Council;
Keywords
Cameras; Training; Three-dimensional displays; Visualization; Optimization; Neural networks; Solid modeling; Camera re-localization; pose estimation; differentiable RANSAC; DSAC; differentiable argmax; differentiable PnP;
DOI
10.1109/TPAMI.2021.3070754
Chinese Library Classification
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We describe a learning-based system that estimates the camera position and orientation from a single input image relative to a known environment. The system is flexible with respect to the amount of information available at test and at training time, catering to different applications. Input images can be RGB-D or RGB, and a 3D model of the environment can be utilized for training but is not necessary. In the minimal case, our system requires only RGB images and ground truth poses at training time, and it requires only a single RGB image at test time. The framework consists of a deep neural network and fully differentiable pose optimization. The neural network predicts so-called scene coordinates, i.e., dense correspondences between the input image and 3D scene space of the environment. The pose optimization implements robust fitting of pose parameters using differentiable RANSAC (DSAC) to facilitate end-to-end training. The system, an extension of DSAC++ and referred to as DSAC*, achieves state-of-the-art accuracy on various public datasets for RGB-based re-localization, and competitive accuracy for RGB-D based re-localization.
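Illustrative sketch (not part of the record, and not the authors' implementation): the abstract describes robust pose fitting from predicted scene coordinates. For the RGB-D case this can be illustrated with a small NumPy example, assuming each pixel has a camera-space 3D point (from depth) and a predicted scene coordinate. The function names, the sigmoid-based soft inlier score, and the parameters tau, beta and alpha are assumptions made for illustration. The code computes no gradients; it only shows the hypothesize-score-select loop and a softmax distribution over hypotheses.

```python
import numpy as np

def kabsch(cam_pts, scene_pts):
    """Least-squares rigid transform (R, t) such that scene ~ R @ cam + t."""
    cam_mean, scene_mean = cam_pts.mean(axis=0), scene_pts.mean(axis=0)
    H = (cam_pts - cam_mean).T @ (scene_pts - scene_mean)
    U, _, Vt = np.linalg.svd(H)
    # Flip the last axis if needed so that R is a proper rotation (det = +1).
    d = np.sign(np.linalg.det(Vt.T @ U.T))
    R = Vt.T @ np.diag([1.0, 1.0, d]) @ U.T
    t = scene_mean - R @ cam_mean
    return R, t

def soft_inlier_score(R, t, cam_pts, scene_pts, tau=0.1, beta=50.0):
    """Smooth inlier count: a sigmoid stands in for the hard threshold tau."""
    residuals = np.linalg.norm(cam_pts @ R.T + t - scene_pts, axis=1)
    z = np.clip(beta * (residuals - tau), -50.0, 50.0)  # avoid overflow in exp
    return float(np.sum(1.0 / (1.0 + np.exp(z))))

def estimate_pose(cam_pts, scene_pts, n_hyps=64, alpha=0.5, seed=0):
    """Sample minimal sets, score each hypothesis, return the best one."""
    rng = np.random.default_rng(seed)
    hyps, scores = [], []
    for _ in range(n_hyps):
        idx = rng.choice(len(cam_pts), size=3, replace=False)  # minimal set
        R, t = kabsch(cam_pts[idx], scene_pts[idx])
        hyps.append((R, t))
        scores.append(soft_inlier_score(R, t, cam_pts, scene_pts))
    scores = np.asarray(scores)
    # Softmax over scores gives a probability distribution over hypotheses.
    probs = np.exp(alpha * (scores - scores.max()))
    probs /= probs.sum()
    return hyps[int(np.argmax(probs))], probs
```

In this sketch the highest-probability hypothesis is returned directly. A differentiable variant would instead use the distribution probs during training, for example to form an expected pose loss, and would typically refine the selected pose on its inliers.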
Pages: 5847-5865
Page count: 19
Related papers
50 in total
  • [1] Robust Localization Using RGB-D Images
    Oh, Yoonseon
    Oh, Songhwai
    2014 14TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2014), 2014, : 1023 - 1026
  • [2] An end-to-end learning framework for visual camera relocalization using RGB and RGB-D images
    Zhang, Kai
    Meng, Xiaolin
    Wang, Qing
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2024, 35 (09)
  • [3] Mobil robot localization using RGB-D camera
    Somlyai, Laszlo
    IEEE 9TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL CYBERNETICS (ICCC 2013), 2013, : 131 - 136
  • [4] Pallet recognition and localization using an RGB-D camera
    Xiao, Junhao
    Lu, Huimin
    Zhang, Lilian
    Zhang, Jianhua
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2017, 14 (06)
  • [5] From RGB-D Images to RGB Images: Single Labeling for Mining Visual Models
    Zhang, Quanshi
    Song, Xuan
    Shao, Xiaowei
    Zhao, Huijing
    Shibasaki, Ryosuke
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2015, 6 (02)
  • [6] Mobile Robot Localization using Ceiling Landmarks and Images Captured from an RGB-D Camera
    Huang, Wen-Tsai
    Tsai, Chun-Lung
    Lin, Huei-Yung
    2012 IEEE/ASME INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT MECHATRONICS (AIM), 2012, : 855 - 860
  • [7] Visual Recognition in RGB Images and Videos by Learning from RGB-D Data
    Li, Wen
    Chen, Lin
    Xu, Dong
    Van Gool, Luc
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (08) : 2030 - 2036
  • [8] Domain adaptation from RGB-D to RGB images
    Li, Xiao
    Fang, Min
    Zhang, Ju-Jie
    Wu, Jinqiao
    SIGNAL PROCESSING, 2017, 131 : 27 - 35
  • [9] Visual Odometry using RGB-D Camera on Ceiling Vision
    Wang, Han
    Mou, Wei
    Suratno, Hendra
    Seet, Gerald
    Li, Maohai
    Lau, M. W. S.
    Wang, Danwei
    2012 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO 2012), 2012
  • [10] A Novel Hybrid Visual Odometry Using an RGB-D Camera
    Wang, Huiguo
    Wu, Xinyu
    Chen, Zhiheng
    He, Yong
    PROCEEDINGS 2018 33RD YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION (YAC), 2018, : 47 - 51