Visual Camera Re-Localization From RGB and RGB-D Images Using DSAC

被引：99

作者：

Brachmann, Eric ^{[1
]}

Rother, Carsten ^{[2
]}

机构：

[1] Niantic, San Francisco, CA 94104 USA

[2] Heidelberg Univ, Visual Learning Lab, D-69117 Heidelberg, Germany

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2022年 / 44卷 / 09期

基金：

欧洲研究理事会;

关键词：

Cameras; Training; Three-dimensional displays; Visualization; Optimization; Neural networks; Solid modeling; Camera re-localization; pose estimation; differentiable RANSAC; DSAC; differentiable argmax; differentiable PnP;

D O I：

10.1109/TPAMI.2021.3070754

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We describe a learning-based system that estimates the camera position and orientation from a single input image relative to a known environment. The system is flexible w.r.t. the amount of information available at test and at training time, catering to different applications. Input images can be RGB-D or RGB, and a 3D model of the environment can be utilized for training but is not necessary. In the minimal case, our system requires only RGB images and ground truth poses at training time, and it requires only a single RGB image at test time. The framework consists of a deep neural network and fully differentiable pose optimization. The neural network predicts so called scene coordinates, i.e., dense correspondences between the input image and 3D scene space of the environment. The pose optimization implements robust fitting of pose parameters using differentiable RANSAC (DSAC) to facilitate end-to-end training. The system, an extension of DSAC++ and referred to as DSAC*, achieves state-of-the-art accuracy on various public datasets for RGB-based re-localization, and competitive accuracy for RGB-D based re-localization.

引用

页码：5847 / 5865

页数：19

共 50 条

[21] Modeling Hair from an RGB-D Camera
Zhang, Meng
Wu, Pan
Wu, Hongzhi
Weng, Yanlin
Zheng, Youyi
Zhou, Kun
ACM TRANSACTIONS ON GRAPHICS, 2018, 37 (06):
[22] RGB-D Object Recognition Using the Knowledge Transferred from Relevant RGB Images
Gao, Depeng
Wu, Rui
Liu, Jiafeng
Huang, Qingcheng
Tang, Xianglong
Liu, Peng
NEURAL INFORMATION PROCESSING (ICONIP 2017), PT VI, 2017, 10639 : 642 - 651
[23] Tomato segmentation and localization method based on RGB-D camera
Malik, Muhammad Hammad
Qiu, Ruicheng
Gao, Yang
Zhang, Man
Li, Han
Li, Minzan
International Agricultural Engineering Journal, 2019, 28 (04): : 278 - 287
[24] Simultaneous Localization and Appearance Estimation with a Consumer RGB-D Camera
Wu, Hongzhi
Wang, Zhaotian
Zhou, Kun
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2016, 22 (08) : 2012 - 2023
[25] A New VLC Localization System with the Assistance of RGB-D Camera
Zheng, Xiaolong
Yang, Chuanchuan
Kou, Hong
Wang, Ziyu
PROCEEDINGS OF 5TH IEEE CONFERENCE ON UBIQUITOUS POSITIONING, INDOOR NAVIGATION AND LOCATION-BASED SERVICES (UPINLBS), 2018, : 266 - 270
[26] Graph-Based Visual SLAM and Visual Odometry Using an RGB-D Camera
Kluessendorff, Jan Helge
Hartmann, Jan
Forouher, Dariush
Maehle, Erik
2013 9TH INTERNATIONAL WORKSHOP ON ROBOT MOTION AND CONTROL (ROMOCO), 2013, : 288 - 293
[27] Evaluation of Recent Approaches to Visual Odometry from RGB-D Images
Alexandrov, Sergey
Herpers, Rainer
ROBOCUP 2013: ROBOT WORLD CUP XVII, 2014, 8371 : 444 - 455
[28] REFLECTION REMOVAL USING RGB-D IMAGES
Shibata, Toshihiro
Akai, Yuji
Matsuoka, Ryo
2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 1862 - 1866
[29] Edge and Intensity based Visual Odometry for RGB-D Camera
Yao, Erliang
Zhang, Hexin
Zhang, Guoliang
Xu, Hui
2018 IEEE CSAA GUIDANCE, NAVIGATION AND CONTROL CONFERENCE (CGNCC), 2018,
[30] Continuous Direct Sparse Visual Odometry from RGB-D Images
Ghaffari, Maani
Clark, William
Bloch, Anthony
Eustice, Ryan M.
Grizzle, Jessy W.
ROBOTICS: SCIENCE AND SYSTEMS XV, 2019,

← 1 2 3 4 5 →