VirtualLoc: Large-scale Visual Localization Using Virtual Images

被引：1

作者：

Xiong, Yuan ^{[1
]}

Wang, Jingru ^{[1
]}

Zhou, Zhong ^{[1
,2
,3
]}

机构：

[1] Beihang Univ, State Key Lab Virtual Real Technol & Syst, Beijing 100191, Peoples R China

[2] State Key Lab Virtual Real Technol & Syst, 37 Xueyuan Rd, Beijing 100191, Peoples R China

[3] Zhongguancun Lab, 37 Xueyuan Rd, Beijing 100191, Peoples R China

来源：

ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS | 2024年 / 20卷 / 03期

关键词：

Visual localization; virtual reality; image retrieval; rendering;

D O I：

10.1145/3622788

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Robust and accurate camera pose estimation is fundamental in computer vision. Learning-based regression approaches acquire six-degree-of-freedom camera parameters accurately from visual cues of an input image. However, most are trained on street-view and landmark datasets. These approaches can hardly be generalized to overlooking use cases, such as the calibration of the surveillance camera and unmanned aerial vehicle. Besides, reference images captured from the real world are rare and expensive, and their diversity is not guaranteed. In this article, we address the problem of using alternative virtual images for visual localization training. This work has the following principle contributions: First, we present a new challenging localization dataset containing six reconstructed large-scale three-dimensional scenes, 10,594 calibrated photographs with condition changes, and 300k virtual images with pixelwise labeled depth, relative surface normal, and semantic segmentation. Second, we present a flexible multi-feature fusion network trained on virtual image datasets for robust image retrieval. Third, we propose an end-to-end confidence map prediction network for feature filtering and pose estimation. We demonstrate that large-scale rendered virtual images are beneficial to visual localization. Using virtual images can solve the diversity problem of real images and leverage labeled multi-feature data for deep learning. Experimental results show that our method achieves remarkable performance surpassing state-of-the-art approaches. To foster research on improvement for visual localization using synthetic images, we release our benchmark at https://github.com/YuanXiong/contributions.

引用

页数：19

共 50 条

[41] Are Large-Scale 3D Models Really Necessary for Accurate Visual Localization?
Torii, Akihiko
Taira, Hajime
Sivic, Josef
Pollefeys, Marc
Okutomi, Masatoshi
Pajdla, Tomas
Sattler, Torsten
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (03) : 814 - 829
[42] Accurate and Robust Visual Localization System in Large-Scale Appearance-Changing Environments
Yu, Yang
Yun, Peng
Xue, Bohuan
Jiao, Jianhao
Fan, Rui
Liu, Ming
IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2022, 27 (06) : 5222 - 5232
[43] Are Large-Scale 3D Models Really Necessary for Accurate Visual Localization?
Sattler, Torsten
Torii, Akihiko
Sivic, Josef
Pollefeys, Marc
Taira, Hajime
Okutomi, Masatoshi
Pajdla, Tomas
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6175 - 6184
[44] A large-scale dataset for indoor visual localization with high-precision ground truth
Liu, Yuchen
Gao, Wei
Hu, Zhanyi
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2022, 41 (02): : 129 - 135
[45] Harvesting Mid-level Visual Concepts from Large-scale Internet Images
Li, Quannan
Wu, Jiajun
Tul, Zhuowen
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 851 - 858
[46] Generating Visual Concept Network from Large-Scale Weakly-Tagged Images
Yang, Chunlei
Luo, Hangzai
Fan, Jianping
ADVANCES IN MULTIMEDIA MODELING, PROCEEDINGS, 2010, 5916 : 251 - +
[47] iVAR: Interactive Visual Analytics of Radiomics Features from Large-Scale Medical Images
Yu, Lina
Jiang, Hengle
Yu, Hongfeng
Zhang, Chi
Mcallister, Josiah
Zheng, Dandan
2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 3916 - 3923
[48] Region Registration of Large-Scale IR/Visual Images Based on Improved SC Algorithm
Zhang Xiaoqian
Li Junshan
Zhang Zhongmin
Du Yonghong
7TH INTERNATIONAL SYMPOSIUM ON ADVANCED OPTICAL MANUFACTURING AND TESTING TECHNOLOGIES: OPTICAL TEST AND MEASUREMENT TECHNOLOGY AND EQUIPMENT, 2014, 9282
[49] warpDOCK: Large-Scale Virtual Drug Discovery Using Cloud Infrastructure
McDougal, Daniel P.
Rajapaksha, Harinda
Pederick, Jordan L.
Bruning, John B.
ACS OMEGA, 2023, 8 (32): : 29143 - 29149
[50] The WALKABOUT: Using virtual environments to assess large-scale spatial abilities
Waller, D
COMPUTERS IN HUMAN BEHAVIOR, 2005, 21 (02) : 243 - 253

← 1 2 3 4 5 →