VirtualLoc: Large-scale Visual Localization Using Virtual Images

被引:1
|
作者
Xiong, Yuan [1 ]
Wang, Jingru [1 ]
Zhou, Zhong [1 ,2 ,3 ]
机构
[1] Beihang Univ, State Key Lab Virtual Real Technol & Syst, Beijing 100191, Peoples R China
[2] State Key Lab Virtual Real Technol & Syst, 37 Xueyuan Rd, Beijing 100191, Peoples R China
[3] Zhongguancun Lab, 37 Xueyuan Rd, Beijing 100191, Peoples R China
关键词
Visual localization; virtual reality; image retrieval; rendering;
D O I
10.1145/3622788
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Robust and accurate camera pose estimation is fundamental in computer vision. Learning-based regression approaches acquire six-degree-of-freedom camera parameters accurately from visual cues of an input image. However, most are trained on street-view and landmark datasets. These approaches can hardly be generalized to overlooking use cases, such as the calibration of the surveillance camera and unmanned aerial vehicle. Besides, reference images captured from the real world are rare and expensive, and their diversity is not guaranteed. In this article, we address the problem of using alternative virtual images for visual localization training. This work has the following principle contributions: First, we present a new challenging localization dataset containing six reconstructed large-scale three-dimensional scenes, 10,594 calibrated photographs with condition changes, and 300k virtual images with pixelwise labeled depth, relative surface normal, and semantic segmentation. Second, we present a flexible multi-feature fusion network trained on virtual image datasets for robust image retrieval. Third, we propose an end-to-end confidence map prediction network for feature filtering and pose estimation. We demonstrate that large-scale rendered virtual images are beneficial to visual localization. Using virtual images can solve the diversity problem of real images and leverage labeled multi-feature data for deep learning. Experimental results show that our method achieves remarkable performance surpassing state-of-the-art approaches. To foster research on improvement for visual localization using synthetic images, we release our benchmark at https://github.com/YuanXiong/contributions.
引用
收藏
页数:19
相关论文
共 50 条
  • [41] Are Large-Scale 3D Models Really Necessary for Accurate Visual Localization?
    Torii, Akihiko
    Taira, Hajime
    Sivic, Josef
    Pollefeys, Marc
    Okutomi, Masatoshi
    Pajdla, Tomas
    Sattler, Torsten
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (03) : 814 - 829
  • [42] Accurate and Robust Visual Localization System in Large-Scale Appearance-Changing Environments
    Yu, Yang
    Yun, Peng
    Xue, Bohuan
    Jiao, Jianhao
    Fan, Rui
    Liu, Ming
    IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2022, 27 (06) : 5222 - 5232
  • [43] Are Large-Scale 3D Models Really Necessary for Accurate Visual Localization?
    Sattler, Torsten
    Torii, Akihiko
    Sivic, Josef
    Pollefeys, Marc
    Taira, Hajime
    Okutomi, Masatoshi
    Pajdla, Tomas
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6175 - 6184
  • [44] A large-scale dataset for indoor visual localization with high-precision ground truth
    Liu, Yuchen
    Gao, Wei
    Hu, Zhanyi
    INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2022, 41 (02): : 129 - 135
  • [45] Harvesting Mid-level Visual Concepts from Large-scale Internet Images
    Li, Quannan
    Wu, Jiajun
    Tul, Zhuowen
    2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 851 - 858
  • [46] Generating Visual Concept Network from Large-Scale Weakly-Tagged Images
    Yang, Chunlei
    Luo, Hangzai
    Fan, Jianping
    ADVANCES IN MULTIMEDIA MODELING, PROCEEDINGS, 2010, 5916 : 251 - +
  • [47] iVAR: Interactive Visual Analytics of Radiomics Features from Large-Scale Medical Images
    Yu, Lina
    Jiang, Hengle
    Yu, Hongfeng
    Zhang, Chi
    Mcallister, Josiah
    Zheng, Dandan
    2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 3916 - 3923
  • [48] Region Registration of Large-Scale IR/Visual Images Based on Improved SC Algorithm
    Zhang Xiaoqian
    Li Junshan
    Zhang Zhongmin
    Du Yonghong
    7TH INTERNATIONAL SYMPOSIUM ON ADVANCED OPTICAL MANUFACTURING AND TESTING TECHNOLOGIES: OPTICAL TEST AND MEASUREMENT TECHNOLOGY AND EQUIPMENT, 2014, 9282
  • [49] warpDOCK: Large-Scale Virtual Drug Discovery Using Cloud Infrastructure
    McDougal, Daniel P.
    Rajapaksha, Harinda
    Pederick, Jordan L.
    Bruning, John B.
    ACS OMEGA, 2023, 8 (32): : 29143 - 29149
  • [50] The WALKABOUT: Using virtual environments to assess large-scale spatial abilities
    Waller, D
    COMPUTERS IN HUMAN BEHAVIOR, 2005, 21 (02) : 243 - 253