Semantic signatures for large-scale visual localization

被引:0
|
作者
Li Weng
Valérie Gouet-Brunet
Bahman Soheilian
机构
[1] Hangzhou Dianzi University,Department of Automation (Artificial Intelligence)
[2] Univ. Gustave Eiffel,LaSTIG Lab.
[3] ENSG,undefined
[4] IGN,undefined
来源
关键词
Database search; Information retrieval; Visual localization; Semantic feature; Urban computing;
D O I
暂无
中图分类号
学科分类号
摘要
Visual localization is a useful alternative to standard localization techniques. It works by utilizing cameras. In a typical scenario, features are extracted from captured images and compared with geo-referenced databases. Location information is then inferred from the matching results. Conventional schemes mainly use low-level visual features. These approaches offer good accuracy but suffer from scalability issues. In order to assist localization in large urban areas, this work explores a different path by utilizing high-level semantic information. It is found that object information in a street view can facilitate localization. A novel descriptor scheme called “semantic signature” is proposed to summarize this information. A semantic signature consists of type and angle information of visible objects at a spatial location. Several metrics and protocols are proposed for signature comparison and retrieval. They illustrate different trade-offs between accuracy and complexity. Extensive simulation results confirm the potential of the proposed scheme in large-scale applications. This paper is an extended version of a conference paper in CBMI’18. A more efficient retrieval protocol is presented with additional experiment results.
引用
收藏
页码:22347 / 22372
页数:25
相关论文
共 50 条
  • [31] VISEL: A visual and magnetic fusion-based large-scale indoor localization system with improved high-precision semantic maps
    Li, Ning
    Tu, Weiping
    Ai, Haojun
    Deng, Huimin
    Tao, Jingjie
    Hu, Tan
    Sun, Xu
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (10) : 7992 - 8020
  • [32] Semantic tagging for large-scale content management
    Chen, Liming
    Roberts, Craig
    PROCEEDINGS OF THE IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE: WI 2007, 2007, : 478 - 481
  • [33] Semantic Integrity in Large-Scale Online Simulations
    Jha, Somesh
    Katzenbeisser, Stefan
    Schallhart, Christian
    Veith, Helmut
    Chenney, Stephen
    ACM TRANSACTIONS ON INTERNET TECHNOLOGY, 2010, 10 (01)
  • [34] Semantic Representation For Navigation In Large-Scale Environments
    Drouilly, Romain
    Rives, Patrick
    Morisset, Benoit
    2015 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2015, : 1106 - 1111
  • [35] A Visual Backchannel for Large-Scale Events
    Doerk, Marian
    Gruen, Daniel
    Williamson, Carey
    Carpendale, Sheelagh
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2010, 16 (06) : 1129 - 1138
  • [36] Large-Scale Visual Font Recognition
    Chen, Guang
    Yang, Jianchao
    Jin, Hailin
    Brandt, Jonathan
    Shechtman, Eli
    Agarwala, Aseem
    Han, Tony X.
    2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 3598 - 3605
  • [37] Large-Scale Visual Relationship Understanding
    Zhang, Ji
    Kalantidis, Yannis
    Rohrbach, Marcus
    Paluri, Manohar
    Elgammal, Ahmed
    Elhoseiny, Mohamed
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 9185 - 9194
  • [38] Large-Scale Visual Speech Recognition
    Shillingford, Brendan
    Assael, Yannis
    Hoffman, Matthew W.
    Paine, Thomas
    Hughes, Cian
    Prabhu, Utsav
    Liao, Hank
    Sak, Hasim
    Rao, Kanishka
    Bennett, Lorrayne
    Mulville, Marie
    Denil, Misha
    Coppin, Ben
    Laurie, Ben
    Senior, Andrew
    de Freitas, Nando
    INTERSPEECH 2019, 2019, : 4135 - 4139
  • [39] Large-Scale Visual Data Analysis
    Johnson, Chris
    2012 IEEE 26TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2012, : 1 - 1
  • [40] Are Large-Scale 3D Models Really Necessary for Accurate Visual Localization?
    Torii, Akihiko
    Taira, Hajime
    Sivic, Josef
    Pollefeys, Marc
    Okutomi, Masatoshi
    Pajdla, Tomas
    Sattler, Torsten
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (03) : 814 - 829