Geographic Semantic Network for Cross-View Image Geo-Localization

Cited by: 24
Authors
Zhu, Yingying [1]
Sun, Bin [1]
Lu, Xiufan [1]
Jia, Sen [1]
Affiliations
[1] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518052, Guangdong, Peoples R China
Source
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2022 / Vol. 60
Funding
National Natural Science Foundation of China;
Keywords
Task analysis; Global Positioning System; Semantics; Location awareness; Network architecture; Visualization; Satellites; Batch hard-mining; capsule network; cross-view image matching; image geo-localization;
DOI
10.1109/TGRS.2021.3121337
Chinese Library Classification
P3 [Geophysics]; P59 [Geochemistry];
Subject classification codes
0708; 070902;
Abstract
The task of cross-view image geo-localization aims to determine the geo-location (Global Positioning System (GPS) coordinates) of a query ground-view image by matching the image with GPS-tagged aerial (or satellite) images in the reference dataset. Due to the dramatic domain gap between the ground and aerial images, the problem is challenging. The existing approaches mainly adopt convolutional neural networks (CNNs) to learn discriminative features. However, these CNN-based methods mainly leverage appearance and semantic information but fail to jointly model the appearance, positional, and orientation properties of scene objects, which belong to the spatial hierarchy. Since spatial hierarchy information is crucial for cross-view feature correspondence, in this article, we propose an end-to-end network architecture, dubbed GeoNet. GeoNet consists of a ResNetX module and a GeoCaps module. On the one hand, the ResNetX module is developed to learn powerful intermediate feature maps and allows the stable propagation of gradients in deep CNNs. On the other hand, the GeoCaps module utilizes the capsule network to encapsulate the intermediate feature maps into several capsules, whose length and orientation represent the existence probability and spatial hierarchy information of scene objects, respectively. Moreover, by using a dynamic routing-by-agreement mechanism, the GeoCaps module is capable of modeling parts-to-whole relationships between scene objects, which are viewpoint invariant and can bridge the cross-view domain gap. In addition to GeoNet, we introduce a simple yet effective metric learning method, based on which two weighted soft margin loss functions with online batch hard sample mining are devised. These functions not only speed up convergence but also improve the generalization ability of the network.
Extensive experiments on three well-known datasets demonstrate that our GeoNet not only achieves state-of-the-art results for the ground-to-aerial and aerial-to-ground geo-localization tasks but also outperforms competing approaches for the few-shot geo-localization task.
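Illustrative note (not part of the original record): the abstract mentions weighted soft margin losses with online batch hard sample mining but does not give their exact form. The sketch below is one plausible reading, assuming the commonly used weighted soft-margin triplet form log(1 + exp(alpha * (d_pos - d_neg))) and squared Euclidean distances. The function name, the `alpha` value, and the two-direction split are assumptions made for illustration, not the authors' implementation.

```python
import numpy as np


def batch_hard_soft_margin_loss(ground, aerial, alpha=10.0):
    """Hypothetical sketch: ground[i] and aerial[i] embed a matching pair.

    Both inputs have shape (B, D). Returns one loss per retrieval direction
    (ground-to-aerial, aerial-to-ground).
    """
    # Squared Euclidean distance between every ground/aerial pair: shape (B, B).
    diff = ground[:, None, :] - aerial[None, :, :]
    dist = np.sum(diff ** 2, axis=-1)

    # Distance of each matching (positive) pair lies on the diagonal.
    pos = np.diag(dist)

    # Put +inf on the diagonal so a positive is never mined as a negative.
    masked = dist + np.diag(np.full(len(dist), np.inf))

    # Batch hard mining: keep only the closest non-matching sample.
    hardest_g2a = masked.min(axis=1)  # hardest aerial negative per ground query
    hardest_a2g = masked.min(axis=0)  # hardest ground negative per aerial query

    # Weighted soft margin, log(1 + exp(alpha * (d_pos - d_neg))),
    # evaluated with logaddexp for numerical stability.
    loss_g2a = np.logaddexp(0.0, alpha * (pos - hardest_g2a))
    loss_a2g = np.logaddexp(0.0, alpha * (pos - hardest_a2g))
    return loss_g2a.mean(), loss_a2g.mean()


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    g = rng.normal(size=(4, 8))
    g /= np.linalg.norm(g, axis=1, keepdims=True)
    # Aerial embeddings that nearly match their ground counterparts.
    a = g + 0.01 * rng.normal(size=g.shape)
    print(batch_hard_soft_margin_loss(g, a))
```

In this sketch, a larger `alpha` sharpens the penalty once the hardest negative comes closer than the positive, which is one way such a weight could speed up convergence; whether the paper's two losses correspond to the two retrieval directions shown here is not stated in the abstract.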
Pages: 15
Related papers
50 in total
  • [41] Leveraging cross-view geo-localization with ensemble learning and temporal awareness
    Ghanem, Abdulrahman
    Abdelhay, Ahmed
    Salah, Noor Eldeen
    Nour Eldeen, Ahmed
    Elhenawy, Mohammed
    Masoud, Mahmoud
    Hassan, Ammar M. M.
    Hassan, Abdallah A. A.
    PLOS ONE, 2023, 18 (03):
  • [42] CVM-Net: Cross-View Matching Network for Image-Based Ground-to-Aerial Geo-Localization
    Hu, Sixing
    Feng, Mengdan
    Nguyen, Rang M. H.
    Lee, Gim Hee
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018: 7258 - 7267
  • [43] Cross-View Geo-Localization via Learning Disentangled Geometric Layout Correspondence
    Zhang, Xiaohan
    Li, Xingyu
    Sultani, Waqas
    Zhou, Yi
    Wshah, Safwan
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 3, 2023: 3480 - 3488
  • [44] Road Structure Inspired UGV-Satellite Cross-View Geo-Localization
    Hu, Di
    Yuan, Xia
    Xi, Huiying
    Li, Jie
    Song, Zhenbo
    Xiong, Fengchao
    Zhang, Kai
    Zhao, Chunxia
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 16767 - 16786
  • [45] Road Extraction Assisted Offset Regression Method in Cross-view Image-based Geo-localization
    Hou, Yuxuan
    Yang, Yi
    Wang, Junbo
    Fu, Mengyin
    2022 IEEE 25TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2022: 2934 - 2940
  • [46] HADGEO: IMAGE BASED 3-DOF CROSS-VIEW GEO-LOCALIZATION WITH HARD SAMPLE MINING
    Li, Chaoran
    Yan, Chao
    Xiang, Xiaojia
    Lai, Jun
    Zhou, Han
    Tang, Dengqing
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024: 3520 - 3524
  • [47] Content-Aware Hierarchical Representation Selection for Cross-View Geo-Localization
    Lu, Zeng
    Pu, Tao
    Chen, Tianshui
    Lin, Liang
    COMPUTER VISION - ACCV 2022, PT V, 2023, 13845 : 267 - 280
  • [48] CV-Cities: Advancing Cross-View Geo-Localization in Global Cities
    Huang, Gaoshuang
    Zhou, Yang
    Zhao, Luying
    Gan, Wenjian
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2025, 18 : 1592 - 1606
  • [49] Patch Similarity Self-Knowledge Distillation for Cross-View Geo-Localization
    Li, Songlian
    Hu, Min
    Xiao, Xiongwu
    Tu, Zhigang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (06) : 5091 - 5103
  • [50] Each Part Matters: Local Patterns Facilitate Cross-View Geo-Localization
    Wang, Tingyu
    Zheng, Zhedong
    Yan, Chenggang
    Zhang, Jiyong
    Sun, Yaoqi
    Zheng, Bolun
    Yang, Yi
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (02) : 867 - 879