Geographic Semantic Network for Cross-View Image Geo-Localization

被引:24
|
作者
Zhu, Yingying [1 ]
Sun, Bin [1 ]
Lu, Xiufan [1 ]
Jia, Sen [1 ]
机构
[1] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518052, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
Task analysis; Global Positioning System; Semantics; Location awareness; Network architecture; Visualization; Satellites; Batch hard-mining; capsule network; cross-view image matching; image geo-localization;
D O I
10.1109/TGRS.2021.3121337
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
The task of cross-view image geo-localization aims to determine the geo-location (Global Positioning System (GPS) coordinates) of a query ground-view image by matching the image with GPS-tagged aerial (or satellite) images in the reference dataset. Due to the dramatic domain gap between the ground and aerial images, the problem is challenging. The existing approaches mainly adopt convolutional neural networks (CNNs) to learn discriminative features. However, these CNN-based methods mainly leverage appearance and semantic information but fail to jointly model the appearance, positional, and orientation properties of scene objects, which belong to the spatial hierarchy. Since spatial hierarchy information is crucial for cross-view feature correspondence, in this article, we propose an end-to-end network architecture, dubbed GeoNet. GeoNet consists of a ResNetX module and a GeoCaps module. On the one hand, the ResNetX module is developed to learn powerful intermediate feature maps and allows the stable propagation of gradients in deep CNNs. On the other hand, the GeoCaps module utilizes the capsule network to encapsulate the intermediate feature maps into several capsules, whose length and orientation represent the existence probability and spatial hierarchy information of scene objects, respectively. Moreover, by using a dynamic routing-by-agreement mechanism, the GeoCaps module is capable of modeling parts-to-whole relationships between scene objects, which is viewpoint invariant and capable of bridging the cross-view domain gap. In addition to GeoNet, we introduce a simple yet effective metric learning method, based on which two weighted soft margin loss functions with online batch hard sample mining are devised. These functions not only speed up convergence but also improve the generalization ability of the network. Extensive experiments on three well-known datasets demonstrate that our GeoNet not only achieves state-of-the-art results for the ground-to-aerial and aerial-to-ground geo-localization tasks but also outperforms competing approaches for the few-shot geo-localization task.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Cross-View Image Sequence Geo-localization
    Zhang, Xiaohan
    Sultani, Waqas
    Wshah, Safwan
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 2913 - 2922
  • [2] AENet: attention efficient network for cross-view image geo-localization
    Xu, Jingqian
    Zhu, Ma
    Qi, Baojun
    Li, Jiangshan
    Yang, Chunfang
    ELECTRONIC RESEARCH ARCHIVE, 2023, 31 (07): : 4119 - 4138
  • [3] Cross-View Geo-Localization: A Survey
    Durgam, Abhilash
    Paheding, Sidike
    Dhiman, Vikas
    Devabhaktuni, Vijay
    IEEE ACCESS, 2024, 12 : 192028 - 192050
  • [4] Dual Path Network for Cross-view Geo-Localization
    Dong, Leyi
    Wang, Yuhui
    Huang, Junshi
    Qian, Xueming
    Fan, Mingyuan
    Lai, Shenqi
    PROCEEDINGS OF THE 2023 WORKSHOP ON UAVS IN MULTIMEDIA: CAPTURING THE WORLD FROM A NEW PERSPECTIVE, UAVM 2023, 2023, : 45 - 49
  • [5] Cross-View Image Matching for Geo-localization in Urban Environments
    Tian, Yicong
    Chen, Chen
    Shah, Mubarak
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1998 - 2006
  • [6] Optimal Feature Transport for Cross-View Image Geo-Localization
    Shi, Yujiao
    Yu, Xin
    Liu, Liu
    Zhang, Tong
    Li, Hongdong
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11990 - 11997
  • [7] Perceptual Feature Fusion Network for Cross-View Geo-Localization
    Wang, Jiayi
    Chen, Ziyang
    Yuan, Xiaochen
    Zhao, Genping
    Computer Engineering and Applications, 60 (03): : 255 - 262
  • [8] GAMa: Cross-View Video Geo-Localization
    Vyas, Shruti
    Chen, Chen
    Shah, Mubarak
    COMPUTER VISION, ECCV 2022, PT XXXVII, 2022, 13697 : 440 - 456
  • [9] Cross-view geo-localization with evolving transformer
    Yang, Hongji
    Lu, Xiufan
    Zhu, Yingying
    arXiv, 2021,
  • [10] A Cross-View Geo-Localization Algorithm Using UAV Image and Satellite Image
    Fan, Jiqi
    Zheng, Enhui
    He, Yufei
    Yang, Jianxing
    SENSORS, 2024, 24 (12)