Geodesic Based Image Matching Network for the Multi-scale Ground to Aerial Geo-localization

被引:0
|
作者
Amit, Rasna A. [1 ]
Mohan, C. Krishna [1 ]
机构
[1] Indian Inst Technol Hyderabad, Sangareddy, Telangana, India
关键词
ROBOT LOCALIZATION;
D O I
10.1109/AERO55745.2023.10115935
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
Airport surveillance activities using remote sensing images are challenging due to object variations largely affecting the geo-localization and object detection/segmentation tasks. Furthermore, the problem of localization is even larger due to scale variations. Traditionally image-based geo-referencing is accomplished by superimposing ground positioning system (GPS) location to the queried image. It is also observed both the query and the geo-tagged reference images are taken from the same ground view or aerial height in the case of remote sensing images. In our research, we intend to revisit the scale effect on object variability, by introducing the concept of geodesic representations along with image-matching networks. The architecture pipeline introduces a data processing layer wherein objects are geo-referenced to generate the metadata information. This metadata consists of three-dimensional data including the orientation information of the object. A regression task is added to the training set which leverages the metadata information. We use the gradient weighted class activation maps (Grad-CAM) to generate the activation maps and selection based on high threshold values for the pixel. The orientations and the locations are further calculated using the geodesic representations. The baseline architecture for local feature extraction uses a simple Siamese network with a ResNet backbone network. A NetVLAD layer is used to generate the global features. We also introduce a Geospatial attention network (GsAN) to aid in enhanced localization of objects. The dataset used for experiments consisted of CVUSA and our custom dataset providing airport runway views for different scales and arbitrary orientations. The performance evaluations focused on recall as a retrieval metric and comparing various loss functions. The performance metrics indicate a higher accuracy rate.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] Image-Based Geo-Localization Using Satellite Imagery
    Sixing Hu
    Gim Hee Lee
    International Journal of Computer Vision, 2020, 128 : 1205 - 1219
  • [32] Rethinking Pooling for Multi-Granularity Features in Aerial-View Geo-Localization
    Wang, Tingyu
    Yang, Zihao
    Chen, Quan
    Sun, Yaoqi
    Yan, Chenggang
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 3005 - 3009
  • [33] Joint Saliency Estimation and Matching using Image Regions for Geo-Localization of Online Video
    Shi, Haoyue
    Chen, Jia
    Hauptmann, Alexander G.
    PROCEEDINGS OF THE 2017 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR'17), 2017, : 388 - 396
  • [34] OBTPN: A Vision-Based Network for UAV Geo-Localization in Multi-Altitude Environments
    Chen, Nanxing
    Fan, Jiqi
    Yuan, Jiayu
    Zheng, Enhui
    DRONES, 2025, 9 (01)
  • [35] Progressive matching method of aerial-ground remote sensing image via multi-scale context feature coding
    Xu, Chuan
    Xu, Junjie
    Huang, Tao
    Zhang, Huan
    Mei, Liye
    Zhang, Xia
    Duan, Yu
    Yang, Wei
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2023, 44 (19) : 5876 - 5895
  • [36] Geo-Localization via Ground-to-Satellite Cross-View Image Retrieval
    Zeng, Zelong
    Wang, Zheng
    Yang, Fan
    Satoh, Shin'ichi
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 2176 - 2188
  • [37] UAV Geo-Localization Dataset and Method Based on Cross-View Matching
    Yao, Yuwen
    Sun, Cheng
    Wang, Tao
    Yang, Jianxing
    Zheng, Enhui
    SENSORS, 2024, 24 (21)
  • [38] AENet: attention efficient network for cross-view image geo-localization
    Xu, Jingqian
    Zhu, Ma
    Qi, Baojun
    Li, Jiangshan
    Yang, Chunfang
    ELECTRONIC RESEARCH ARCHIVE, 2023, 31 (07): : 4119 - 4138
  • [39] Where in the World Is This Image? Transformer-Based Geo-localization in the Wild
    Pramanick, Shraman
    Nowara, Ewa M.
    Gleason, Joshua
    Castillo, Carlos D.
    Chellappa, Rama
    COMPUTER VISION, ECCV 2022, PT XXXVIII, 2022, 13698 : 196 - 215
  • [40] Multi-scale motivated neural network for image-text matching
    Qin, Xueyang
    Li, Lishuang
    Pang, Guangyao
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (2) : 4383 - 4407