A Novel Geo-Localization Method for UAV and Satellite Images Using Cross-View Consistent Attention

被引:8
|
作者
Cui, Zhuofan [1 ]
Zhou, Pengwei [1 ]
Wang, Xiaolong [1 ]
Zhang, Zilun [2 ]
Li, Yingxuan [1 ]
Li, Hongbo [3 ]
Zhang, Yu [1 ]
机构
[1] Zhejiang Univ, Coll Control Sci & Engn, State Key Lab Ind Control Technol, Hangzhou 310012, Peoples R China
[2] Zhejiang Univ, Coll Comp Sci, Hangzhou 310012, Peoples R China
[3] Beijing Geekplus Technol Co Ltd, 7-F,Block D,Beijing Cultural & Creat Bldg,30 Beiyu, Beijing 100107, Peoples R China
关键词
geo-localization; UAV; satellite; transformer; cross-view;
D O I
10.3390/rs15194667
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Geo-localization has been widely applied as an important technique to get the longitude and latitude for unmanned aerial vehicle (UAV) navigation in outdoor flight. Due to the possible interference and blocking of GPS signals, the method based on image retrieval, which is less likely to be interfered with, has received extensive attention in recent years. The geo-localization of UAVs and satellites can be achieved by querying pre-obtained satellite images with GPS-tagged and drone images from different perspectives. In this paper, an image transformation technique is used to extract cross-view geo-localization information from UAVs and satellites. A single-stage training method in UAV and satellite geo-localization is first proposed, which simultaneously realizes cross-view feature extraction and image retrieval, and achieves higher accuracy than existing multi-stage training techniques. A novel piecewise soft-margin triplet loss function is designed to avoid model parameters being trapped in suboptimal sets caused by the lack of constraint on positive and negative samples. The results illustrate that the proposed loss function enhances image retrieval accuracy and realizes a better convergence. Moreover, a data augmentation method for satellite images is proposed to overcome the disproportionate numbers of image samples. On the benchmark University-1652, the proposed method achieves the state-of-the-art result with a 6.67% improvement in recall rate (R@1) and 6.13% in average precision (AP). All codes will be publicized to promote reproducibility.
引用
收藏
页数:20
相关论文
共 50 条
  • [31] Cross-View Visual Geo-Localization for Outdoor Augmented Reality
    Mithun, Niluthpol Chowdhury
    Minhas, Kshitij S.
    Chiu, Han-Pang
    Oskiper, Taragay
    Sizintsev, Mikhail
    Samarasekera, Supun
    Kumar, Rakesh
    2023 IEEE CONFERENCE VIRTUAL REALITY AND 3D USER INTERFACES, VR, 2023, : 493 - 502
  • [32] Geographic Semantic Network for Cross-View Image Geo-Localization
    Zhu, Yingying
    Sun, Bin
    Lu, Xiufan
    Jia, Sen
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [33] Cross-view Geo-localization with Layer-to-Layer Transformer
    Yang, Hongji
    Lu, Xiufan
    Zhu, Yingying
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [34] Optimal Feature Transport for Cross-View Image Geo-Localization
    Shi, Yujiao
    Yu, Xin
    Liu, Liu
    Zhang, Tong
    Li, Hongdong
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11990 - 11997
  • [35] An Efficient Method based on Multi-view Semantic Alignment for Cross-view Geo-localization
    Wang, Yifeng
    Xia, Yamei
    Lu, Tianbo
    Zhang, Xiaoyan
    Yao, Wenbin
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [36] CCR: A Counterfactual Causal Reasoning-based Method for Cross-view Geo-localization
    Du H.
    He J.
    Zhao Y.
    IEEE Transactions on Circuits and Systems for Video Technology, 2024, 34 (11) : 1 - 1
  • [37] Enhancing Cross-View Geo-Localization With Domain Alignment and Scene Consistency
    Xia, Panwang
    Wan, Yi
    Zheng, Zhi
    Zhang, Yongjun
    Deng, Jiwei
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (12) : 13271 - 13281
  • [38] Feature Relation Guided Cross-View Image Based Geo-Localization
    Hou, Qingfeng
    Lu, Jun
    Guo, Haitao
    Liu, Xiangyun
    Gong, Zhihui
    Zhu, Kun
    Ping, Yifan
    REMOTE SENSING, 2023, 15 (20)
  • [39] Learning Cross-View Visual Geo-Localization Without Ground Truth
    Li, Haoyuan
    Xu, Chang
    Yang, Wen
    Yu, Huai
    Xia, Gui-Song
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 1
  • [40] Bridging Viewpoints in Cross-View Geo-Localization With Siamese Vision Transformer
    Ahn, Woo-Jin
    Park, So-Yeon
    Pae, Dong-Sung
    Choi, Hyun-Duck
    Lim, Myo-Taeg
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 1