Dual-attention-transformer-based semantic reranking for large-scale image localization

被引:0
|
作者
Xiao, Yilin [1 ]
Du, Siliang [1 ]
Chen, Xu [1 ]
Liu, Mingzhong [1 ]
Sun, Mingwei [2 ]
机构
[1] Huawei Technol Co Ltd, Wuhan 430074, Hubei, Peoples R China
[2] Wuhan Univ, Sch Remote Sensing & Informat Engn, Wuhan 430079, Hubei, Peoples R China
关键词
Image localization; Dual-attention-transformer; Semantic reranking; Adaptive triplet loss; VISUAL PLACE RECOGNITION;
D O I
10.1007/s10489-024-05539-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The large-scale image-based localization (IBL) problem involves matching a query image with a database image to determine the geolocation of the query. A major challenge in this problem stems from significant variations between images captured at the same location, including different viewpoints, illumination conditions, and seasonal changes. To address this issue, we recognize the potential advantages of integrating difficult positive samples into the training process. Consequently, we introduce a novel retrieval-based framework meticulously designed to harness the advantages presented by these difficult positive samples. A pivotal component is the proposed dual-attention-transformer-based semantic reranking module, which leverages semantic segmentation to preserve local feature points. This module, powered by the dual-attention-transformer, extracts nuanced global-to-local information via channel self-attention and window self-attention, thereby facilitating sample augmentation and final reranking. Additionally, we introduce the adaptive triplet loss, a dynamic mechanism incorporating weighted difficult positive samples into supervised information, which strengthens the model's robustness. We extensively evaluate our framework on various city-level datasets and demonstrate its superiority over state-of-the-art methods. Furthermore, an exhaustive ablation study systematically validates the effectiveness of each individual component, underscoring their contributions to the proposed methodology.
引用
收藏
页码:6946 / 6958
页数:13
相关论文
共 50 条
  • [1] Semantic segmentation of large-scale point clouds by integrating attention mechanisms and transformer models
    Yuan, Tiebiao
    Yu, Yangyang
    Wang, Xiaolong
    IMAGE AND VISION COMPUTING, 2024, 146
  • [2] MCTNet: Multiscale Cross-Attention-Based Transformer Network for Semantic Segmentation of Large-Scale Point Cloud
    Guo, Bo
    Deng, Liwei
    Wang, Ruisheng
    Guo, Wenchao
    Ng, Alex Hay-Man
    Bai, Wenfeng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [3] Semantic signatures for large-scale visual localization
    Li Weng
    Valérie Gouet-Brunet
    Bahman Soheilian
    Multimedia Tools and Applications, 2021, 80 : 22347 - 22372
  • [4] Semantic signatures for large-scale visual localization
    Weng, Li
    Gouet-Brunet, Valerie
    Soheilian, Bahman
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (15) : 22347 - 22372
  • [5] Memory efficient large-scale image-based localization
    Guoyu Lu
    Nicu Sebe
    Congfu Xu
    Chandra Kambhamettu
    Multimedia Tools and Applications, 2015, 74 : 479 - 503
  • [6] Memory efficient large-scale image-based localization
    Lu, Guoyu
    Sebe, Nicu
    Xu, Congfu
    Kambhamettu, Chandra
    MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 74 (02) : 479 - 503
  • [7] Deformable Transformer and Spectral U-Net for Large-Scale Hyperspectral Image Semantic Segmentation
    Zhang, Tianjian
    Xue, Zhaohui
    Su, Hongjun
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 20227 - 20244
  • [8] A Novel Image Retrieval Method for Image Based Localization in Large-Scale Environment
    Yin, Xiliang
    Ma, Lin
    Tan, Xuezhi
    2021 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE WORKSHOPS (WCNCW), 2021,
  • [9] Large-scale image retrieval based on boosting iterative quantization hashing with query-adaptive reranking
    Fu, Haiyan
    Kong, Xiangwei
    Lu, Jiayin
    NEUROCOMPUTING, 2013, 122 : 480 - 489
  • [10] Robust Large-Scale Collaborative Localization Based on Semantic Submaps With Extreme Outliers
    Tang, Yujie
    Wang, Meiling
    Yang, Yi
    Lan, Ziquan
    Yue, Yufeng
    IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2024, 29 (04) : 2649 - 2660