Video Geo-Localization Employing Geo-Temporal Feature Learning and GPS Trajectory Smoothing

被引:2
|
作者
Regmi, Krishna [1 ]
Shah, Mubarak [1 ]
机构
[1] Univ Cent Florida, Ctr Res Comp Vis, Orlando, FL 32816 USA
关键词
D O I
10.1109/ICCV48922.2021.01191
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we address the problem of video geo-localization by proposing a Geo-Temporal Feature Learning (GTFL) Network to simultaneously learn the discriminative features for the query video frames and the gallery images for estimating the geo-spatial trajectory of a query video. Based on a transformer encoder architecture, our GTFL model encodes query and gallery data separately, via two dedicated branches. The proposed GPS Loss and Clip Triplet Loss exploit the geographical and temporal proximity between the frames and the clips to jointly learn the query and the gallery features. We also propose a deep learning approach to trajectory smoothing by predicting the outliers in the estimated GPS positions and learning the offsets to smooth the trajectory. We build a large dataset from four different regions of USA; New York, San Francisco, Berkeley and Bay Area using BDD driving videos as query, and by collecting corresponding Google StreetView (GSV) Images for gallery. Extensive evaluations of proposed method on this new dataset are provided. Code and dataset details is publicly available at https://github.com/kregmi/VTE.
引用
收藏
页码:12106 / 12115
页数:10
相关论文
共 50 条
  • [1] Geo-localization based on CNN feature matching
    Jin Tang
    Cheng Gong
    Fan Guo
    Zirong Yang
    Zhihu Wu
    Optoelectronics Letters, 2022, 18 : 300 - 306
  • [2] Geo-localization based on CNN feature matching
    TANG Jin
    GONG Cheng
    GUO Fan
    YANG Zirong
    WU Zhihu
    OptoelectronicsLetters, 2022, 18 (05) : 300 - 306
  • [3] Geo-localization based on CNN feature matching
    Tang Jin
    Gong Chang
    Guo Fan
    Yang Zirong
    Wu Zhihu
    OPTOELECTRONICS LETTERS, 2022, 18 (05) : 300 - 306
  • [4] Learned Contextual Feature Reweighting for Image Geo-Localization
    Kim, Hyo Jin
    Dunn, Enrique
    Frahm, Jan-Michael
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 3251 - 3260
  • [5] GAMa: Cross-View Video Geo-Localization
    Vyas, Shruti
    Chen, Chen
    Shah, Mubarak
    COMPUTER VISION, ECCV 2022, PT XXXVII, 2022, 13697 : 440 - 456
  • [6] Learning Robust Feature Representation for Cross-View Image Geo-Localization
    Gan, Wenjian
    Zhou, Yang
    Hu, Xiaofei
    Zhao, Luying
    Huang, Gaoshuang
    Hou, Mingbo
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2025, 22
  • [7] Leveraging cross-view geo-localization with ensemble learning and temporal awareness
    Ghanem, Abdulrahman
    Abdelhay, Ahmed
    Salah, Noor Eldeen
    Nour Eldeen, Ahmed
    Elhenawy, Mohammed
    Masoud, Mahmoud
    Hassan, Ammar M. M.
    Hassan, Abdallah A. A.
    PLOS ONE, 2023, 18 (03):
  • [8] Overhead-based image and video geo-localization framework
    Hammoud, Riad I.
    Kuzdeba, Scott A.
    Berard, Brian
    Tom, Victor
    Ivey, Richard
    Bostwick, Renu
    HandUber, Jason
    Vinciguerra, Lori
    Shnidman, Nathan
    Smiley, Byron
    2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2013, : 320 - 327
  • [9] End-to-end Learning Improves Static Object Geo-localization from Video
    Chaabane, Mohamed
    Gueguen, Lionel
    Trabelsi, Ameni
    Beveridge, Ross
    O'Hara, Stephen
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, : 2062 - 2071
  • [10] Perceptual Feature Fusion Network for Cross-View Geo-Localization
    Wang, Jiayi
    Chen, Ziyang
    Yuan, Xiaochen
    Zhao, Genping
    Computer Engineering and Applications, 60 (03): : 255 - 262