Transformer Tracking for Satellite Video: Matching, Propagation, and Prediction

被引:0
|
作者
Zhao, Manqi [1 ,2 ]
Li, Shengyang [1 ,3 ]
Yang, Jian [1 ,3 ]
机构
[1] Chinese Acad Sci, Technol & Engn Ctr Space Utilizat, Key Lab Space Utilizat, Beijing 100094, Peoples R China
[2] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing 100049, Peoples R China
[3] Univ Chinese Acad Sci, Sch Aeronaut & Astronaut, Beijing 100049, Peoples R China
关键词
Target tracking; Satellites; Transformers; Training; Object tracking; Predictive models; Pipelines; Adaptation models; Feature extraction; Accuracy; Satellite video object tracking; sequence prediction; static matching; temporal propagation; transformer; OBJECT TRACKING; CORRELATION FILTER;
D O I
10.1109/TGRS.2024.3501380
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Recently, transformer-based trackers have brought overwhelming advantages in general video. However, their performance in satellite video has been hindered by insufficient satellite-specific training and a lack of designs tailored to satellite targets and scene characteristics. To tackle these challenges, we propose a novel transformer-based tracking framework for satellite video object tracking: Transformer Matching, Propagation, and Prediction (TransMPP). TransMPP combines three stages: static matching, dynamic propagation, and prediction, to ensure accurate tracking in satellite videos. Specifically, the Matching model uses a one-stream pipeline for simultaneous feature extraction and relationship modeling across extensive search and template areas, thereby improving foreground and background discrimination capabilities. In addition, the Propagation and Prediction models enhance temporal modeling capabilities through local long-term and short-term feature propagation and global sequence prediction, respectively, boosting tracking robustness. Moreover, to ensure a fair comparison and evaluation, we also developed SatSOT-train, a large-scale training dataset for the SatSOT benchmark. After comprehensive training, TransMPP demonstrates state-of-the-art (SOTA) performance on the SatSOT dataset, achieving an area under the curve (AUC) score of 59.9% and a precision score of 71.5%, bringing improvements of 6.3% and 5.3%, respectively. The code will be available at https://github.com/DonDominic/TransMPP.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] SatSOT: A Benchmark Dataset for Satellite Video Single Object Tracking
    Zhao, Manqi
    Li, Shengyang
    Xuan, Shiyu
    Kou, Longxuan
    Gong, Shuai
    Zhou, Zhuang
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [32] Multitarget Detection and Tracking Method in Remote Sensing Satellite Video
    Lei, Lei
    Guo, Dongen
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2021, 2021
  • [33] Online Background Discriminative Learning for Satellite Video Object Tracking
    Zhong, Yanfei
    Fang, Xueting
    Shu, Meng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 15
  • [34] A social commerce information propagation prediction model based on transformer
    Zhang, Huibing
    Dai, Jingrui
    He, Junfei
    Zhang, Huacheng
    2020 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE COMMUNICATION AND NETWORK SECURITY (CSCNS2020), 2021, 336
  • [35] Transformer Meets Remote Sensing Video Detection and Tracking: A Comprehensive Survey
    Jiao, Licheng
    Zhang, Xin
    Liu, Xu
    Liu, Fang
    Yang, Shuyuan
    Ma, Wenping
    Li, Lingling
    Chen, Puhua
    Feng, Zhixi
    Guo, Yuwei
    Tang, Xu
    Hou, Biao
    Zhang, Xiangrong
    Bai, Jing
    Quan, Dou
    Zhang, Junpeng
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 : 18 - 45
  • [36] A Spectral-Spatial Transformer Fusion Method for Hyperspectral Video Tracking
    Wang, Ye
    Liu, Yuheng
    Ma, Mingyang
    Mei, Shaohui
    REMOTE SENSING, 2023, 15 (07)
  • [37] Satellite video single object tracking: A systematic review and an oriented object tracking benchmark
    Chen, Yuzeng
    Tang, Yuqi
    Xiao, Yi
    Yuan, Qiangqiang
    Zhang, Yuwei
    Liu, Fengqing
    He, Jiang
    Zhang, Liangpei
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2024, 210 : 212 - 240
  • [38] Dual attentional transformer for video visual relation prediction q
    Qu, Mingcheng
    Deng, Ganlin
    Di, Donglin
    Cui, Jianxun
    Su, Tonghua
    NEUROCOMPUTING, 2023, 550
  • [39] Bird Flu Outbreak Prediction via Satellite Tracking
    Zhou, Yuanchun
    Tang, Mingjie
    Pan, Weike
    Li, Jinyan
    Wang, Weihang
    Shao, Jing
    Wu, Liang
    Li, Jianhui
    Yang, Qiang
    Yan, Baoping
    IEEE INTELLIGENT SYSTEMS, 2014, 29 (04) : 10 - 17
  • [40] Classifying, Segmenting, and Tracking Object Instances in Video with Mask Propagation
    Bertasius, Gedas
    Torresani, Lorenzo
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, : 9736 - 9745