Target-Aware Transformer for Satellite Video Object Tracking

被引:12
|
作者
Lai, Pujian [1 ]
Zhang, Meili [1 ]
Cheng, Gong [1 ]
Li, Shengyang [2 ,3 ]
Huang, Xiankai [4 ]
Han, Junwei [1 ]
机构
[1] Northwestern Polytech Univ, Sch Automat, Xian 710072, Peoples R China
[2] Chinese Acad Sci, Ctr Space Utilizat, Key Lab Space Utilizat Technol & Engn, Beijing 100094, Peoples R China
[3] Univ Chinese Acad Sci, Sch Aeronaut & Astronaut, Beijing 100049, Peoples R China
[4] Beijing Technol & Business Univ, Business Sch, Beijing 100048, Peoples R China
基金
中国国家自然科学基金;
关键词
Bi-direction propagation and fusion (Bi-PF); satellite video object tracking; target-aware enhancement (TAE); CORRELATION FILTER;
D O I
10.1109/TGRS.2023.3339658
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Recent years have witnessed the astonishing development of transformer-based paradigm in single object tracking (SOT) in generic videos. However, due to the fact that the targets of interest in satellite videos are small in size and weak in visual appearance, the advancements of transformer-based paradigm in satellite video object tracking are impeded. To alleviate this issue, a novel transformer-based recipe is proposed, which consists of a bi-direction propagation and fusion (Bi-PF) strategy and a target-aware enhancement (TAE) module. Concretely, we first adopt the Bi-PF strategy to make full use of multiscale information to generate discriminative representations of tracking targets. Then, the TAE module is employed to decouple an object query into content-aware embedding and spatial-aware embedding and produce a target prototype to help get high-quality content-aware embedding. It is worth mentioning that, different from the previous methods in satellite video tracking most of which evaluate their performance using only several videos, we conduct extensive experiments on the SatSOT dataset which consists of 105 videos. In particular, the proposed method achieves the success score of 45.6% and the precision score of 57.6%, surpassing the baseline method by 5.0% and 9.5%, respectively.
引用
收藏
页码:1 / 10
页数:10
相关论文
共 50 条
  • [1] TATrack: Target-aware transformer for object tracking
    Huang, Kai
    Chu, Jun
    Leng, Lu
    Dong, Xingbo
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 127
  • [2] Target-Aware Transformer Tracking
    Zheng, Yuhui
    Zhang, Yan
    Xiao, Bin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (09) : 4542 - 4551
  • [3] Know Who You Are: Learning Target-Aware Transformer for Object Tracking
    Zou, Zhuojun
    Liu, Xuexin
    Zhang, Yuanpei
    Shu, Lin
    Hao, Jie
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 1427 - 1432
  • [4] TabCtNet: Target-aware bilateral CNN-transformer network for single object tracking in satellite videos
    Zhu, Qiqi
    Huang, Xin
    Guan, Qingfeng
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2024, 128
  • [5] Target-aware transformer tracking with hard occlusion instance generation
    Xiao, Dingkun
    Wei, Zhenzhong
    Zhang, Guangjun
    FRONTIERS IN NEUROROBOTICS, 2024, 17
  • [6] Target-Aware Deep Tracking
    Li, Xin
    Ma, Chao
    Wu, Baoyuan
    He, Zhenyu
    Yang, Ming-Hsuan
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 1369 - 1378
  • [7] Visual object tracking using learnable target-aware token emphasis
    Park, Minho
    Song, Jinjoo
    Yoon, Sang Min
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 149
  • [8] Target-Aware Object Discovery and Association for Unsupervised Video Multi-Object Segmentation
    Zhou, Tianfei
    Li, Jianwu
    Li, Xueyi
    Shao, Ling
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 6981 - 6990
  • [9] Target-aware and spatial-spectral discriminant feature joint correlation filters for hyperspectral video object tracking
    Tang, Yiming
    Liu, Yufei
    Huang, Hong
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2022, 223
  • [10] Knowledge Distillation via the Target-aware Transformer
    Lin, Sihao
    Xie, Hongwei
    Wang, Bing
    Yu, Kaicheng
    Chang, Xiaojun
    Liang, Xiaodan
    Wang, Gang
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 10905 - 10914