SiamATA: an asymmetric target-aware and frequency domain task-aware Siamese network for visual tracking

被引:0
|
作者
Liang, Xingzhu [1 ,2 ,3 ]
Xiao, Yunzhuang [1 ]
Lin, Yu-e [1 ]
Yan, Xinyun [4 ]
机构
[1] Anhui Univ Sci & Technol, Sch Comp Sci & Engn, Huainan 232001, Anhui, Peoples R China
[2] Anhui Univ Sci & Technol, Huainan Peoples Hosp 1, Affiliated Hosp 1, Huainan 232007, Anhui, Peoples R China
[3] Anhui Univ Sci & Technol, Inst Environm Friendly Mat & Occupat Hlth, Wuhu 241003, Anhui, Peoples R China
[4] Jinling Inst Technol, Jiangsu AI Transportat Innovat & Applicat Engn Res, Nanjing 211169, Jiangsu, Peoples R China
关键词
Target-aware attention; Task-aware attention; Siamese network; Visual tracking; OBJECT TRACKING;
D O I
10.1007/s13042-024-02394-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, Siamese-based trackers have achieved promising results in visual object tracking. However, the feature extraction capability of current popular Siamese-like networks is limited, making it difficult to fully distinguish the object from the background. Trackers are susceptible to drifting caused by factors such as occlusion, scale variation, and fast motion. In this paper, we propose a novel tracker, dubbed Siamese network with asymmetric target-aware and task-aware (SiamATA). The network is based on the asymmetric structure of the classification-regression branches, including the template classification branch, template regression branch, search region classification branch, and search region regression branch, to alleviate overfitting. Meanwhile, a target-aware attention module is introduced to learn powerful context information through spatial attention and selectively emphasize dependency channel features through channel attention, providing target-aware semantic features for each branch. In addition, we adopt the nonlocal pixel-wise correlation method to suppress the influence of similar object interference. Finally, we design a frequency domain task-aware attention module to explore the self-semantic information of classification and regression branches. Extensive experiments demonstrate the effectiveness of our tracker on six benchmarks: OTB100, UAV123, VOT2018, VOT2019, GOT-10K, and LaSOT.
引用
收藏
页数:20
相关论文
共 50 条
  • [21] A Learning Frequency-Aware Feature Siamese Network for Real-Time Visual Tracking
    Yang, Yuxiang
    Xing, Weiwei
    Zhang, Shunli
    Yu, Qi
    Guo, Xiaoyu
    Guo, Min
    ELECTRONICS, 2020, 9 (05)
  • [22] Target-Aware Correlation Filter Tracking in RGBD Videos
    Kuai, Yangliu
    Wen, Gongjian
    Li, Dongdong
    Xiao, Jingjing
    IEEE SENSORS JOURNAL, 2019, 19 (20) : 9522 - 9531
  • [23] Structural target-aware model for thermal infrared tracking
    Yuan, Di
    Shu, Xiu
    Liu, Qiao
    He, Zhenyu
    NEUROCOMPUTING, 2022, 491 : 44 - 56
  • [24] DATA: Domain-Aware and Task-Aware Self-supervised Learning
    Chang, Qing
    Peng, Junran
    Xie, Lingxi
    Sun, Jiajun
    Tian, Qi
    Zhang, Zhaoxiang
    Yin, Haoran
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 9831 - 9840
  • [25] Siamese global location-aware network for visual object tracking
    Jiafeng Li
    Bin Li
    Guodong Ding
    Li Zhuo
    International Journal of Machine Learning and Cybernetics, 2023, 14 : 3607 - 3620
  • [26] Two-stage aware attentional Siamese network for visual tracking
    Sun, Xinglong
    Han, Guangliang
    Guo, Lihong
    Yang, Hang
    Wu, Xiaotian
    Li, Qingqing
    PATTERN RECOGNITION, 2022, 124
  • [27] Siamese global location-aware network for visual object tracking
    Li, Jiafeng
    Li, Bin
    Ding, Guodong
    Zhuo, Li
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (10) : 3607 - 3620
  • [28] HammerDrive: A Task-Aware Driving Visual Attention Model
    Amadori, Pierluigi Vito
    Fischer, Tobias
    Demiris, Yiannis
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (06) : 5573 - 5585
  • [29] TEFNet: Target-Aware Enhanced Fusion Network for RGB-T Tracking
    Chen, Panfeng
    Gong, Shengrong
    Ying, Wenhao
    Du, Xin
    Zhong, Shan
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT X, 2024, 14434 : 432 - 443
  • [30] Target-Aware Transformer for Satellite Video Object Tracking
    Lai, Pujian
    Zhang, Meili
    Cheng, Gong
    Li, Shengyang
    Huang, Xiankai
    Han, Junwei
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 10