SiamATA: an asymmetric target-aware and frequency domain task-aware Siamese network for visual tracking

被引:0
|
作者
Liang, Xingzhu [1 ,2 ,3 ]
Xiao, Yunzhuang [1 ]
Lin, Yu-e [1 ]
Yan, Xinyun [4 ]
机构
[1] Anhui Univ Sci & Technol, Sch Comp Sci & Engn, Huainan 232001, Anhui, Peoples R China
[2] Anhui Univ Sci & Technol, Huainan Peoples Hosp 1, Affiliated Hosp 1, Huainan 232007, Anhui, Peoples R China
[3] Anhui Univ Sci & Technol, Inst Environm Friendly Mat & Occupat Hlth, Wuhu 241003, Anhui, Peoples R China
[4] Jinling Inst Technol, Jiangsu AI Transportat Innovat & Applicat Engn Res, Nanjing 211169, Jiangsu, Peoples R China
关键词
Target-aware attention; Task-aware attention; Siamese network; Visual tracking; OBJECT TRACKING;
D O I
10.1007/s13042-024-02394-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, Siamese-based trackers have achieved promising results in visual object tracking. However, the feature extraction capability of current popular Siamese-like networks is limited, making it difficult to fully distinguish the object from the background. Trackers are susceptible to drifting caused by factors such as occlusion, scale variation, and fast motion. In this paper, we propose a novel tracker, dubbed Siamese network with asymmetric target-aware and task-aware (SiamATA). The network is based on the asymmetric structure of the classification-regression branches, including the template classification branch, template regression branch, search region classification branch, and search region regression branch, to alleviate overfitting. Meanwhile, a target-aware attention module is introduced to learn powerful context information through spatial attention and selectively emphasize dependency channel features through channel attention, providing target-aware semantic features for each branch. In addition, we adopt the nonlocal pixel-wise correlation method to suppress the influence of similar object interference. Finally, we design a frequency domain task-aware attention module to explore the self-semantic information of classification and regression branches. Extensive experiments demonstrate the effectiveness of our tracker on six benchmarks: OTB100, UAV123, VOT2018, VOT2019, GOT-10K, and LaSOT.
引用
收藏
页数:20
相关论文
共 50 条
  • [31] TASK-AWARE GRAPH CONVOLUTIONAL NETWORK FOR ACTIVE LEARNING
    Ye, Yujia
    Wu, Zhangquan
    Su, Guoliang
    Zhou, Jun
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 495 - 499
  • [32] Target-Aware Tracking With Spatial-Temporal Context Attention
    He, Kai-Jie
    Zhang, Can-Long
    Xie, Sheng
    Li, Zhi-Xin
    Wang, Zhi-Wen
    Qin, Rui-Guo
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (08) : 7176 - 7189
  • [33] TRACKING METHOD OF ONLINE TARGET-AWARE VIA SHRINKAGE LOSS
    Zhang, Jianwei
    Wang, He
    Zhang, Huanlong
    Wang, Jingchao
    Miao, Meng-en
    Wang, Jiandong
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2022, 18 (05): : 1395 - 1411
  • [34] PointSiamRCNN: Target-aware Voxel-based Siamese Tracker for Point Clouds
    Zou, Hao
    Zhang, Chujuan
    Liu, Yong
    Li, Wanlong
    Wen, Feng
    Zhang, Hongbo
    2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 7029 - 7035
  • [35] A location-aware siamese network for high-speed visual tracking
    Lifang Zhou
    Xiang Ding
    Weisheng Li
    Jiaxu Leng
    Bangjun Lei
    Weibin Yang
    Applied Intelligence, 2023, 53 : 4431 - 4447
  • [36] Relation-aware Siamese region proposal network for visual object tracking
    Jiaming Zhu
    Guopeng Zhang
    Shibin Zhou
    Kun Li
    Multimedia Tools and Applications, 2021, 80 : 15469 - 15485
  • [37] Relation-aware Siamese region proposal network for visual object tracking
    Zhu, Jiaming
    Zhang, Guopeng
    Zhou, Shibin
    Li, Kun
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (10) : 15469 - 15485
  • [38] A location-aware siamese network for high-speed visual tracking
    Zhou, Lifang
    Ding, Xiang
    Li, Weisheng
    Leng, Jiaxu
    Lei, Bangjun
    Yang, Weibin
    APPLIED INTELLIGENCE, 2023, 53 (04) : 4431 - 4447
  • [39] An IoU-aware Siamese network for real-time visual tracking
    Wei, Bingbing
    Chen, Hongyu
    Cao, Siqi
    Ding, Qinghai
    Luo, Haibo
    NEUROCOMPUTING, 2023, 527 : 13 - 26
  • [40] Target-aware transformer tracking with hard occlusion instance generation
    Xiao, Dingkun
    Wei, Zhenzhong
    Zhang, Guangjun
    FRONTIERS IN NEUROROBOTICS, 2024, 17