SiamATA: an asymmetric target-aware and frequency domain task-aware Siamese network for visual tracking

Cited by: 0
Authors
Liang, Xingzhu [1 ,2 ,3 ]
Xiao, Yunzhuang [1 ]
Lin, Yu-e [1 ]
Yan, Xinyun [4 ]
Affiliations
[1] Anhui Univ Sci & Technol, Sch Comp Sci & Engn, Huainan 232001, Anhui, Peoples R China
[2] Anhui Univ Sci & Technol, Huainan Peoples Hosp 1, Affiliated Hosp 1, Huainan 232007, Anhui, Peoples R China
[3] Anhui Univ Sci & Technol, Inst Environm Friendly Mat & Occupat Hlth, Wuhu 241003, Anhui, Peoples R China
[4] Jinling Inst Technol, Jiangsu AI Transportat Innovat & Applicat Engn Res, Nanjing 211169, Jiangsu, Peoples R China
Keywords
Target-aware attention; Task-aware attention; Siamese network; Visual tracking; OBJECT TRACKING;
DOI
10.1007/s13042-024-02394-y
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In recent years, Siamese-based trackers have achieved promising results in visual object tracking. However, the feature extraction capability of popular Siamese-like networks is limited, which makes it difficult to fully distinguish the object from the background, and trackers are prone to drift under occlusion, scale variation, and fast motion. In this paper, we propose a novel tracker, SiamATA, a Siamese network with asymmetric target-aware and frequency domain task-aware attention. The network is built on an asymmetric structure of classification-regression branches, comprising a template classification branch, a template regression branch, a search region classification branch, and a search region regression branch, to alleviate overfitting. A target-aware attention module learns rich context information through spatial attention and selectively emphasizes interdependent channel features through channel attention, providing target-aware semantic features for each branch. In addition, we adopt a nonlocal pixel-wise correlation method to suppress interference from similar objects. Finally, we design a frequency domain task-aware attention module to explore the self-semantic information of the classification and regression branches. Extensive experiments on six benchmarks (OTB100, UAV123, VOT2018, VOT2019, GOT-10K, and LaSOT) demonstrate the effectiveness of our tracker.
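The abstract names pixel-wise correlation but does not define it. As a rough illustration only, the sketch below shows one common formulation used in Siamese trackers, in which every spatial position of the template feature map acts as a 1x1 kernel over the search feature map. The function name, tensor shapes, and the use of NumPy are assumptions made for this example; the nonlocal component and the exact SiamATA design are not reproduced here.

```python
import numpy as np


def pixelwise_correlation(template, search):
    """Plain pixel-wise correlation (illustrative, not the SiamATA module).

    template: array of shape (C, Hz, Wz), template-branch features.
    search:   array of shape (C, Hx, Wx), search-region features.
    Returns an array of shape (Hz * Wz, Hx, Wx): one response map per
    template position.
    """
    c, hz, wz = template.shape
    c_search, hx, wx = search.shape
    if c != c_search:
        raise ValueError("template and search must have the same channel count")

    # Each template position becomes a C-dimensional 1x1 kernel.
    kernels = template.reshape(c, hz * wz)   # (C, Hz*Wz)
    pixels = search.reshape(c, hx * wx)      # (C, Hx*Wx)

    # Dot product of every kernel with every search pixel.
    response = kernels.T @ pixels            # (Hz*Wz, Hx*Wx)
    return response.reshape(hz * wz, hx, wx)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    z = rng.standard_normal((4, 3, 3))   # hypothetical template features
    x = rng.standard_normal((4, 5, 5))   # hypothetical search features
    print(pixelwise_correlation(z, x).shape)
```

Compared with sliding the whole template over the search region, this formulation keeps a separate response channel for each template position, so spatial detail from the template is not averaged into a single map.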
Pages: 20
Related papers
50 in total
  • [1] SiamBAN: Target-Aware Tracking With Siamese Box Adaptive Network
    Chen, Zedu
    Zhong, Bineng
    Li, Guorong
    Zhang, Shengping
    Ji, Rongrong
    Tang, Zhenjun
    Li, Xianxian
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (04) : 5158 - 5173
  • [2] SiamCA: Siamese visual tracking with customized anchor and target-aware interaction
    Pan, Shuqi
    Zhang, Canlong
    Li, Zhixin
    Hu, Liaojie
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 238
  • [3] Tracking Algorithm for Siamese Network Based on Target-Aware Feature Selection
    Chen Zhiwang
    Zhang Zhongxin
    Song Juan
    Luo Hongfu
    Peng Yong
    ACTA OPTICA SINICA, 2020, 40 (09)
  • [4] Target-Aware Siamese Networks Based on Masked Attention Mechanism for Visual Object Tracking
    Su, Yao-Hui
    Shieh, Ming-Der
    Tsai, Chia-Chi
    2024 IEEE 7TH INTERNATIONAL CONFERENCE ON MULTIMEDIA INFORMATION PROCESSING AND RETRIEVAL, MIPR 2024, 2024, : 28 - 34
  • [5] Target-Aware State Estimation for Visual Tracking
    Zhou, Zikun
    Li, Xin
    Fan, Nana
    Wang, Hongpeng
    He, Zhenyu
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (05) : 2908 - 2920
  • [6] Learning target-aware correlation filters for visual tracking
    Li, Dongdong
    Wen, Gongjian
    Kuai, Yangliu
    Xiao, Jingjing
    Porikli, Fatih
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 58 : 149 - 159
  • [7] Target-Aware Deep Tracking
    Li, Xin
    Ma, Chao
    Wu, Baoyuan
    He, Zhenyu
    Yang, Ming-Hsuan
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 1369 - 1378
  • [8] DomainSiam: Domain-Aware Siamese Network for Visual Object Tracking
    Abdelpakey, Mohamed H.
    Shehata, Mohamed S.
    ADVANCES IN VISUAL COMPUTING, ISVC 2019, PT I, 2020, 11844 : 45 - 58
  • [9] Target-Aware Transformer Tracking
    Zheng, Yuhui
    Zhang, Yan
    Xiao, Bin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (09) : 4542 - 4551
  • [10] Visual object tracking using learnable target-aware token emphasis
    Park, Minho
    Song, Jinjoo
    Yoon, Sang Min
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 149