Cross-Parallel Attention and Efficient Match Transformer for Aerial Tracking

被引:1
|
作者
Deng, Anping [1 ,2 ]
Han, Guangliang [1 ]
Zhang, Zhongbo [3 ]
Chen, Dianbing [1 ]
Ma, Tianjiao [1 ]
Liu, Zhichao [1 ,2 ]
机构
[1] Chinese Acad Sci, Changchun Inst Opt Fine Mech & Phys CIOMP, Changchun 130033, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 101408, Peoples R China
[3] Jilin Univ, Sch Math, Changchun 130012, Peoples R China
关键词
visual object tracking; UAV tracking; efficient match transformer; attention method;
D O I
10.3390/rs16060961
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Visual object tracking is a key technology that is used in unmanned aerial vehicles (UAVs) to achieve autonomous navigation. In recent years, with the rapid development of deep learning, tracking algorithms based on Siamese neural networks have received widespread attention. However, because of complex and diverse tracking scenarios, as well as limited computational resources, most existing tracking algorithms struggle to ensure real-time stable operation while improving tracking performance. Therefore, studying efficient and fast-tracking frameworks, and enhancing the ability of algorithms to respond to complex scenarios has become crucial. Therefore, this paper proposes a cross-parallel attention and efficient match transformer for aerial tracking (SiamEMT). Firstly, we carefully designed the cross-parallel attention mechanism to encode global feature information and to achieve cross-dimensional interaction and feature correlation aggregation via parallel branches, highlighting feature saliency and reducing global redundancy information, as well as improving the tracking algorithm's ability to distinguish between targets and backgrounds. Meanwhile, we implemented an efficient match transformer to achieve feature matching. This network utilizes parallel, lightweight, multi-head attention mechanisms to pass template information to the search region features, better matching the global similarity between the template and search regions, and improving the algorithm's ability to perceive target location and feature information. Experiments on multiple drone public benchmark tests verified the accuracy and robustness of the proposed tracker in drone tracking scenarios. In addition, on the embedded artificial intelligence (AI) platform AGX Xavier, our algorithm achieved real-time tracking speed, indicating that our algorithm can be effectively applied to UAV tracking scenarios.
引用
收藏
页数:21
相关论文
共 50 条
  • [21] RGB-Sonar Tracking Benchmark and Spatial Cross-Attention Transformer Tracker
    Li, Yunfeng
    Wang, Bo
    Sun, Jiuran
    Wu, Xueyi
    Li, Ye
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (03) : 2260 - 2275
  • [22] An asymmetric cross-parallel high step-up DC-DC converter
    Qiao W.
    Zhang S.
    Zhang M.
    Zhao Z.
    Dianli Xitong Baohu yu Kongzhi/Power System Protection and Control, 2019, 47 (16): : 151 - 158
  • [23] Ship Attitude Prediction Model Based on Cross-Parallel Algorithm Optimized Neural Network
    Jiang, Yanshu
    Jia, Mingqi
    Zhang, Biao
    Deng, Liwei
    IEEE ACCESS, 2022, 10 : 77857 - 77871
  • [24] Interframe Saliency Transformer and Lightweight Multidimensional Attention Network for Real-Time Unmanned Aerial Vehicle Tracking
    Deng, Anping
    Han, Guangliang
    Chen, Dianbing
    Ma, Tianjiao
    Wei, Xilai
    Liu, Zhichao
    REMOTE SENSING, 2023, 15 (17)
  • [25] Simple Online Unmanned Aerial Vehicle Tracking with Transformer
    Liu, Yang
    Wang, Ershen
    Xu, Song
    Wang, Zhi
    Liu, Meizhi
    Shu, Wansen
    2021 IEEE 20TH INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2021), 2021, : 1235 - 1239
  • [26] Local Perception-Aware Transformer for Aerial Tracking
    Fu, Changhong
    Peng, Weiyu
    Li, Sihang
    Ye, Junjie
    Cao, Ziang
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 12122 - 12129
  • [27] MTAtrack: Multilevel transformer attention for visual tracking
    An, Dong
    Zhang, Fan
    Zhao, Yuqian
    Luo, Biao
    Yang, Chunhua
    Chen, Baifan
    Yu, Lingli
    OPTICS AND LASER TECHNOLOGY, 2023, 166
  • [28] Transformer Tracking with Cyclic Shifting Window Attention
    Song, Zikai
    Yu, Junqing
    Chen, Yi-Ping Phoebe
    Yang, Wei
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 8781 - 8790
  • [29] ECDet: efficient oriented object detection on the aerial image with cross-layer attention
    Lyu, Xueqiang
    Tian, Lianghai
    Teng, Shangzhi
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2025, 22 (01)
  • [30] CASwin Transformer: A Hierarchical Cross Attention Transformer for Depth Completion
    Feng, Chunyu
    Wang, Xiaonian
    Zhang, Yangyang
    Zhao, Chengfeng
    Song, Mengxuan
    2022 IEEE 25TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2022, : 2836 - 2841