Online Multi-Scale Classification and Global Feature Modulation for Robust Visual Tracking

被引:3
|
作者
Gao, Qi [1 ]
Yin, Mingfeng [2 ]
Wu, Xiang [3 ]
Liu, Di [4 ]
Bo, Yuming [3 ]
机构
[1] Jiangsu Univ Technol, Coll Mech Engn, Changzhou 213001, Peoples R China
[2] Jiangsu Univ Technol, Sch Automobile & Traff Engn, Changzhou 213001, Peoples R China
[3] Nanjing Univ Sci & Technol, Sch Automat, Nanjing 210094, Peoples R China
[4] Nanjing Inst Technol, Sch Automat, Nanjing 211167, Peoples R China
基金
中国国家自然科学基金;
关键词
Visualization; Target tracking; Accuracy; Fuses; Modulation; Transformers; Real-time systems; Visual object tracking; coordinate attention; online multi-scale classification; global feature modulation; OBJECT TRACKING;
D O I
10.1109/TCSVT.2023.3343949
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Recent advanced trackers, composed of discriminative classification and dedicated bounding box estimation, have achieved remarkable advancements in performance of visual object tracking. However, existing methods cannot satisfy the demands of tracking tasks in complex scenes, such as occlusion, scale variations, and etc. To this end, we propose a novel online multi-scale classification and global feature modulation for robust visual tracking, which is developed over accurate tracking by overlap maximization, named ATOM+. First, coordinate attention (CA) is applied to enhance the target features in the channel dimension and spatial dimension, which can effectively optimize the feature representation ability of the backbone network. Second, an online multi-scale classification (OMC) module is designed. During the online tracking phase, more reliable matching responses are comprehensively generated by aggregating information from different scales related to the target. This new operation enables stable perception of the target by the tracker, particularly when severe changes in the appearance and posture of the target are encountered. Third, a global feature modulation (GFM) mechanism is constructed, which requires only a small amount of computational resources, to fuse the spatial contextual information of the template image into the search region. This integration refines the bounding box to obtain an accurate estimate of the target state. Finally, comprehensive experiments on conventional tracking benchmarks of OTB100, LaSOT, and VOT2018 show that our tracker can sufficiently address different challenging scenarios, and achieves state-of-the-art performance. For the average running speed, our tracker can achieve 37 FPS in real time.
引用
收藏
页码:5321 / 5334
页数:14
相关论文
共 50 条
  • [31] Classification of Star Spectrum Based on Multi-Scale Feature Fusion
    Han Bo-chong
    Song Yi-han
    Zhao Yong-heng
    SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44 (08) : 2284 - 2288
  • [32] A Vehicle Classification Model Based on Multi-scale Feature Fusion
    Wang, Xuanhong
    Yang, Shiyu
    Sun, Zengguo
    Li, Xiaojun
    Xiao, Yun
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 7180 - 7185
  • [33] Classification of crop pests based on multi-scale feature fusion
    Wei, Depeng
    Chen, Jiqing
    Luo, Tian
    Long, Teng
    Wang, Huabin
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2022, 194
  • [34] Fast multi-scale feature fusion for ECG heartbeat classification
    Danni Ai
    Jian Yang
    Zeyu Wang
    Jingfan Fan
    Changbin Ai
    Yongtian Wang
    EURASIP Journal on Advances in Signal Processing, 2015
  • [35] An improved KCF tracking algorithm based on multi-feature and multi-scale
    Wu, Wei
    Wang, Ding
    Luo, Xin
    Su, Yang
    Tian, Weiye
    MIPPR 2017: AUTOMATIC TARGET RECOGNITION AND NAVIGATION, 2018, 10608
  • [36] Multi-Scale Feature Fusion and Distribution Similarity Network for Few-Shot Automatic Modulation Classification
    Tan, Haoyue
    Zhang, Zhenxi
    Li, Yu
    Shi, Xiaoran
    Zhou, Feng
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 2890 - 2894
  • [37] Global context-aware feature modulation networks for unified multi-scale super-resolution
    Zhang, Dacheng
    Lei, Weimin
    Zhang, Wei
    Chen, Xinyi
    JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (03)
  • [38] Multi-scale predictions fusion for robust hand detection and classification
    Ding, Lu
    Wang, Yong
    Laganiere, Robert
    Luo, Xinbin
    Fu, Shan
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (24) : 35633 - 35650
  • [39] Robust multi-scale superpixel classification for optic cup localization
    Tan, Ngan-Meng
    Xu, Yanwu
    Goh, Wooi Boon
    Liu, Jiang
    COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2015, 40 : 182 - 193
  • [40] Multi-scale predictions fusion for robust hand detection and classification
    Lu Ding
    Yong Wang
    Robert Laganière
    Xinbin Luo
    Shan Fu
    Multimedia Tools and Applications, 2019, 78 : 35633 - 35650