Online Multi-Scale Classification and Global Feature Modulation for Robust Visual Tracking

被引:3
|
作者
Gao, Qi [1 ]
Yin, Mingfeng [2 ]
Wu, Xiang [3 ]
Liu, Di [4 ]
Bo, Yuming [3 ]
机构
[1] Jiangsu Univ Technol, Coll Mech Engn, Changzhou 213001, Peoples R China
[2] Jiangsu Univ Technol, Sch Automobile & Traff Engn, Changzhou 213001, Peoples R China
[3] Nanjing Univ Sci & Technol, Sch Automat, Nanjing 210094, Peoples R China
[4] Nanjing Inst Technol, Sch Automat, Nanjing 211167, Peoples R China
基金
中国国家自然科学基金;
关键词
Visualization; Target tracking; Accuracy; Fuses; Modulation; Transformers; Real-time systems; Visual object tracking; coordinate attention; online multi-scale classification; global feature modulation; OBJECT TRACKING;
D O I
10.1109/TCSVT.2023.3343949
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Recent advanced trackers, composed of discriminative classification and dedicated bounding box estimation, have achieved remarkable advancements in performance of visual object tracking. However, existing methods cannot satisfy the demands of tracking tasks in complex scenes, such as occlusion, scale variations, and etc. To this end, we propose a novel online multi-scale classification and global feature modulation for robust visual tracking, which is developed over accurate tracking by overlap maximization, named ATOM+. First, coordinate attention (CA) is applied to enhance the target features in the channel dimension and spatial dimension, which can effectively optimize the feature representation ability of the backbone network. Second, an online multi-scale classification (OMC) module is designed. During the online tracking phase, more reliable matching responses are comprehensively generated by aggregating information from different scales related to the target. This new operation enables stable perception of the target by the tracker, particularly when severe changes in the appearance and posture of the target are encountered. Third, a global feature modulation (GFM) mechanism is constructed, which requires only a small amount of computational resources, to fuse the spatial contextual information of the template image into the search region. This integration refines the bounding box to obtain an accurate estimate of the target state. Finally, comprehensive experiments on conventional tracking benchmarks of OTB100, LaSOT, and VOT2018 show that our tracker can sufficiently address different challenging scenarios, and achieves state-of-the-art performance. For the average running speed, our tracker can achieve 37 FPS in real time.
引用
收藏
页码:5321 / 5334
页数:14
相关论文
共 50 条
  • [41] ONLINE LEARNING OF MULTI-FEATURE WEIGHTS FOR ROBUST OBJECT TRACKING
    Zhou, Tao
    Bhaskar, Harish
    Xie, Kai
    Yang, Jie
    He, Xiangjian
    Shi, Pengfei
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 725 - 729
  • [42] Multi-scale image preprocessing and feature tracking for remote CME characterization
    Stepanyuk, Oleg
    Kozarev, Kamen
    Nedal, Mohamed
    JOURNAL OF SPACE WEATHER AND SPACE CLIMATE, 2022, 12
  • [43] Target tracking based on multi-scale feature extraction Kalman filter
    Kong Jun
    Tang Xin-Yi
    Jiang Min
    Liu Shi-Jian
    Li Dan
    JOURNAL OF INFRARED AND MILLIMETER WAVES, 2011, 30 (05) : 446 - 450
  • [44] Robust Visual Tracking with Distribution Fields Feature Selection Based on Online Discrimination
    Guo Q.
    Wu C.-D.
    Zhao Y.-C.
    Guo, Qiang (royinchina@163.com), 2017, Northeast University (38): : 305 - 309
  • [45] Multi Feature Representation and Aggregation Network for Accurate and Robust Visual Tracking
    Yang, Yijin
    Gu, Xiaodong
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [46] Robust human gesture recognition by leveraging multi-scale feature fusion
    Deng, Minwei
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2020, 83
  • [47] A Multi-scale feature modulation network for efficient underwater image enhancement
    Zheng, Shijian
    Wang, Rujing
    Zheng, Shitao
    Wang, Fenmei
    Wang, Liusan
    Liu, Zhigui
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2024, 36 (01)
  • [48] Multi-Scale Feature For Recognition
    Lei, Songze
    Hao, Chongyang
    Qi, Min
    ICECT: 2009 INTERNATIONAL CONFERENCE ON ELECTRONIC COMPUTER TECHNOLOGY, PROCEEDINGS, 2009, : 277 - 280
  • [49] Customizing the feature modulation for visual tracking
    Zhang, Yuping
    Yang, Zepeng
    Ma, Bo
    Wu, Jiahao
    Jin, Fusheng
    VISUAL COMPUTER, 2024, 40 (09): : 6547 - 6566
  • [50] Robust visual tracking algorithm based on structural multi-scale features adaptive fusion in co-training
    Zheng, Chao
    Jin, Wei
    Fang, Fang
    Tang, Chong
    Ling, Yongshun
    2016 3RD INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CONTROL ENGINEERING (ICISCE), 2016, : 588 - 592