Learning Multifrequency Integration Network for RGBT Tracking

被引:5
|
作者
Mei, Jiatian [1 ]
Zhou, Juxiang [1 ]
Wang, Jun [1 ]
Hao, Jia [1 ]
Zhou, Dongming [2 ]
Cao, Jinde [3 ,4 ]
机构
[1] Yunnan Normal Univ, Yunnan Key Lab Smart Educ, Key Lab Educ Informat Nationalities, Minist Educ, Kunming 650500, Yunnan, Peoples R China
[2] Yunnan Univ, Sch Informat Sci & Engn, Kunming 650091, Yunnan, Peoples R China
[3] Southeast Univ, Sch Math, Nanjing 211189, Peoples R China
[4] Ahlia Univ, Manama 10878, Bahrain
基金
中国国家自然科学基金;
关键词
Intermodal; intramodal; modal heterogeneity; multifrequency integration (MI); RGBT tracking; FUSION;
D O I
10.1109/JSEN.2024.3370144
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
RGBT tracking is an attractive topic that benefits from the complementarity of visible and thermal sensors to better handle tracking tasks in atrocious scenarios. Existing RGBT trackers typically introduce self-attention (SA) to capture long-range dependencies. However, recent findings suggest that SA is a low-pass filter, meaning that high-frequency clues involving local edges and texture may be repressed. Aiming at the problem, this article comprehensively considers the multifrequency knowledge of heterogeneous modalities and proposes a learning multifrequency integration network (LMINet) for RGBT tracking to effectively implement adaptive extraction, enhancement, and integration of multifrequency cues. The proposed LMINet primarily benefits from the deployment of three crucial components: pattern-aware reinforcement (PR), multifrequency enhancement (ME), and MI. Specifically, the PR part consists of a carefully designed reinforcement unit (RU) and learnable weighting strategy 1 (LWS1). The former extracts information from the data flow to enhance the backbone, while the latter is a data-driven regulation mechanism that adaptively adjusts the enhancement intensity via learning the input. Then, the ME component separates high- and low-frequency knowledge via high-level branch (HB) and common unit (CU) and further adjusts the improvement intensity of multifrequency cues via the learning of LWS2 to achieve intramodal refinement. Moreover, the MI part first extracts high- and low-frequency signals via HB and low-level branch (LB) and implements cross-modal integration of high- and low-frequency cues through LWS3, respectively. Extensive experimental results on GTOT, RGBT234, and LasHeR demonstrate that the proposed LMINet is effective and competitive with state-of-the-art algorithms. The code will be open-sourced at https://github.com/mjt1312/Lminet.
引用
收藏
页码:15517 / 15530
页数:14
相关论文
共 50 条
  • [41] External-attention dual-modality fusion network for RGBT tracking
    Kaixiang Yan
    Jiatian Mei
    Dongming Zhou
    Lifen Zhou
    The Journal of Supercomputing, 2023, 79 : 17020 - 17041
  • [42] RGBT Tracking via Multi-Adapter Network with Hierarchical Divergence Loss
    Lu, Andong
    Li, Chenglong
    Yan, Yuqing
    Tang, Jin
    Luo, Bin
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 5613 - 5625
  • [43] MTNet: Learning Modality-aware Representation with Transformer for RGBT Tracking
    Hou, Ruichao
    Xu, Boyue
    Ren, Tongwei
    Wu, Gangshan
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 1163 - 1168
  • [44] Siamese transformer RGBT tracking
    Futian Wang
    Wenqi Wang
    Lei Liu
    Chenglong Li
    Jing Tang
    Applied Intelligence, 2023, 53 : 24709 - 24723
  • [45] RGBT tracking: A comprehensive review
    Feng, Mingzheng
    Su, Jianbo
    INFORMATION FUSION, 2024, 110
  • [46] Temporal Aggregation for Adaptive RGBT Tracking
    Tang, Zhangyong
    Xu, Tianyang
    Wu, Xiao-Jun
    arXiv, 2022,
  • [47] External-attention dual-modality fusion network for RGBT tracking
    Yan, Kaixiang
    Mei, Jiatian
    Zhou, Dongming
    Zhou, Lifen
    JOURNAL OF SUPERCOMPUTING, 2023, 79 (15): : 17020 - 17041
  • [48] RGBT Image Fusion Tracking via Sparse Trifurcate Transformer Aggregation Network
    Feng, Mingzheng
    Su, Jianbo
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 : 1 - 10
  • [49] SiamCAF: Complementary Attention Fusion-Based Siamese Network for RGBT Tracking
    Xue, Yingjian
    Zhang, Jianwei
    Lin, Zhoujin
    Li, Chenglong
    Huo, Bihan
    Zhang, Yan
    REMOTE SENSING, 2023, 15 (13)
  • [50] Multiple frequency-spatial network for RGBT tracking in the presence of motion blur
    Fan, Shenghua
    Chen, Xi
    He, Chu
    Yu, Lei
    Mao, Zhongjie
    Zheng, Yujin
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (34): : 24389 - 24406