Learning Multifrequency Integration Network for RGBT Tracking

被引:5
|
作者
Mei, Jiatian [1 ]
Zhou, Juxiang [1 ]
Wang, Jun [1 ]
Hao, Jia [1 ]
Zhou, Dongming [2 ]
Cao, Jinde [3 ,4 ]
机构
[1] Yunnan Normal Univ, Yunnan Key Lab Smart Educ, Key Lab Educ Informat Nationalities, Minist Educ, Kunming 650500, Yunnan, Peoples R China
[2] Yunnan Univ, Sch Informat Sci & Engn, Kunming 650091, Yunnan, Peoples R China
[3] Southeast Univ, Sch Math, Nanjing 211189, Peoples R China
[4] Ahlia Univ, Manama 10878, Bahrain
基金
中国国家自然科学基金;
关键词
Intermodal; intramodal; modal heterogeneity; multifrequency integration (MI); RGBT tracking; FUSION;
D O I
10.1109/JSEN.2024.3370144
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
RGBT tracking is an attractive topic that benefits from the complementarity of visible and thermal sensors to better handle tracking tasks in atrocious scenarios. Existing RGBT trackers typically introduce self-attention (SA) to capture long-range dependencies. However, recent findings suggest that SA is a low-pass filter, meaning that high-frequency clues involving local edges and texture may be repressed. Aiming at the problem, this article comprehensively considers the multifrequency knowledge of heterogeneous modalities and proposes a learning multifrequency integration network (LMINet) for RGBT tracking to effectively implement adaptive extraction, enhancement, and integration of multifrequency cues. The proposed LMINet primarily benefits from the deployment of three crucial components: pattern-aware reinforcement (PR), multifrequency enhancement (ME), and MI. Specifically, the PR part consists of a carefully designed reinforcement unit (RU) and learnable weighting strategy 1 (LWS1). The former extracts information from the data flow to enhance the backbone, while the latter is a data-driven regulation mechanism that adaptively adjusts the enhancement intensity via learning the input. Then, the ME component separates high- and low-frequency knowledge via high-level branch (HB) and common unit (CU) and further adjusts the improvement intensity of multifrequency cues via the learning of LWS2 to achieve intramodal refinement. Moreover, the MI part first extracts high- and low-frequency signals via HB and low-level branch (LB) and implements cross-modal integration of high- and low-frequency cues through LWS3, respectively. Extensive experimental results on GTOT, RGBT234, and LasHeR demonstrate that the proposed LMINet is effective and competitive with state-of-the-art algorithms. The code will be open-sourced at https://github.com/mjt1312/Lminet.
引用
收藏
页码:15517 / 15530
页数:14
相关论文
共 50 条
  • [1] Asymmetric Global-Local Mutual Integration Network for RGBT Tracking
    Mei, Jiatian
    Liu, Yanyu
    Wang, Changcheng
    Zhou, Dongming
    Nie, Rencan
    Cao, Jinde
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
  • [2] Fusion Tree Network for RGBT Tracking
    Cheng, Zhiyuan
    Lu, Andong
    Zhang, Zhang
    Li, Chenglong
    Wang, Liang
    2022 18TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS 2022), 2022,
  • [3] RGBT Tracking by Trident Fusion Network
    Zhu, Yabin
    Li, Chenglong
    Tang, Jin
    Luo, Bin
    Wang, Liang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (02) : 579 - 592
  • [4] Dynamic Fusion Network for RGBT Tracking
    Peng, Jingchao
    Zhao, Haitao
    Hu, Zhengwei
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (04) : 3822 - 3832
  • [5] Deep Triply Attention Network for RGBT Tracking
    Yang, Rui
    Wang, Xiao
    Zhu, Yabin
    Tang, Jin
    COGNITIVE COMPUTATION, 2023, 15 (06) : 1934 - 1946
  • [6] Multibranch Adaptive Fusion Network for RGBT Tracking
    Li, Yadong
    Lai, Huicheng
    Wang, Liejun
    Jia, Zhenhong
    IEEE SENSORS JOURNAL, 2022, 22 (07) : 7084 - 7093
  • [7] Deep Triply Attention Network for RGBT Tracking
    Rui Yang
    Xiao Wang
    Yabin Zhu
    Jin Tang
    Cognitive Computation, 2023, 15 : 1934 - 1946
  • [8] RMFNet: Redetection Multimodal Fusion Network for RGBT Tracking
    Zhao, Yanjie
    Lai, Huicheng
    Gao, Guxue
    APPLIED SCIENCES-BASEL, 2023, 13 (09):
  • [9] Learning Multi-Layer Attention Aggregation Siamese Network for Robust RGBT Tracking
    Feng, Mingzheng
    Su, Jianbo
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 3378 - 3391
  • [10] Learning a multimodal feature transformer for RGBT tracking
    Shi, Huiwei
    Mu, Xiaodong
    Shen, Danyao
    Zhong, Chengliang
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (SUPPL 1) : 239 - 250