Learning Multifrequency Integration Network for RGBT Tracking

被引:5
|
作者
Mei, Jiatian [1 ]
Zhou, Juxiang [1 ]
Wang, Jun [1 ]
Hao, Jia [1 ]
Zhou, Dongming [2 ]
Cao, Jinde [3 ,4 ]
机构
[1] Yunnan Normal Univ, Yunnan Key Lab Smart Educ, Key Lab Educ Informat Nationalities, Minist Educ, Kunming 650500, Yunnan, Peoples R China
[2] Yunnan Univ, Sch Informat Sci & Engn, Kunming 650091, Yunnan, Peoples R China
[3] Southeast Univ, Sch Math, Nanjing 211189, Peoples R China
[4] Ahlia Univ, Manama 10878, Bahrain
基金
中国国家自然科学基金;
关键词
Intermodal; intramodal; modal heterogeneity; multifrequency integration (MI); RGBT tracking; FUSION;
D O I
10.1109/JSEN.2024.3370144
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
RGBT tracking is an attractive topic that benefits from the complementarity of visible and thermal sensors to better handle tracking tasks in atrocious scenarios. Existing RGBT trackers typically introduce self-attention (SA) to capture long-range dependencies. However, recent findings suggest that SA is a low-pass filter, meaning that high-frequency clues involving local edges and texture may be repressed. Aiming at the problem, this article comprehensively considers the multifrequency knowledge of heterogeneous modalities and proposes a learning multifrequency integration network (LMINet) for RGBT tracking to effectively implement adaptive extraction, enhancement, and integration of multifrequency cues. The proposed LMINet primarily benefits from the deployment of three crucial components: pattern-aware reinforcement (PR), multifrequency enhancement (ME), and MI. Specifically, the PR part consists of a carefully designed reinforcement unit (RU) and learnable weighting strategy 1 (LWS1). The former extracts information from the data flow to enhance the backbone, while the latter is a data-driven regulation mechanism that adaptively adjusts the enhancement intensity via learning the input. Then, the ME component separates high- and low-frequency knowledge via high-level branch (HB) and common unit (CU) and further adjusts the improvement intensity of multifrequency cues via the learning of LWS2 to achieve intramodal refinement. Moreover, the MI part first extracts high- and low-frequency signals via HB and low-level branch (LB) and implements cross-modal integration of high- and low-frequency cues through LWS3, respectively. Extensive experimental results on GTOT, RGBT234, and LasHeR demonstrate that the proposed LMINet is effective and competitive with state-of-the-art algorithms. The code will be open-sourced at https://github.com/mjt1312/Lminet.
引用
收藏
页码:15517 / 15530
页数:14
相关论文
共 50 条
  • [31] Drone Based RGBT Tracking with Dual-Feature Aggregation Network
    Gao, Zhinan
    Li, Dongdong
    Wen, Gongjian
    Kuai, Yangliu
    Chen, Rui
    DRONES, 2023, 7 (09)
  • [32] HDINet: Hierarchical Dual-Sensor Interaction Network for RGBT Tracking
    Mei, Jiatian
    Zhou, Dongming
    Cao, Jinde
    Nie, Rencan
    Guo, Yanbu
    IEEE SENSORS JOURNAL, 2021, 21 (15) : 16915 - 16926
  • [33] A Comprehensive Review of RGBT Tracking
    Zhang, Haiping
    Yuan, Di
    Shu, Xiu
    Li, Zhihui
    Liu, Qiao
    Chang, Xiaojun
    He, Zhenyu
    Shi, Guangming
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73
  • [34] QueryTrack: Joint-Modality Query Fusion Network for RGBT Tracking
    Fan, Huijie
    Yu, Zhencheng
    Wang, Qiang
    Fan, Baojie
    Tang, Yandong
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 3187 - 3199
  • [35] RGBT Tracking via Multi-stage Matching Guidance and Context integration
    Yan, Kaixiang
    Wang, Changcheng
    Zhou, Dongming
    Zhou, Ziwei
    NEURAL PROCESSING LETTERS, 2023, 55 (08) : 11073 - 11087
  • [36] Learning modality feature fusion via transformer for RGBT-tracking
    Cai, Yujue
    Sui, Xiubao
    Gu, Guohua
    Chen, Qian
    INFRARED PHYSICS & TECHNOLOGY, 2023, 133
  • [37] RGBT Tracking via Progressive Fusion Transformer With Dynamically Guided Learning
    Zhu, Yabin
    Li, Chenglong
    Wang, Xiao
    Tang, Jin
    Huang, Zhixiang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (09) : 8722 - 8735
  • [38] Quality-Aware Feature Aggregation Network for Robust RGBT Tracking
    Zhu, Yabin
    Li, Chenglong
    Tang, Jin
    Luo, Bin
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2021, 6 (01): : 121 - 130
  • [39] Multi-Scale Feature Interactive Fusion Network for RGBT Tracking
    Xiao, Xianbing
    Xiong, Xingzhong
    Meng, Fanqin
    Chen, Zhen
    SENSORS, 2023, 23 (07)
  • [40] Dual-Modality Space-Time Memory Network for RGBT Tracking
    Zhang, Fan
    Peng, Hanwei
    Yu, Lingli
    Zhao, Yuqian
    Chen, Baifan
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72