Learning Multifrequency Integration Network for RGBT Tracking

被引:5
|
作者
Mei, Jiatian [1 ]
Zhou, Juxiang [1 ]
Wang, Jun [1 ]
Hao, Jia [1 ]
Zhou, Dongming [2 ]
Cao, Jinde [3 ,4 ]
机构
[1] Yunnan Normal Univ, Yunnan Key Lab Smart Educ, Key Lab Educ Informat Nationalities, Minist Educ, Kunming 650500, Yunnan, Peoples R China
[2] Yunnan Univ, Sch Informat Sci & Engn, Kunming 650091, Yunnan, Peoples R China
[3] Southeast Univ, Sch Math, Nanjing 211189, Peoples R China
[4] Ahlia Univ, Manama 10878, Bahrain
基金
中国国家自然科学基金;
关键词
Intermodal; intramodal; modal heterogeneity; multifrequency integration (MI); RGBT tracking; FUSION;
D O I
10.1109/JSEN.2024.3370144
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
RGBT tracking is an attractive topic that benefits from the complementarity of visible and thermal sensors to better handle tracking tasks in atrocious scenarios. Existing RGBT trackers typically introduce self-attention (SA) to capture long-range dependencies. However, recent findings suggest that SA is a low-pass filter, meaning that high-frequency clues involving local edges and texture may be repressed. Aiming at the problem, this article comprehensively considers the multifrequency knowledge of heterogeneous modalities and proposes a learning multifrequency integration network (LMINet) for RGBT tracking to effectively implement adaptive extraction, enhancement, and integration of multifrequency cues. The proposed LMINet primarily benefits from the deployment of three crucial components: pattern-aware reinforcement (PR), multifrequency enhancement (ME), and MI. Specifically, the PR part consists of a carefully designed reinforcement unit (RU) and learnable weighting strategy 1 (LWS1). The former extracts information from the data flow to enhance the backbone, while the latter is a data-driven regulation mechanism that adaptively adjusts the enhancement intensity via learning the input. Then, the ME component separates high- and low-frequency knowledge via high-level branch (HB) and common unit (CU) and further adjusts the improvement intensity of multifrequency cues via the learning of LWS2 to achieve intramodal refinement. Moreover, the MI part first extracts high- and low-frequency signals via HB and low-level branch (LB) and implements cross-modal integration of high- and low-frequency cues through LWS3, respectively. Extensive experimental results on GTOT, RGBT234, and LasHeR demonstrate that the proposed LMINet is effective and competitive with state-of-the-art algorithms. The code will be open-sourced at https://github.com/mjt1312/Lminet.
引用
收藏
页码:15517 / 15530
页数:14
相关论文
共 50 条
  • [21] Attribute-Based Progressive Fusion Network for RGBT Tracking
    Xiao, Yun
    Yang, Mengmeng
    Li, Chenglong
    Liu, Lei
    Tang, Jin
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 2831 - 2838
  • [22] Learning reliable modal weight with transformer for robust RGBT tracking
    Feng, Mingzheng
    Su, Jianbo
    KNOWLEDGE-BASED SYSTEMS, 2022, 249
  • [23] HATFNet: Hierarchical adaptive trident fusion network for RGBT tracking
    Zhao, Yanjie
    Lai, Huicheng
    Gao, Guxue
    APPLIED INTELLIGENCE, 2023, 53 (20) : 24187 - 24201
  • [24] Duality-Gated Mutual Condition Network for RGBT Tracking
    Lu, Andong
    Qian, Cun
    Li, Chenglong
    Tang, Jin
    Wang, Liang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022,
  • [25] HATFNet: Hierarchical adaptive trident fusion network for RGBT tracking
    Yanjie Zhao
    Huicheng Lai
    Guxue Gao
    Applied Intelligence, 2023, 53 : 24187 - 24201
  • [26] Deep Adaptive Fusion Network for High Performance RGBT Tracking
    Gao, Yuan
    Li, Chenglong
    Zhu, Yabin
    Tang, Jin
    He, Tao
    Wang, Futian
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 91 - 99
  • [27] RGBT dual-modal Siamese tracking network with feature fusion
    Shen Y.
    Hongwai yu Jiguang Gongcheng/Infrared and Laser Engineering, 2021, 50 (03):
  • [28] Multiple frequency–spatial network for RGBT tracking in the presence of motion blur
    Shenghua Fan
    Xi Chen
    Chu He
    Lei Yu
    Zhongjie Mao
    Yujin Zheng
    Neural Computing and Applications, 2023, 35 : 24389 - 24406
  • [29] Siamese transformer RGBT tracking
    Wang, Futian
    Wang, Wenqi
    Liu, Lei
    Li, Chenglong
    Tang, Jing
    APPLIED INTELLIGENCE, 2023, 53 (21) : 24709 - 24723
  • [30] RGBT Tracking via Multi-stage Matching Guidance and Context integration
    Kaixiang Yan
    Changcheng Wang
    Dongming Zhou
    Ziwei Zhou
    Neural Processing Letters, 2023, 55 : 11073 - 11087