Multi-Level Fusion for Robust RGBT Tracking via Enhanced Thermal Representation

Cited by: 1
Authors
Tang, Zhangyong [1]
Xu, Tianyang [1]
Wu, Xiao-jun [1]
Kittler, Josef [2]
Affiliations
[1] Jiangnan Univ, Sch Artificial Intelligence & Comp Sci, Wuxi, Peoples R China
[2] Univ Surrey, Ctr Vis Speech & Signal Proc, Guildford, England
Funding
National Natural Science Foundation of China; Engineering and Physical Sciences Research Council (UK);
Keywords
Visual object tracking; RGBT tracking; thermal enhancement; multi-modal multi-level fusion; BENCHMARK;
DOI
10.1145/3678176
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Due to the limitations of visible (RGB) sensors in challenging scenarios, such as nighttime and foggy environments, the thermal infrared (TIR) modality is drawing increasing attention as an auxiliary source for robust tracking systems. Existing methods extract RGB and TIR (RGBT) cues in a similar way, i.e., utilising RGB-pretrained models with or without finetuning, and then aggregate the multi-modal information through a fusion block embedded at a single level. However, the different imaging principles of RGB and TIR data raise questions about the suitability of RGB-pretrained models for thermal data. This article argues that the modality gap has been overlooked and proposes an alternative training paradigm for TIR data that ensures consistency between the training and test data by optimising the TIR feature extractor with TIR data only. Furthermore, to make better use of the enhanced thermal representations, a multi-level fusion strategy is proposed, motivated by the observation that fusion strategies at different levels can each contribute to better performance. Specifically, fusion modules at both the feature and decision levels are derived for a comprehensive fusion procedure, while pixel-level fusion is not considered due to the misalignment of multi-modal image pairs. The effectiveness of the method is demonstrated by extensive qualitative and quantitative experiments conducted on several challenging benchmarks. Code will be released at https://github.com/Zhangyong-Tang/MELT.
Pages: 24
Related papers
50 in total
  • [31] Traffic density estimation via a multi-level feature fusion network
    Ying-Xiang Hu
    Rui-Sheng Jia
    Yong-Chao Li
    Qi Zhang
    Hong-Mei Sun
    Applied Intelligence, 2022, 52: 10417-10429
  • [32] Multi-Cue Visual Tracking Using Robust Feature-Level Fusion Based on Joint Sparse Representation
    Lan, Xiangyuan
    Ma, Andy J.
    Yuen, Pong C.
    2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014: 1194-1201
  • [33] A Multi-Level Eigenvalue Fusion Algorithm for 3D Multi-Object Tracking
    Liu, Hantao
    Hu, Jianming
    Li, Xingyu
    Peng, Lihui
    INTERNATIONAL CONFERENCE ON TRANSPORTATION AND DEVELOPMENT 2022: APPLICATION OF EMERGING TECHNOLOGIES, 2022: 235-245
  • [34] SiamMMF: multi-modal multi-level fusion object tracking based on Siamese networks
    Zhen Yang
    Peng Huang
    Dunyun He
    Zhongwang Cai
    Zhijian Yin
    Machine Vision and Applications, 2023, 34
  • [35] SiamMMF: multi-modal multi-level fusion object tracking based on Siamese networks
    Yang, Zhen
    Huang, Peng
    He, Dunyun
    Cai, Zhongwang
    Yin, Zhijian
    MACHINE VISION AND APPLICATIONS, 2023, 34 (01)
  • [36] Multi-level modelling via stochastic multi-level multiset rewriting
    Oury, Nicolas
    Plotkin, Gordon
    MATHEMATICAL STRUCTURES IN COMPUTER SCIENCE, 2013, 23 (02): 471-503
  • [37] RGBT Tracking via Multi-Adapter Network with Hierarchical Divergence Loss
    Lu, Andong
    Li, Chenglong
    Yan, Yuqing
    Tang, Jin
    Luo, Bin
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30: 5613-5625
  • [38] Multi-level data fusion method
    Lan, JH
    Ma, BH
    Zhou, ZY
    ISTM/2001: 4TH INTERNATIONAL SYMPOSIUM ON TEST AND MEASUREMENT, VOLS 1 AND 2, CONFERENCE PROCEEDINGS, 2001: 235-238
  • [39] An Ontology for Multi-Level Data Fusion
    Steinberg, Alan N.
    2022 25TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION 2022), 2022
  • [40] SiamMGT: robust RGBT tracking via graph attention and reliable modality weight learning
    Geng, Lizhi
    Zhou, Dongming
    Wang, Kerui
    Liu, Yisong
    Yan, Kaixiang
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (18): 25888-25910