Region Selective Fusion Network for Robust RGB-T Tracking

被引:7
|
作者
Yu, Zhencheng [1 ,2 ,3 ]
Fan, Huijie [1 ,2 ]
Wang, Qiang [4 ]
Li, Ziwan [1 ,5 ]
Tang, Yandong [1 ,2 ]
机构
[1] Chinese Acad Sci, Shenyang Inst Automat, State Key Lab Robot, Shenyang 110016, Peoples R China
[2] Chinese Acad Sci, Inst Robot & Intelligent Mfg, Shenyang 110169, Peoples R China
[3] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[4] Shenyang Univ, Key Lab Mfg Ind Integrated, Shenyang 110096, Peoples R China
[5] Shenyang Univ Chem Technol, Sch Informat Engn, Shenyang 110142, Peoples R China
基金
中国国家自然科学基金;
关键词
Target tracking; Feature extraction; Reliability; Mobile computing; Ad hoc networks; Head; Visualization; Deep visual tracking; neural networks; visible-infrared fusion; vision transformer;
D O I
10.1109/LSP.2023.3316021
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
RGB-T tracking utilizes thermal infrared images as a complement to visible light images in order to perform more robust visual tracking in various scenarios. However, the highly aligned RGB-T image pairs introduces redundant information, the modal quality fluctuation during tracking also brings unreliable information. Existing RGB-T trackers usually use channel-wise multi-modal feature fusion in which the low-quality features degrades the fused features and causes trackers to drift. In this work, we propose a region selective fusion network that first evaluates each image region by cross-modal and cross-region modeling, then removes low-quality redundant region features to alleviate the negative effects caused by unreliable information in multi-modal fusion. Besides, the region removal scheme brings a efficiency boost as redundant features are removed progressively, this enables the tracker to run at a high tracking speed. Extensive experiments show that the proposed tracker achieves competitive performance with a real-time tracking speed on multiple RGB-T tracking benchmarks including LasHeR, RGBT234 and GTOT.
引用
收藏
页码:1357 / 1361
页数:5
相关论文
共 50 条
  • [21] RGB-T tracking with frequency hybrid awareness
    Lei, Lei
    Li, Xianxian
    IMAGE AND VISION COMPUTING, 2024, 152
  • [22] Toward Modalities Correlation for RGB-T Tracking
    Hu, Xiantao
    Zhong, Bineng
    Liang, Qihua
    Zhang, Shengping
    Li, Ning
    Li, Xianxian
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (10) : 9102 - 9111
  • [23] DaCFN: divide-and-conquer fusion network for RGB-T object detection
    Wang, Bofan
    Zhao, Haitao
    Zhuang, Yi
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (07) : 2407 - 2420
  • [24] Unified Single- Stage Transformer Network for Efficient RGB-T Tracking
    Xia, Jianqiang
    Shi, Dianxi
    Song, Ke
    Song, Linna
    Wang, Xiaolei
    Jin, Songchang
    Zhao, Chenran
    Cheng, Yu
    Jin, Lei
    Zhu, Zheng
    Li, Jianan
    Wang, Gang
    Xing, Junliang
    Zhao, Jian
    PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 1471 - 1479
  • [25] FADSiamNet: feature affinity drift siamese network for RGB-T target tracking
    Li, Haiyan
    Cao, Yonghui
    Guo, Lei
    Chen, Quan
    Ding, Zhaisheng
    Xie, Shidong
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, : 2779 - 2799
  • [26] Anchor free based Siamese network tracker with transformer for RGB-T tracking
    Fan, Liangsong
    Kim, Pyeoungkee
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [27] Anchor free based Siamese network tracker with transformer for RGB-T tracking
    Liangsong Fan
    Pyeoungkee Kim
    Scientific Reports, 13
  • [28] Weighted Guided Optional Fusion Network for RGB-T Salient Object Detection
    Wang, Jie
    Li, Guoqiang
    Shi, Jie
    Xi, Jinwen
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (05)
  • [29] Learning Multi-domain Convolutional Network for RGB-T Visual Tracking
    Zhang, Xingming
    Zhang, Xuehan
    Du, Xuedan
    Zhou, Xiangming
    Yin, Jun
    2018 11TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2018), 2018,
  • [30] Unsupervised RGB-T object tracking with attentional multi-modal feature fusion
    Shenglan Li
    Rui Yao
    Yong Zhou
    Hancheng Zhu
    Bing Liu
    Jiaqi Zhao
    Zhiwen Shao
    Multimedia Tools and Applications, 2023, 82 : 23595 - 23613