Revisiting Feature Fusion for RGB-T Salient Object Detection

被引:106
|
作者
Zhang, Qiang [1 ,2 ]
Xiao, Tonglin [2 ]
Huang, Nianchang [2 ]
Zhang, Dingwen [2 ]
Han, Jungong [3 ]
机构
[1] Xidian Univ, Minist Educ, Key Lab Elect Equipment Struct Design, Xian 710071, Peoples R China
[2] Xidian Univ, Sch Mechanoelect Engn, Ctr Complex Syst, Xian 710071, Peoples R China
[3] Aberystwyth Univ, Comp Sci Dept, Aberystwyth SY23 3FL, Dyfed, Wales
基金
中国国家自然科学基金;
关键词
Object detection; Feature extraction; Saliency detection; Computational modeling; Semantics; Lighting; Task analysis; Salient object detection; RGB-T; multi-scale; multi-modality; multi-level; feature fusion; SEGMENTATION; NETWORK; MODEL;
D O I
10.1109/TCSVT.2020.3014663
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
While many RGB-based saliency detection algorithms have recently shown the capability of segmenting salient objects from an image, they still suffer from unsatisfactory performance when dealing with complex scenarios, insufficient illumination or occluded appearances. To overcome this problem, this article studies RGB-T saliency detection, where we take advantage of thermal modality's robustness against illumination and occlusion. To achieve this goal, we revisit feature fusion for mining intrinsic RGB-T saliency patterns and propose a novel deep feature fusion network, which consists of the multi-scale, multi-modality, and multi-level feature fusion modules. Specifically, the multi-scale feature fusion module captures rich contexture features from each modality feature, while the multi-modality and multi-level feature fusion modules integrate complementary features from different modality features and different level of features, respectively. To demonstrate the effectiveness of the proposed approach, we conduct comprehensive experiments on the RGB-T saliency detection benchmark. The experimental results demonstrate that our approach outperforms other state-of-the-art methods and the conventional feature fusion modules by a large margin.
引用
收藏
页码:1804 / 1818
页数:15
相关论文
共 50 条
  • [41] Leveraging modality-specific and shared features for RGB-T salient object detection
    Wang, Shuo
    Yang, Gang
    Xu, Qiqi
    Dai, Xun
    IET COMPUTER VISION, 2024, 18 (08) : 1285 - 1299
  • [42] Asymmetric cross-modal activation network for RGB-T salient object detection
    Xu, Chang
    Li, Qingwu
    Zhou, Qingkai
    Jiang, Xiongbiao
    Yu, Dabing
    Zhou, Yaqin
    KNOWLEDGE-BASED SYSTEMS, 2022, 258
  • [43] DaCFN: divide-and-conquer fusion network for RGB-T object detection
    Wang, Bofan
    Zhao, Haitao
    Zhuang, Yi
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (07) : 2407 - 2420
  • [44] Unsupervised RGB-T object tracking with attentional multi-modal feature fusion
    Shenglan Li
    Rui Yao
    Yong Zhou
    Hancheng Zhu
    Bing Liu
    Jiaqi Zhao
    Zhiwen Shao
    Multimedia Tools and Applications, 2023, 82 : 23595 - 23613
  • [45] RGB-T Object Detection With Failure Scenarios
    Wang, Qingwang
    Sun, Yuxuan
    Chi, Yongke
    Shen, Tao
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2025, 18 : 3000 - 3010
  • [46] EAF-Net: an enhancement and aggregation–feedback network for RGB-T salient object detection
    Haiyang He
    Jing Wang
    Xiaolin Li
    Minglin Hong
    Shiguo Huang
    Tao Zhou
    Machine Vision and Applications, 2022, 33
  • [47] Unsupervised RGB-T object tracking with attentional multi-modal feature fusion
    Li, Shenglan
    Yao, Rui
    Zhou, Yong
    Zhu, Hancheng
    Liu, Bing
    Zhao, Jiaqi
    Shao, Zhiwen
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (15) : 23595 - 23613
  • [48] RGB-T Salient Object Detection via Fusing Multi-Level CNN Features
    Zhang, Qiang
    Huang, Nianchang
    Yao, Lin
    Zhang, Dingwen
    Shan, Caifeng
    Han, Jungong
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 3321 - 3335
  • [49] DaCFN: divide-and-conquer fusion network for RGB-T object detection
    Bofan Wang
    Haitao Zhao
    Yi Zhuang
    International Journal of Machine Learning and Cybernetics, 2023, 14 : 2407 - 2420
  • [50] Efficient Context-Guided Stacked Refinement Network for RGB-T Salient Object Detection
    Huo, Fushuo
    Zhu, Xuegui
    Zhang, Lei
    Liu, Qifeng
    Shu, Yu
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (05) : 3111 - 3124