Does Thermal Really Always Matter for RGB-T Salient Object Detection?

被引:46
|
作者
Cong, Runmin [1 ,2 ,3 ]
Zhang, Kepu [1 ,2 ]
Zhang, Chen [1 ,2 ]
Zheng, Feng [4 ,5 ]
Zhao, Yao [1 ,2 ]
Huang, Qingming [6 ,7 ,8 ]
Kwong, Sam [3 ,9 ]
机构
[1] Beijing Jiaotong Univ, Inst Informat Sci, Beijing 100044, Peoples R China
[2] Network Technol, Beijing Key Lab Adv Informat Sci, Beijing 100044, Peoples R China
[3] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[4] Southern Univ Sci & Technol, Dept Comp Sci & Technol, Shenzhen 518055, Peoples R China
[5] Res Inst Trustworthy Autonomous Syst, Shenzhen 518055, Peoples R China
[6] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing 101408, Peoples R China
[7] Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Informat Proc, Beijing 100190, Peoples R China
[8] Peng Cheng Lab, Shenzhen 518055, Peoples R China
[9] City Univ Hong Kong, Shenzhen Res Inst, Shenzhen 51800, Peoples R China
基金
北京市自然科学基金; 国家重点研发计划; 中国国家自然科学基金;
关键词
Task analysis; Decoding; Semantics; Object detection; Location awareness; Lighting; Feature extraction; RGB-T images; salient object detection; global illumination estimation; semantic constraint provider; localization and complementation; FUSION NETWORK;
D O I
10.1109/TMM.2022.3216476
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, RGB-T salient object detection (SOD) has attracted continuous attention, which makes it possible to identify salient objects in environments such as low light by introducing thermal image. However, most of the existing RGB-T SOD models focus on how to perform cross-modality feature fusion, ignoring whether thermal image is really always matter in SOD task. Starting from the definition and nature of this task, this paper rethinks the connotation of thermal modality, and proposes a network named TNet to solve the RGB-T SOD task. In this paper, we introduce a global illumination estimation module to predict the global illuminance score of the image, so as to regulate the role played by the two modalities. In addition, considering the role of thermal modality, we set up different cross-modality interaction mechanisms in the encoding phase and the decoding phase. On the one hand, we introduce a semantic constraint provider to enrich the semantics of thermal images in the encoding phase, which makes thermal modality more suitable for the SOD task. On the other hand, we introduce a two-stage localization and complementation module in the decoding phase to transfer object localization cue and internal integrity cue in thermal features to the RGB modality. Extensive experiments on three datasets show that the proposed TNet achieves competitive performance compared with 20 state-of-the-art methods.
引用
收藏
页码:6971 / 6982
页数:12
相关论文
共 50 条
  • [41] UMINet: a unified multi-modality interaction network for RGB-D and RGB-T salient object detection
    Lina Gao
    Ping Fu
    Mingzhu Xu
    Tiantian Wang
    Bing Liu
    The Visual Computer, 2024, 40 : 1565 - 1582
  • [42] SwinNet: Swin Transformer Drives Edge-Aware RGB-D and RGB-T Salient Object Detection
    Liu, Zhengyi
    Tan, Yacheng
    He, Qian
    Xiao, Yun
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (07) : 4486 - 4497
  • [43] CAFCNet: Cross-modality asymmetric feature complement network for RGB-T salient object detection
    Jin, Dongze
    Shao, Feng
    Xie, Zhengxuan
    Mu, Baoyang
    Chen, Hangwei
    Jiang, Qiuping
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 247
  • [44] Wavelet-Driven Multi-Band Feature Fusion for RGB-T Salient Object Detection
    Zhao, Jianxun
    Wen, Xin
    He, Yu
    Yang, Xiaowei
    Song, Kechen
    Sensors, 2024, 24 (24)
  • [45] Thermal images-aware guided early fusion network for cross-illumination RGB-T salient object detection
    Wang, Han
    Song, Kechen
    Huang, Liming
    Wen, Hongwei
    Yan, Yunhui
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 118
  • [46] EDGE-Net: an edge-guided enhanced network for RGB-T salient object detection
    Zheng, Xin
    Wang, Boyang
    Ai, Liefu
    Tang, Pan
    Liu, Deyang
    JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (06)
  • [47] Intra-Modality Self-Enhancement Mirror Network for RGB-T Salient Object Detection
    Wang, Jie
    Li, Guoqiang
    Yu, Hongjie
    Xi, Jinwen
    Shi, Jie
    Wu, Xueying
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (03) : 2513 - 2525
  • [48] Modality Registration and Object Search Framework for UAV-Based Unregistered RGB-T Image Salient Object Detection
    Song, Kechen
    Wen, Hongwei
    Xue, Xiaotong
    Huang, Liming
    Ji, Yingying
    Yan, Yunhui
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61 : 1 - 15
  • [49] Lightweight Cross-Modal Information Mutual Reinforcement Network for RGB-T Salient Object Detection
    Lv, Chengtao
    Wan, Bin
    Zhou, Xiaofei
    Sun, Yaoqi
    Zhang, Jiyong
    Yan, Chenggang
    ENTROPY, 2024, 26 (02)
  • [50] Cross-Modality Double Bidirectional Interaction and Fusion Network for RGB-T Salient Object Detection
    Xie, Zhengxuan
    Shao, Feng
    Chen, Gang
    Chen, Hangwei
    Jiang, Qiuping
    Meng, Xiangchao
    Ho, Yo-Sung
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (08) : 4149 - 4163