A Feature Divide-and-Conquer Network for RGB-T Semantic Segmentation

被引:22
|
作者
Zhao, Shenlu [1 ,2 ]
Zhang, Qiang [1 ,2 ]
机构
[1] Xidian Univ, Key Lab Elect Equipment Struct Design, Minist Educ, Xian 710071, Shaanxi, Peoples R China
[2] Xidian Univ, Ctr Complex Syst, Sch Mechanoelect Engn, Xian 710071, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Semantic segmentation; Data mining; Semantics; Lighting; Decoding; Thermal sensors; RGB-T semantic segmentation; feature divide-and-conquer strategy; multi-scale contextual information;
D O I
10.1109/TCSVT.2022.3229359
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Similar to other multi-modal pixel-level prediction tasks, existing RGB-T semantic segmentation methods usually employ a two-stream structure to extract RGB and thermal infrared (TIR) features, respectively, and adopt the same fusion strategies to integrate different levels of unimodal features. This will result in inadequate extraction of unimodal features and exploitation of cross-modal information from the paired RGB and TIR images. Alternatively, in this paper, we present a novel RGB-T semantic segmentation model, i.e., FDCNet, where a feature divide-and-conquer strategy performs unimodal feature extraction and cross-modal feature fusion in one go. Concretely, we first employ a two-stream structure to extract unimodal low-level features, followed by a Siamese structure to extract unimodal high-level features from the paired RGB and TIR images. This concise but efficient structure enables to take into account both the modality discrepancies of low-level features and the underlying semantic consistency of high-level features across the paired RGB and TIR images. Furthermore, considering the characteristics of different layers of features, a Cross-modal Spatial Activation (CSA) module and a Cross-modal Channel Activation (CCA) module are presented for the fusion of low-level RGB and TIR features and for the fusion of high-level RGB and TIR features, respectively, thus facilitating the capture of cross-modal information. On top of that, with an embedded Cross-scale Interaction Context (CIC) module for mining multi-scale contextual information, our proposed model (i.e., FDCNet) for RGB-T semantic segmentation achieves new state-of-the-art experimental results on MFNet dataset and PST900 dataset.
引用
收藏
页码:2892 / 2905
页数:14
相关论文
共 50 条
  • [41] Cascaded Feature Network for Semantic Segmentation of RGB-D Images
    Lin, Di
    Chen, Guangyong
    Cohen-Or, Daniel
    Heng, Pheng-Ann
    Huang, Hui
    Proceedings of the IEEE International Conference on Computer Vision, 2017, 2017-October : 1320 - 1328
  • [42] Region-adaptive and context-complementary cross modulation for RGB-T semantic segmentation
    Peng, Fengguang
    Ding, Zihan
    Chen, Ziming
    Wang, Gang
    Hui, Tianrui
    Liu, Si
    Shi, Hang
    PATTERN RECOGNITION, 2024, 147
  • [43] Retinal vessel segmentation by a divide-and-conquer funnel-structured classification framework
    Wang, Xiaohong
    Jiang, Xudong
    SIGNAL PROCESSING, 2019, 165 : 104 - 114
  • [44] DCSegNet: Deep Learning Framework Based on Divide-and-Conquer Method for Liver Segmentation
    Li, Congsheng
    Yao, Guorong
    Xu, Xu
    Yang, Lei
    Zhang, Yi
    Wu, Tongning
    Sun, Junhui
    IEEE ACCESS, 2020, 8 (08): : 146838 - 146846
  • [45] Divide-and-conquer signal processing, feature extraction, and machine learning for big data
    Chen Bo-Wei
    Wen Ji
    Rho, Seungmin
    NEUROCOMPUTING, 2016, 174 : 383 - 383
  • [46] ECFFNet: Effective and Consistent Feature Fusion Network for RGB-T Salient Object Detection
    Zhou, Wujie
    Guo, Qinling
    Lei, Jingsheng
    Yu, Lu
    Hwang, Jenq-Neng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (03) : 1224 - 1235
  • [47] Edge-guided feature fusion network for RGB-T salient object detection
    Chen, Yuanlin
    Sun, Zengbao
    Yan, Cheng
    Zhao, Ming
    FRONTIERS IN NEUROROBOTICS, 2024, 18
  • [48] Obfuscating Community Structure in Complex Network With Evolutionary Divide-and-Conquer Strategy
    Zhao, Jie
    Cheong, Kang Hao
    IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2023, 27 (06) : 1926 - 1940
  • [49] Divide-and-conquer dispatching scheme for satellite Clos-network switches
    Zhang, Mao-Sen
    Qiu, Zhi-Liang
    Gao, Ya
    Li, Jun
    Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2012, 34 (11): : 2734 - 2740
  • [50] Multi-modal neural networks with multi-scale RGB-T fusion for semantic segmentation
    Lyu, Y.
    Schiopu, I.
    Munteanu, A.
    ELECTRONICS LETTERS, 2020, 56 (18) : 920 - 922