Multi-modal remote sensing image segmentation based on attention-driven dual-branch encoding framework

Authors
Li, Zhiyang [1]
Pan, Xuran [1]
Xu, Kexing [1]
Yang, Xinqi [1]
Affiliations
[1] Tianjin Univ Sci & Technol, Coll Artificial Intelligence, Tianjin Econ & Technol Dev Area, Tianjin, Peoples R China
Keywords
multi-modal remote sensing images; semantic segmentation; convolutional neural networks; attention-driven feature fusion; RESOLUTION; MULTISCALE; NETWORKS
DOI
10.1117/1.JRS.18.026506
Chinese Library Classification number
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
High-resolution remote sensing images are characterized by rich surface detail and diverse features, but single-modality high-resolution images have limited expressive ability in earth-object segmentation scenarios. We propose a multi-modal remote sensing image segmentation method based on an attention-driven dual-branch encoding framework. The method encodes the multi-modal remote sensing data in parallel to thoroughly extract features from each modality. Multistage multi-modal features are then fused by attention-driven feature fusion modules to generate a high-quality multi-modal feature representation. Extensive experiments are carried out on the International Society for Photogrammetry and Remote Sensing (ISPRS) Vaihingen and Potsdam 2D semantic labeling datasets, which include both RGB/IRRG images and digital surface model (DSM) images. The experimental results demonstrate that (1) the elevation information in DSM images clearly benefits earth objects with significant heights, and properly introducing DSM images improves segmentation accuracy compared with using only RGB/IRRG images; and (2) the attention-driven feature fusion module outperforms traditional feature fusion methods in capturing cross-modal complementary features, leading to outstanding segmentation accuracy for each earth object.
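The abstract does not specify how the attention-driven fusion module is built, so the following is only a minimal NumPy sketch of one common way to fuse two modality branches with attention: squeeze-and-excitation style channel gating. It is a stand-in, not the authors' module. The function name `channel_attention_fuse`, the weights `w_reduce` and `w_expand`, and all tensor sizes are hypothetical, and the random arrays merely stand in for features from an optical (RGB/IRRG) branch and a DSM branch at one encoder stage.

```python
import numpy as np


def sigmoid(x):
    """Element-wise logistic function."""
    return 1.0 / (1.0 + np.exp(-x))


def channel_attention_fuse(optical_feat, dsm_feat, w_reduce, w_expand):
    """Fuse two (C, H, W) feature maps with channel-wise attention gates.

    Assumption: squeeze-and-excitation style gating is used here as a
    stand-in; the paper's actual fusion module may differ.
    """
    c = optical_feat.shape[0]
    # Stack both modalities along the channel axis: (2C, H, W).
    stacked = np.concatenate([optical_feat, dsm_feat], axis=0)
    # Squeeze: global average pooling gives one descriptor per channel.
    squeezed = stacked.mean(axis=(1, 2))
    # Excite: bottleneck projection with ReLU, then sigmoid gates in (0, 1).
    hidden = np.maximum(w_reduce @ squeezed, 0.0)
    gates = sigmoid(w_expand @ hidden)
    # Re-weight each modality per channel and sum into one fused map.
    g_optical = gates[:c, None, None]
    g_dsm = gates[c:, None, None]
    return g_optical * optical_feat + g_dsm * dsm_feat


# Hypothetical sizes: 8 channels, 16x16 feature maps, bottleneck width 4.
rng = np.random.default_rng(0)
C, H, W, R = 8, 16, 16, 4
optical = rng.standard_normal((C, H, W))  # stand-in for RGB/IRRG branch features
dsm = rng.standard_normal((C, H, W))      # stand-in for DSM branch features
w_reduce = rng.standard_normal((R, 2 * C)) * 0.1
w_expand = rng.standard_normal((2 * C, R)) * 0.1

fused = channel_attention_fuse(optical, dsm, w_reduce, w_expand)
print(fused.shape)  # (8, 16, 16)
```

In a multistage design of the kind the abstract describes, a fusion step like this would be applied at each encoder stage, with the weights learned during training rather than drawn at random.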
Pages: 15