MBT-UNet: Multi-Branch Transform Combined with UNet for Semantic Segmentation of Remote Sensing Images

被引:2
|
作者
Liu, Bin [1 ]
Li, Bing [1 ]
Sreeram, Victor [2 ]
Li, Shuofeng [1 ]
机构
[1] Harbin Engn Univ, Coll Intelligent Syst Sci & Engn, Harbin 150001, Peoples R China
[2] Univ Western Australia, Sch Elect Elect & Comp Engn, Perth 6009, Australia
关键词
transformer; semantic segmentation; convolutional neural network; remote sensing; NETWORK; BUILDINGS; MODEL;
D O I
10.3390/rs16152776
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Remote sensing (RS) images play an indispensable role in many key fields such as environmental monitoring, precision agriculture, and urban resource management. Traditional deep convolutional neural networks have the problem of limited receptive fields. To address this problem, this paper introduces a hybrid network model that combines the advantages of CNN and Transformer, called MBT-UNet. First, a multi-branch encoder design based on the pyramid vision transformer (PVT) is proposed to effectively capture multi-scale feature information; second, an efficient feature fusion module (FFM) is proposed to optimize the collaboration and integration of features at different scales; finally, in the decoder stage, a multi-scale upsampling module (MSUM) is proposed to further refine the segmentation results and enhance segmentation accuracy. We conduct experiments on the ISPRS Vaihingen dataset, the Potsdam dataset, the LoveDA dataset, and the UAVid dataset. Experimental results show that MBT-UNet surpasses state-of-the-art algorithms in key performance indicators, confirming its superior performance in high-precision remote sensing image segmentation tasks.
引用
收藏
页数:25
相关论文
共 50 条
  • [31] UMF-Net: A UNet-based multi-branch feature fusion network for colon polyp segmentation
    Wan, Yulong
    Zhou, Dongming
    Wang, Changcheng
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 99
  • [32] Water body segmentation in remote sensing images based on multi-scale fusion attention module improved UNet
    Shi, Tian-Tan
    Guo, Zhong-Hua
    Yan, Xiang
    Wei, Shi-Qin
    CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2023, 38 (03) : 397 - 408
  • [33] Reconsidering Multi-Branch Aggregation for Semantic Segmentation
    Cai, Pengjie
    Yang, Derong
    Zou, Yonglin
    Chen, Ruihan
    Dai, Ming
    ELECTRONICS, 2023, 12 (15)
  • [34] T-UNet: triplet UNet for change detection in high-resolution remote sensing images
    Zhong, Huan
    Wu, Chen
    GEO-SPATIAL INFORMATION SCIENCE, 2024,
  • [35] MCANet: A Multi-Branch Network for Cloud/Snow Segmentation in High-Resolution Remote Sensing Images
    Hu, Kai
    Zhang, Enwei
    Xia, Min
    Weng, Liguo
    Lin, Haifeng
    REMOTE SENSING, 2023, 15 (04)
  • [36] Multi-Branch Supervised Learning on Semantic Segmentation
    Chen, Wenxin
    Zhang, Ting
    Zhao, Xing
    PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 6841 - 6845
  • [37] REMOTE SENSING IMAGES FEATURE LEARNING BASED ON MULTI-BRANCH NETWORKS
    Liu, Chao
    Tang, Xu
    Ma, Jingjing
    Zhang, Xiangrong
    Liu, Fang
    Ma, Junyong
    Jiao, Licheng
    IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 2057 - 2060
  • [38] Review of Semantic Segmentation of Medical Images Using Modified Architectures of UNET
    Krithika Alias AnbuDevi, M.
    Suganthi, K.
    DIAGNOSTICS, 2022, 12 (12)
  • [39] nmODE-Unet: A Novel Network for Semantic Segmentation of Medical Images
    Wang, Shubin
    Chen, Yuanyuan
    Yi, Zhang
    APPLIED SCIENCES-BASEL, 2024, 14 (01):
  • [40] UNetFormer: A UNet-like transformer for efficient semantic segmentation of remote sensing urban scene imagery
    Wang, Libo
    Li, Rui
    Zhang, Ce
    Fang, Shenghui
    Duan, Chenxi
    Meng, Xiaoliang
    Atkinson, Peter M.
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2022, 190 : 196 - 214