Remote Sensing Image Semantic Segmentation Algorithm Based on TransMANet

Cited by: 0
Authors
Song Xirui [1, 2]
Ge Hongwei [1, 2]
Affiliations
[1] Jiangnan Univ, Sch Artificial Intelligence & Comp Sci, Wuxi 214122, Jiangsu, Peoples R China
[2] Jiangnan Univ, Jiangsu Prov Engn Lab Pattern Recognit & Computat, Wuxi 214122, Jiangsu, Peoples R China
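Keywords
image processing; semantic segmentation; attention mechanism; Transformer; high-resolution remote sensing image
DOI
10.3788/LOP232052
Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract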
We propose the Transformer multiattention network (TransMANet), a network based on Transformer and attention mechanisms, to address three shortcomings of the multiattention network (MANet) algorithm: low segmentation accuracy, inadequate global feature extraction, and insufficient association with image semantic information. The network features a dual-branch decoder that combines local and global contexts and enhances the semantic information of the shallow layers. First, we introduce a local attention embedding mechanism that embeds the context and semantic information of high-level features into low-level features. Second, we design a dual-branch decoder that combines a Transformer with a convolutional neural network; it extracts global context information and multiscale detail information, thereby modeling both global and local information. Finally, we replace the original loss function with a joint loss that combines cross-entropy loss and Dice loss, which addresses the class imbalance often encountered in remote sensing datasets and improves segmentation accuracy. Experimental results show that TransMANet outperforms MANet and other advanced methods in terms of intersection over union on the UAVid, LoveDA, Potsdam, and Vaihingen datasets, indicating strong generalization capability and accurate segmentation.
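The joint loss mentioned in the abstract can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: the abstract does not give the weighting between the two terms, so the `ce_weight` and `dice_weight` parameters, the soft (probability-based) form of the Dice term, the `eps` smoothing constant, and all function names here are assumptions made for illustration.

```python
import math

def softmax(logits):
    """Convert one pixel's class logits to probabilities (numerically stable)."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def cross_entropy_loss(probs, labels, eps=1e-7):
    """Mean negative log-probability of the true class over all pixels."""
    return -sum(math.log(p[y] + eps) for p, y in zip(probs, labels)) / len(labels)

def dice_loss(probs, labels, num_classes, eps=1e-7):
    """One minus the soft Dice coefficient, averaged over classes.

    Each class contributes equally regardless of its pixel count, which is
    why Dice-style terms are commonly used against class imbalance.
    """
    dice_sum = 0.0
    for c in range(num_classes):
        intersection = sum(p[c] for p, y in zip(probs, labels) if y == c)
        pred_sum = sum(p[c] for p in probs)
        target_sum = sum(1 for y in labels if y == c)
        dice_sum += (2.0 * intersection + eps) / (pred_sum + target_sum + eps)
    return 1.0 - dice_sum / num_classes

def joint_loss(logits, labels, num_classes, ce_weight=1.0, dice_weight=1.0):
    """Weighted sum of cross-entropy and Dice loss (weights are assumed)."""
    probs = [softmax(z) for z in logits]
    return (ce_weight * cross_entropy_loss(probs, labels)
            + dice_weight * dice_loss(probs, labels, num_classes))

# Two pixels, two classes: confident correct vs. confident wrong predictions.
good = joint_loss([[20.0, 0.0], [0.0, 20.0]], [0, 1], num_classes=2)
bad = joint_loss([[0.0, 20.0], [20.0, 0.0]], [0, 1], num_classes=2)
```

In this sketch `good` is close to zero and `bad` is much larger, since both terms penalize the wrong predictions. A real training setup would compute the same quantities on batched tensors in a deep learning framework.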
Pages: 12
Related papers
29 in total
  • [11] Multiattention Network for Semantic Segmentation of Fine-Resolution Remote Sensing Images
    Li, Rui
    Zheng, Shunyi
    Zhang, Ce
    Duan, Chenxi
    Su, Jianlin
    Wang, Libo
    Atkinson, Peter M.
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [12] ABCNet: Attentive bilateral contextual network for efficient semantic segmentation of Fine-Resolution remotely sensed imagery
    Li, Rui
    Zheng, Shunyi
    Zhang, Ce
    Duan, Chenxi
    Wang, Libo
    Atkinson, Peter M.
    [J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2021, 181 : 84 - 98
  • [13] Applications of object detection networks in high-power laser systems and experiments
    Lin, Jinpu
    Haberstroh, Florian
    Karsch, Stefan
    Doepp, Andreas
    [J]. HIGH POWER LASER SCIENCE AND ENGINEERING, 2023, 11
  • [14] Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
    Liu, Ze
    Lin, Yutong
    Cao, Yue
    Hu, Han
    Wei, Yixuan
    Zhang, Zheng
    Lin, Stephen
    Guo, Baining
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021 : 9992 - 10002
  • [15] CFPNET: CHANNEL-WISE FEATURE PYRAMID FOR REAL-TIME SEMANTIC SEGMENTATION
    Lou, Ange
    Loew, Murray
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021 : 1894 - 1898
  • [16] U-Net: Convolutional Networks for Biomedical Image Segmentation
    Ronneberger, Olaf
    Fischer, Philipp
    Brox, Thomas
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION, PT III, 2015, 9351 : 234 - 241
  • [17] Efficient semantic segmentation with pyramidal fusion
    Orsic, Marin
    Segvic, Sinisa
    [J]. PATTERN RECOGNITION, 2021, 110
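  • [18] Urban waterlogging impact analysis using deep learning models
    Pan, Yin
    Shao, Zhenfeng
    Cheng, Tao
    He, Wei
    [J]. GEOMATICS AND INFORMATION SCIENCE OF WUHAN UNIVERSITY, 2019, 44 (01) : 132 - 138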
  • [19] U-Net: Convolutional Networks for Biomedical Image Segmentation
    Ronneberger, Olaf
    Fischer, Philipp
    Brox, Thomas
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION, PT III, 2015, 9351 : 234 - 241
  • [20] Fully Convolutional Networks for Semantic Segmentation
    Shelhamer, Evan
    Long, Jonathan
    Darrell, Trevor
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (04) : 640 - 651