Remote Sensing Image Semantic Segmentation Algorithm Based on TransMANet

被引：0

作者：

Song Xirui ^{[1
,2
]}

Ge Hongwei ^{[1
,2
]}

机构：

[1] Jiangnan Univ, Sch Artificial Intelligence & Comp Sci, Wuxi 214122, Jiangsu, Peoples R China

[2] Jiangnan Univ, Jiangsu Prov Engn Lab Pattern Recognit & Computat, Wuxi 214122, Jiangsu, Peoples R China

来源：

LASER & OPTOELECTRONICS PROGRESS | 2024年 / 61卷 / 10期

关键词：

image processing; semantic segmentation; attention mechanism; Transformer; high-resolution remote sensing image;

D O I：

10.3788/LOP232052

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Herein, we propose a Transformer multiattention network (TransMANet), a network structure based on Transformer and attention mechanisms, to address the issues of low segmentation accuracy, inadequate global feature extraction, and insufficient association between the multiattention network (MANet) algorithm and image semantic information. This network structure features a dual-branch decoder that combines local and global contexts and enhances the semantic information of shallow networks. First, we introduce a local attention embedding mechanism that enhances the embedding of context information and semantic information of high-level features into low-level features. Then, we design a dual-branch decoder that combines Transformer and convolutional neural networks, which extracts global context information and detailed information with different scales, thereby modeling global and local information. Finally, we improve the original loss function and use a joint loss function that combines cross-entropy loss and Dice loss to address the class imbalance problem often encountered in remote sensing datasets and thus improve segmentation accuracy. Our experimental results demonstrate the superiority of TransMANet over MANet and other advanced methods in terms of intersection over union on UAVid, LoveDA, Potsdam, and Vaihingen datasets. This indicates the strong generalization capability of TransMANet and its effectiveness in achieving accurate segmentation results.

引用

页数：12

共 29 条

[11] Multiattention Network for Semantic Segmentation of Fine-Resolution Remote Sensing Images
Li, Rui
Zheng, Shunyi
Zhang, Ce
Duan, Chenxi
Su, Jianlin
Wang, Libo
Atkinson, Peter M.
[J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[12] ABCNet: Attentive bilateral contextual network for efficient semantic segmentation of Fine-Resolution remotely sensed imagery
Li, Rui
Zheng, Shunyi
Zhang, Ce
Duan, Chenxi
Wang, Libo
Atkinson, Peter M.
[J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2021, 181 : 84 - 98
[13] Applications of object detection networks in high-power laser systems and experiments
Lin, Jinpu
Haberstroh, Florian
Karsch, Stefan
Doepp, Andreas
[J]. HIGH POWER LASER SCIENCE AND ENGINEERING, 2023, 11
[14] Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
Liu, Ze
Lin, Yutong
Cao, Yue
Hu, Han
Wei, Yixuan
Zhang, Zheng
Lin, Stephen
Guo, Baining
[J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 9992 - 10002
[15] CFPNET: CHANNEL-WISE FEATURE PYRAMID FOR REAL-TIME SEMANTIC SEGMENTATION
Lou, Ange
Loew, Murray
[J]. 2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 1894 - 1898
[16] U-Net: Convolutional Networks for Biomedical Image Segmentation
Ronneberger, Olaf
Fischer, Philipp
Brox, Thomas
[J]. MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION, PT III, 2015, 9351 : 234 - 241
[17] Efficient semantic segmentation with pyramidal fusion
Orsic, Marin
Segvic, Sinisa
[J]. PATTERN RECOGNITION, 2021, 110
[18] 利用深度学习模型进行城市内涝影响分析
潘银
邵振峰
程涛
贺蔚
[J]. 武汉大学学报(信息科学版), 2019, 44 (01) : 132 - 138
[19] U-Net: Convolutional Networks for Biomedical Image Segmentation
Ronneberger, Olaf
Fischer, Philipp
Brox, Thomas
[J]. MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION, PT III, 2015, 9351 : 234 - 241
[20] Fully Convolutional Networks for Semantic Segmentation
Shelhamer, Evan
Long, Jonathan
Darrell, Trevor
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (04) : 640 - 651

← 1 2 3 →