Remote Sensing Image Semantic Segmentation Algorithm Based on TransMANet

被引：0

作者：

Song Xirui ^{[1
,2
]}

Ge Hongwei ^{[1
,2
]}

机构：

[1] Jiangnan Univ, Sch Artificial Intelligence & Comp Sci, Wuxi 214122, Jiangsu, Peoples R China

[2] Jiangnan Univ, Jiangsu Prov Engn Lab Pattern Recognit & Computat, Wuxi 214122, Jiangsu, Peoples R China

来源：

LASER & OPTOELECTRONICS PROGRESS | 2024年 / 61卷 / 10期

关键词：

image processing; semantic segmentation; attention mechanism; Transformer; high-resolution remote sensing image;

D O I：

10.3788/LOP232052

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Herein, we propose a Transformer multiattention network (TransMANet), a network structure based on Transformer and attention mechanisms, to address the issues of low segmentation accuracy, inadequate global feature extraction, and insufficient association between the multiattention network (MANet) algorithm and image semantic information. This network structure features a dual-branch decoder that combines local and global contexts and enhances the semantic information of shallow networks. First, we introduce a local attention embedding mechanism that enhances the embedding of context information and semantic information of high-level features into low-level features. Then, we design a dual-branch decoder that combines Transformer and convolutional neural networks, which extracts global context information and detailed information with different scales, thereby modeling global and local information. Finally, we improve the original loss function and use a joint loss function that combines cross-entropy loss and Dice loss to address the class imbalance problem often encountered in remote sensing datasets and thus improve segmentation accuracy. Our experimental results demonstrate the superiority of TransMANet over MANet and other advanced methods in terms of intersection over union on UAVid, LoveDA, Potsdam, and Vaihingen datasets. This indicates the strong generalization capability of TransMANet and its effectiveness in achieving accurate segmentation results.

引用

页数：12

共 29 条

[1] [Anonymous], 2010, Int. J. Comput. Sci. Eng.
[2] Attention Mechanism-Based Object Detection Algorithm in Aerial Images
Bai, Zongbao
Zhang, Junju
Gao, Yuan
Hu, Youcheng
[J]. LASER & OPTOELECTRONICS PROGRESS, 2023, 60 (12)
[3] Chen J., 2021, arXiv, DOI DOI 10.48550/ARXIV.2102.04306
[4] Chen LC, 2017, Arxiv, DOI [arXiv:1706.05587, DOI 10.48550/ARXIV.1706.05587]
[5] [陈玲 Chen Ling], 2019, [国土资源遥感, Remote Sensing for Land & Resources], V31, P1
[6] LANet: Local Attention Embedding to Improve the Semantic Segmentation of Remote Sensing Images
Ding, Lei
Tang, Hao
Bruzzone, Lorenzo
[J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (01): : 426 - 435
[7] Data-driven science and machine learning methods in laser-plasma physics
Doepp, Andreas
Eberle, Christoph
Howard, Sunny
Irshad, Faran
Lin, Jinpu
Streeter, Matthew
[J]. HIGH POWER LASER SCIENCE AND ENGINEERING, 2023, 11
[8] Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929
[9] Dual Attention Network for Scene Segmentation
Fu, Jun
Liu, Jing
Tian, Haijie
Li, Yong
Bao, Yongjun
Fang, Zhiwei
Lu, Hanqing
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3141 - 3149
[10] Optimal multi-level thresholding using a two-stage Otsu optimization approach
Huang, Deng-Yuan
Wang, Chia-Hung
[J]. PATTERN RECOGNITION LETTERS, 2009, 30 (03) : 275 - 284

← 1 2 3 →