Remote Sensing Image Semantic Segmentation Algorithm Based on TransMANet

被引:0
|
作者
Song Xirui [1 ,2 ]
Ge Hongwei [1 ,2 ]
机构
[1] Jiangnan Univ, Sch Artificial Intelligence & Comp Sci, Wuxi 214122, Jiangsu, Peoples R China
[2] Jiangnan Univ, Jiangsu Prov Engn Lab Pattern Recognit & Computat, Wuxi 214122, Jiangsu, Peoples R China
关键词
image processing; semantic segmentation; attention mechanism; Transformer; high-resolution remote sensing image;
D O I
10.3788/LOP232052
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Herein, we propose a Transformer multiattention network (TransMANet), a network structure based on Transformer and attention mechanisms, to address the issues of low segmentation accuracy, inadequate global feature extraction, and insufficient association between the multiattention network (MANet) algorithm and image semantic information. This network structure features a dual-branch decoder that combines local and global contexts and enhances the semantic information of shallow networks. First, we introduce a local attention embedding mechanism that enhances the embedding of context information and semantic information of high-level features into low-level features. Then, we design a dual-branch decoder that combines Transformer and convolutional neural networks, which extracts global context information and detailed information with different scales, thereby modeling global and local information. Finally, we improve the original loss function and use a joint loss function that combines cross-entropy loss and Dice loss to address the class imbalance problem often encountered in remote sensing datasets and thus improve segmentation accuracy. Our experimental results demonstrate the superiority of TransMANet over MANet and other advanced methods in terms of intersection over union on UAVid, LoveDA, Potsdam, and Vaihingen datasets. This indicates the strong generalization capability of TransMANet and its effectiveness in achieving accurate segmentation results.
引用
收藏
页数:12
相关论文
共 29 条
  • [1] [Anonymous], 2010, Int. J. Comput. Sci. Eng.
  • [2] Attention Mechanism-Based Object Detection Algorithm in Aerial Images
    Bai, Zongbao
    Zhang, Junju
    Gao, Yuan
    Hu, Youcheng
    [J]. LASER & OPTOELECTRONICS PROGRESS, 2023, 60 (12)
  • [3] Chen J., 2021, arXiv, DOI DOI 10.48550/ARXIV.2102.04306
  • [4] Chen LC, 2017, Arxiv, DOI [arXiv:1706.05587, DOI 10.48550/ARXIV.1706.05587]
  • [5] [陈玲 Chen Ling], 2019, [国土资源遥感, Remote Sensing for Land & Resources], V31, P1
  • [6] LANet: Local Attention Embedding to Improve the Semantic Segmentation of Remote Sensing Images
    Ding, Lei
    Tang, Hao
    Bruzzone, Lorenzo
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (01): : 426 - 435
  • [7] Data-driven science and machine learning methods in laser-plasma physics
    Doepp, Andreas
    Eberle, Christoph
    Howard, Sunny
    Irshad, Faran
    Lin, Jinpu
    Streeter, Matthew
    [J]. HIGH POWER LASER SCIENCE AND ENGINEERING, 2023, 11
  • [8] Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929
  • [9] Dual Attention Network for Scene Segmentation
    Fu, Jun
    Liu, Jing
    Tian, Haijie
    Li, Yong
    Bao, Yongjun
    Fang, Zhiwei
    Lu, Hanqing
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3141 - 3149
  • [10] Optimal multi-level thresholding using a two-stage Otsu optimization approach
    Huang, Deng-Yuan
    Wang, Chia-Hung
    [J]. PATTERN RECOGNITION LETTERS, 2009, 30 (03) : 275 - 284