A Semantic Segmentation Method for Remote Sensing Images Based on the Swin Transformer Fusion Gabor Filter

Cited by: 17
Authors
Feng, Dongdong
Zhang, Zhihua [1]
Yan, Kun
Affiliations
[1] Lanzhou Jiaotong University, Faculty of Geomatics, Lanzhou 730070, People's Republic of China
Funding
National Natural Science Foundation of China
Keywords
Image segmentation; Feature extraction; Transformers; Remote sensing; Convolution; Semantics; Image edge detection; FAM; Gabor filter; remote sensing; semantic segmentation; Swin transformer; SCENE CLASSIFICATION; ATTENTION; MODEL;
DOI
10.1109/ACCESS.2022.3193248
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Semantic segmentation of remote sensing images is increasingly important in urban planning, autonomous driving, disaster monitoring, and land cover classification. With the development of high-resolution remote sensing satellite technology, multilevel, large-scale, and high-precision segmentation has become the focus of current research. High-resolution remote sensing images have high intraclass diversity and low interclass separability, which makes it difficult to represent multiscale information precisely and in detail. In this paper, a semantic segmentation method for remote sensing images based on a Swin Transformer fused with a Gabor filter is proposed. First, a Swin Transformer is used as the backbone network to extract image information at different levels. Then, the texture and edge features of the input image are extracted with a Gabor filter, and the multilevel features are merged by introducing a feature aggregation module (FAM) and an attentional embedding module (AEM). Finally, the segmentation result is optimized with a fully connected conditional random field (FC-CRF). The proposed method, called Swin-S-GF, achieved mean Intersection over Union (mIoU) scores of 80.14%, 66.50%, and 70.61% on the large-scale classification set, the fine land-cover classification set, and the "AI + Remote Sensing imaging dataset" (AI+RS dataset), respectively. Compared with DeepLabV3, mIoU increased by 0.67%, 3.43%, and 3.80%, respectively. Therefore, we believe that this model provides a good tool for high-precision semantic segmentation of remote sensing images.
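The abstract states that texture and edge features are extracted with a Gabor filter but gives no filter parameters. The following is a minimal sketch of the standard real-valued 2-D Gabor kernel, not the authors' implementation. The function names `gabor_kernel` and `filter_valid`, and all parameter values (kernel size, `sigma`, `lambd`, `gamma`), are illustrative assumptions chosen only to show that such a kernel responds strongly to stripes at its own orientation and weakly to the orthogonal one.

```python
import math


def gabor_kernel(size, sigma, theta, lambd, gamma=0.5, psi=0.0):
    """Real part of a 2-D Gabor kernel, returned as a size x size list of lists.

    g(x, y) = exp(-(x'^2 + gamma^2 * y'^2) / (2 * sigma^2)) * cos(2*pi*x'/lambd + psi)
    where (x', y') are the coordinates rotated by theta.
    All parameter values used below are assumptions, not taken from the paper.
    """
    if size % 2 == 0:
        raise ValueError("size must be odd so the kernel has a centre pixel")
    half = size // 2
    cos_t, sin_t = math.cos(theta), math.sin(theta)
    kernel = []
    for y in range(-half, half + 1):
        row = []
        for x in range(-half, half + 1):
            xr = x * cos_t + y * sin_t    # rotated x coordinate
            yr = -x * sin_t + y * cos_t   # rotated y coordinate
            envelope = math.exp(-(xr * xr + (gamma * yr) ** 2) / (2.0 * sigma * sigma))
            carrier = math.cos(2.0 * math.pi * xr / lambd + psi)
            row.append(envelope * carrier)
        kernel.append(row)
    return kernel


def filter_valid(image, kernel):
    """Slide the kernel over the image (cross-correlation, no padding)."""
    k = len(kernel)
    rows, cols = len(image), len(image[0])
    out = []
    for i in range(rows - k + 1):
        out_row = []
        for j in range(cols - k + 1):
            acc = 0.0
            for u in range(k):
                for v in range(k):
                    acc += kernel[u][v] * image[i + u][j + v]
            out_row.append(acc)
        out.append(out_row)
    return out


def max_abs(response):
    """Largest absolute value in a 2-D response map."""
    return max(abs(value) for row in response for value in row)


# Synthetic test image: vertical stripes (intensity varies along x, period 4 pixels).
stripes = [[math.cos(2.0 * math.pi * x / 4.0) for x in range(16)] for _ in range(16)]

# theta = 0 puts the carrier wave along x, matching the stripes;
# theta = pi/2 puts it along y, orthogonal to the stripes.
aligned = gabor_kernel(size=9, sigma=2.0, theta=0.0, lambd=4.0)
orthogonal = gabor_kernel(size=9, sigma=2.0, theta=math.pi / 2.0, lambd=4.0)

strong = max_abs(filter_valid(stripes, aligned))
weak = max_abs(filter_valid(stripes, orthogonal))
```

In practice a bank of such kernels at several orientations and wavelengths is typically applied, and the resulting response maps serve as texture and edge features. How the paper configures its filters and fuses the responses with the Swin Transformer features is not specified in the abstract.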
Pages: 77432-77451
Number of pages: 20
Related papers
50 in total
  • [31] Local-enhanced multi-scale aggregation swin transformer for semantic segmentation of high-resolution remote sensing images
    Ren, Dong
    Li, Falin
    Sun, Hang
    Liu, Li
    Ren, Shun
    Yu, Mei
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2024, 45 (01) : 101 - 120
  • [32] MATNet: multiattention Transformer network for cropland semantic segmentation in remote sensing images
    Zhang, Zixuan
    Huang, Liang
    Tang, Bo-Hui
    Le, Weipeng
    Wang, Meiqi
    Cheng, Jiapei
    Wu, Qiang
    INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2024, 17 (01)
  • [33] Hybrid CNN and Transformer Network for Semantic Segmentation of UAV Remote Sensing Images
    Zhou X.
    Zhou L.
    Gong S.
    Zhang H.
    Zhong S.
    Xia Y.
    Huang Y.
    IEEE Journal on Miniaturization for Air and Space Systems, 2024, 5 (01) : 33 - 41
  • [34] AAFormer: Attention-Attended Transformer for Semantic Segmentation of Remote Sensing Images
    Li, Xin
    Xu, Feng
    Li, Linyang
    Xu, Nan
    Liu, Fan
    Yuan, Chi
    Chen, Ziqi
    Lyu, Xin
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21 : 1 - 5
  • [35] SSDT: Scale-Separation Semantic Decoupled Transformer for Semantic Segmentation of Remote Sensing Images
    Zheng, Chengyu
    Jiang, Yanru
    Lv, Xiaowei
    Nie, Jie
    Liang, Xinyue
    Wei, Zhiqiang
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 9037 - 9052
  • [36] Swin-CDSA: The Semantic Segmentation of Remote Sensing Images Based on Cascaded Depthwise Convolution and Spatial Attention Mechanism
    Kang, Yuhan
    Ji, Jian
    Xu, Hekai
    Yang, Yong
    Chen, Peng
    Zhao, Hui
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21
  • [37] Bidirectional Feature Fusion and Enhanced Alignment Based Multimodal Semantic Segmentation for Remote Sensing Images
    Liu, Qianqian
    Wang, Xili
    REMOTE SENSING, 2024, 16 (13)
  • [38] Semantic Segmentation of Remote-Sensing Images Based on Multiscale Feature Fusion and Attention Refinement
    He, Xin
    Zhou, Yong
    Zhao, Jiaqi
    Zhang, Man
    Yao, Rui
    Liu, Bing
    Li, Haichao
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [40] FTransDeepLab: Multimodal Fusion Transformer-Based DeepLabv3+ for Remote Sensing Semantic Segmentation
    Feng, Haixia
    Hu, Qingwu
    Zhao, Pengcheng
    Wang, Shunli
    Ai, Mingyao
    Zheng, Daoyuan
    Liu, Tiancheng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63