A Semantic Segmentation Method for Remote Sensing Images Based on the Swin Transformer Fusion Gabor Filter

被引:17
|
作者
Feng, Dongdong
Zhang, Zhihua [1 ]
Yan, Kun
机构
[1] Lanzhou Jiaotong Univ, Fac Geomat, Lanzhou 730070, Peoples R China
基金
中国国家自然科学基金;
关键词
Image segmentation; Feature extraction; Transformers; Remote sensing; Convolution; Semantics; Image edge detection; FAM; Gabor filter; remote sensing; semantic segmentation; Swin transformer; SCENE CLASSIFICATION; ATTENTION; MODEL;
D O I
10.1109/ACCESS.2022.3193248
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Semantic segmentation of remote sensing images is increasingly important in urban planning, autonomous driving, disaster monitoring, and land cover classification. With the development of high-resolution remote sensing satellite technology, multilevel, large-scale, and high-precision segmentation has become the focus of current research. High-resolution remote sensing images have high intraclass diversity and low interclass separability, which pose challenges to the precision of the detailed representation of multiscale information. In this paper, a semantic segmentation method for remote sensing images based on Swin Transformer fusion with a Gabor filter is proposed. First, a Swin Transformer is used as the backbone network to extract image information at different levels. Then, the texture and edge features of the input image are extracted with a Gabor filter, and the multilevel features are merged by introducing a feature aggregation module (FAM) and an attentional embedding module (AEM). Finally, the segmentation result is optimized with the fully connected conditional random field (FC-CRF). Our proposed method, called Swin-S-GF, its mean Intersection over Union (mIoU) scored 80.14%, 66.50%, and 70.61% on the large-scale classification set, the fine land-cover classification set, and the "AI + Remote Sensing imaging dataset" (AI+RS dataset), respectively. Compared with DeepLabV3, mIoU increased by 0.67%, 3.43%, and 3.80%, respectively. Therefore, we believe that this model provides a good tool for the semantic segmentation of high-precision remote sensing images.
引用
收藏
页码:77432 / 77451
页数:20
相关论文
共 50 条
  • [21] A Novel Transformer Based Semantic Segmentation Scheme for Fine-Resolution Remote Sensing Images
    Wang, Libo
    Li, Rui
    Duan, Chenxi
    Zhang, Ce
    Meng, Xiaoliang
    Fang, Shenghui
    IEEE Geoscience and Remote Sensing Letters, 2022, 19
  • [22] An Improved Semantic Segmentation Method for Remote Sensing Images Based on Neural Network
    Jiang, Na
    Li, Jiyuan
    TRAITEMENT DU SIGNAL, 2020, 37 (02) : 271 - 278
  • [23] A Novel Transformer Based Semantic Segmentation Scheme for Fine-Resolution Remote Sensing Images
    Wang, Libo
    Li, Rui
    Duan, Chenxi
    Zhang, Ce
    Meng, Xiaoliang
    Fang, Shenghui
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [24] A Semantic Segmentation Method for Remote Sensing Images Based on an Improved TransDeepLab Model
    Wang, Jinxin
    Wang, Manman
    Cong, Kaiwei
    Qin, Zilong
    LAND, 2025, 14 (01)
  • [25] Semantic Segmentation of Remote Sensing Images Using Multiway Fusion Network
    Wu, Xiaosuo
    Wang, Liling
    Wu, Chaoyang
    Guo, Cunge
    Yan, Haowen
    Qiao, Ze
    SIGNAL PROCESSING, 2024, 215
  • [26] Improved SegFormer Network Based Method for Semantic Segmentation of Remote Sensing Images
    Tian, Xuewei
    Wang, Jiali
    Chen, Ming
    Du, Shouqing
    Computer Engineering and Applications, 2023, 59 (08): : 217 - 226
  • [27] SwinSTFM: Remote Sensing Spatiotemporal Fusion Using Swin Transformer
    Chen, Guanyu
    Jiao, Peng
    Hu, Qing
    Xiao, Linjie
    Ye, Zijian
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [28] Indoor semantic segmentation based on Swin-Transformer
    Zheng, Yunping
    Xu, Yuan
    Shu, Shiqiang
    Sarem, Mudar
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 98
  • [29] Remote sensing segmentation through a filter bank based on Gabor functions
    Garcia-Consuegra, J
    Cisneros, G
    Ballesteros, J
    Molina, R
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 1169 - 1171
  • [30] Hybrid Attention Fusion Embedded in Transformer for Remote Sensing Image Semantic Segmentation
    Chen, Yan
    Dong, Quan
    Wang, Xiaofeng
    Zhang, Qianchuan
    Kang, Menglei
    Jiang, Wenxiang
    Wang, Mengyuan
    Xu, Lixiang
    Zhang, Chen
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 4421 - 4435