A Semantic Segmentation Method for Remote Sensing Images Based on the Swin Transformer Fusion Gabor Filter

Cited by: 17

Authors:

Feng, Dongdong

Zhang, Zhihua [1]

Yan, Kun

Affiliations:

[1] Lanzhou Jiaotong Univ, Fac Geomat, Lanzhou 730070, Peoples R China

Source:

IEEE ACCESS | 2022 / Vol. 10

Funding:

National Natural Science Foundation of China;

Keywords:

Image segmentation; Feature extraction; Transformers; Remote sensing; Convolution; Semantics; Image edge detection; FAM; Gabor filter; remote sensing; semantic segmentation; Swin transformer; SCENE CLASSIFICATION; ATTENTION; MODEL;

DOI:

10.1109/ACCESS.2022.3193248

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

Semantic segmentation of remote sensing images is increasingly important in urban planning, autonomous driving, disaster monitoring, and land cover classification. With the development of high-resolution remote sensing satellite technology, multilevel, large-scale, and high-precision segmentation has become the focus of current research. High-resolution remote sensing images have high intraclass diversity and low interclass separability, which makes it difficult to represent multiscale details precisely. In this paper, a semantic segmentation method for remote sensing images that fuses a Swin Transformer with a Gabor filter is proposed. First, a Swin Transformer is used as the backbone network to extract image information at different levels. Then, the texture and edge features of the input image are extracted with a Gabor filter, and the multilevel features are merged by introducing a feature aggregation module (FAM) and an attentional embedding module (AEM). Finally, the segmentation result is optimized with a fully connected conditional random field (FC-CRF). The proposed method, called Swin-S-GF, achieved mean Intersection over Union (mIoU) scores of 80.14%, 66.50%, and 70.61% on the large-scale classification set, the fine land-cover classification set, and the "AI + Remote Sensing imaging dataset" (AI+RS dataset), respectively. Compared with DeepLabV3, mIoU increased by 0.67%, 3.43%, and 3.80%, respectively. Therefore, we believe that this model provides a good tool for the high-precision semantic segmentation of remote sensing images.
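The abstract names two ingredients that are easy to make concrete: a Gabor filter for texture and edge features, and mIoU as the evaluation metric. The sketch below is a generic illustration in pure Python, not the authors' implementation. The function names `gabor_kernel` and `mean_iou`, all parameter values, and the choice to skip classes absent from both prediction and ground truth when averaging are assumptions; the abstract does not give the paper's settings.

```python
import math


def gabor_kernel(size, sigma, theta, lambd, gamma=0.5, psi=0.0):
    """Real part of a 2-D Gabor kernel, returned as a list of rows.

    size  : odd kernel width/height in pixels (assumed value, not from the paper)
    sigma : standard deviation of the Gaussian envelope
    theta : orientation of the filter in radians
    lambd : wavelength of the cosine carrier
    gamma : spatial aspect ratio of the envelope
    psi   : phase offset of the carrier
    """
    half = size // 2
    kernel = []
    for y in range(-half, half + 1):
        row = []
        for x in range(-half, half + 1):
            # Rotate pixel coordinates into the filter's orientation.
            xr = x * math.cos(theta) + y * math.sin(theta)
            yr = -x * math.sin(theta) + y * math.cos(theta)
            envelope = math.exp(-(xr * xr + (gamma * yr) ** 2) / (2.0 * sigma ** 2))
            carrier = math.cos(2.0 * math.pi * xr / lambd + psi)
            row.append(envelope * carrier)
        kernel.append(row)
    return kernel


def mean_iou(pred, target, num_classes):
    """Mean Intersection over Union for two flat lists of class labels.

    Classes that occur in neither list are skipped (an assumed convention).
    """
    ious = []
    for c in range(num_classes):
        inter = sum(1 for p, t in zip(pred, target) if p == c and t == c)
        union = sum(1 for p, t in zip(pred, target) if p == c or t == c)
        if union > 0:
            ious.append(inter / union)
    return sum(ious) / len(ious) if ious else 0.0


# Usage: a small bank of four orientations, as is common for texture features.
bank = [gabor_kernel(5, 2.0, k * math.pi / 4, 4.0) for k in range(4)]
score = mean_iou([0, 0, 1, 1], [0, 1, 1, 1], num_classes=2)
```

Convolving an image with each kernel in such a bank gives orientation-selective responses that emphasize edges and texture. How those responses are merged with the Swin Transformer features (FAM, AEM) is described in the paper itself and is not reproduced here.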


Pages: 77432 - 77451

Page count: 20

50 results in total

[31] Local-enhanced multi-scale aggregation swin transformer for semantic segmentation of high-resolution remote sensing images
Ren, Dong
Li, Falin
Sun, Hang
Liu, Li
Ren, Shun
Yu, Mei
INTERNATIONAL JOURNAL OF REMOTE SENSING, 2024, 45 (01) : 101 - 120
[32] MATNet: multiattention Transformer network for cropland semantic segmentation in remote sensing images
Zhang, Zixuan
Huang, Liang
Tang, Bo-Hui
Le, Weipeng
Wang, Meiqi
Cheng, Jiapei
Wu, Qiang
INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2024, 17 (01)
[33] Hybrid CNN and Transformer Network for Semantic Segmentation of UAV Remote Sensing Images
Zhou, X.
Zhou, L.
Gong, S.
Zhang, H.
Zhong, S.
Xia, Y.
Huang, Y.
IEEE Journal on Miniaturization for Air and Space Systems, 2024, 5 (01) : 33 - 41
[34] AAFormer: Attention-Attended Transformer for Semantic Segmentation of Remote Sensing Images
Li, Xin
Xu, Feng
Li, Linyang
Xu, Nan
Liu, Fan
Yuan, Chi
Chen, Ziqi
Lyu, Xin
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21 : 1 - 5
[35] SSDT: Scale-Separation Semantic Decoupled Transformer for Semantic Segmentation of Remote Sensing Images
Zheng, Chengyu
Jiang, Yanru
Lv, Xiaowei
Nie, Jie
Liang, Xinyue
Wei, Zhiqiang
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 9037 - 9052
[36] Swin-CDSA: The Semantic Segmentation of Remote Sensing Images Based on Cascaded Depthwise Convolution and Spatial Attention Mechanism
Kang, Yuhan
Ji, Jian
Xu, Hekai
Yang, Yong
Chen, Peng
Zhao, Hui
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21
[37] Bidirectional Feature Fusion and Enhanced Alignment Based Multimodal Semantic Segmentation for Remote Sensing Images
Liu, Qianqian
Wang, Xili
REMOTE SENSING, 2024, 16 (13)
[38] Semantic Segmentation of Remote-Sensing Images Based on Multiscale Feature Fusion and Attention Refinement
He, Xin
Zhou, Yong
Zhao, Jiaqi
Zhang, Man
Yao, Rui
Liu, Bing
Li, Haichao
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
[40] FTransDeepLab: Multimodal Fusion Transformer-Based DeepLabv3+ for Remote Sensing Semantic Segmentation
Feng, Haixia
Hu, Qingwu
Zhao, Pengcheng
Wang, Shunli
Ai, Mingyao
Zheng, Daoyuan
Liu, Tiancheng
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
