Global Adaptive Second-Order Transformer for Remote Sensing Image Semantic Segmentation

Authors
Zhang, Yijie [1]
Cheng, Jian [1]
Su, Yanzhou [1]
Deng, Changjian [1]
Xia, Ziying [1]
Tashi, Nyima [2]
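Affiliations
[1] University of Electronic Science and Technology of China, School of Information and Communication Engineering, Chengdu 611731, People's Republic of China
[2] Tibet University, School of Information Science and Technology, Lhasa 850000, People's Republic of China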
Funding
National Natural Science Foundation of China
Keywords
Feature enhancement; global feature; second-order transformer; semantic segmentation
DOI
10.1109/TGRS.2024.3453501
Chinese Library Classification
P3 [Geophysics]; P59 [Geochemistry]
Discipline classification codes
0708; 070902
Abstract
In remote sensing (RS) image analysis, capturing global context is key to precise semantic segmentation. Current vision transformers (ViTs) advance this field by addressing the limited local receptive field of convolutional neural networks (CNNs). However, ViTs predominantly rely on first-order information in the image to establish global relationships, often overlooking the potential of second-order information, which is crucial for discriminating ground objects that exhibit high similarity and constant change. To address this issue, we propose a global adaptive second-order transformer network (GASOT-Net). Specifically, the proposed global adaptive second-order transformer (GASOT) enhances the existing ViT structure by mining second-order information and adaptively fusing it with the first-order information while establishing global dependency relationships. This approach enables the extraction of more discriminative features, thereby enriching the representation of global features. In addition, a local feature aggregation module (LFAM) is proposed to aggregate features from different CNN stages as input to the GASOT blocks. Moreover, to refine the boundaries of complex ground objects, a global feature enhancement module (GFEM) is used in the decoder stage. GFEM comprises two submodules: a feature shift module (FSM) and a hierarchical feature fusion module (HFFM). FSM first enhances the local feature representation, and HFFM then hierarchically aggregates local and global features from different stages. We conduct extensive experiments on four benchmark RS datasets, and the results show that GASOT-Net outperforms other state-of-the-art methods. The code will be available at: https://github.com/j136812832/GASOT-Net.
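The idea of fusing first-order and second-order information when building global token relationships can be illustrated with a small NumPy sketch. This is not the GASOT block from the paper, whose details the abstract does not give. It assumes, purely for illustration, that the first-order term is plain scaled dot-product attention with identity projections, that the second-order term is a token affinity weighted by the channel covariance matrix, and that the two are blended by a single sigmoid gate. The names `channel_covariance`, `fused_attention`, and `gate_logit` are hypothetical.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def channel_covariance(tokens):
    """Second-order statistic: (d, d) covariance of channels over N tokens."""
    centered = tokens - tokens.mean(axis=0, keepdims=True)
    return centered.T @ centered / tokens.shape[0]


def fused_attention(tokens, gate_logit=0.0):
    """Blend first-order and second-order token affinities (illustrative only).

    tokens: array of shape (N, d), one row per image patch.
    gate_logit: hypothetical scalar; in a real network it would be learned.
    Returns the attended tokens (N, d) and the fused attention map (N, N).
    """
    n, d = tokens.shape

    # First-order affinity: scaled dot product between raw token features.
    # Assumption: identity query/key/value projections, to keep the sketch short.
    first = softmax(tokens @ tokens.T / np.sqrt(d))

    # Second-order affinity: dot product weighted by the channel covariance,
    # so correlated channels contribute jointly to the similarity.
    centered = tokens - tokens.mean(axis=0, keepdims=True)
    second = softmax(centered @ channel_covariance(tokens) @ centered.T / d)

    # Adaptive fusion: a sigmoid gate gives a convex combination of both maps.
    gate = 1.0 / (1.0 + np.exp(-gate_logit))
    attn = gate * first + (1.0 - gate) * second
    return attn @ tokens, attn


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    patches = rng.normal(size=(6, 4))  # 6 tokens with 4 channels each
    out, attn = fused_attention(patches, gate_logit=0.5)
    print(out.shape, attn.shape)
```

Because both affinity maps are row-normalized and the gate forms a convex combination, each row of the fused map still sums to one, so the output stays a weighted average of the input tokens.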
Pages: 17