Pixel Representation Augmented through Cross-Attention for High-Resolution Remote Sensing Imagery Segmentation

被引:3
|
作者
Luo, Yiyun [1 ,2 ]
Wang, Jinnian [1 ,2 ]
Yang, Xiankun [1 ,2 ]
Yu, Zhenyu [1 ,2 ]
Tan, Zixuan [1 ,2 ]
机构
[1] Guangzhou Univ, Sch Geog & Remote Sensing, Guangzhou 510006, Peoples R China
[2] Guangzhou Univ, Ctr Remote Sensing Big Data Intelligence Applicat, Guangzhou 510006, Peoples R China
基金
国家重点研发计划;
关键词
land cover classification; transformer; cross-attention; object embedding queries; LAND-COVER CLASSIFICATION; SEMANTIC SEGMENTATION; NETWORK;
D O I
10.3390/rs14215415
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Natural imagery segmentation has been transferred to land cover classification in remote sensing imagery with excellent performance. However, two key issues have been overlooked in the transfer process: (1) some objects were easily overwhelmed by the complex backgrounds; (2) interclass information for indistinguishable classes was not fully utilized. The attention mechanism in the transformer is capable of modeling long-range dependencies on each sample for per-pixel context extraction. Notably, per-pixel context from the attention mechanism can aggregate category information. Therefore, we proposed a semantic segmentation method based on pixel representation augmentation. In our method, a simplified feature pyramid was designed to decode the hierarchical pixel features from the backbone, and then decode the category representations into learnable category object embedding queries by cross-attention in the transformer decoder. Finally, pixel representation is augmented by an additional cross-attention in the transformer encoder under the supervision of auxiliary segmentation heads. The results of extensive experiments on the aerial image dataset Potsdam and satellite image dataset Gaofen Image Dataset with 15 categories (GID-15) demonstrate that the cross-attention is effective, and our method achieved the mean intersection over union (mIoU) of 86.2% and 62.5% on the Potsdam test set and GID-15 validation set, respectively. Additionally, we achieved an inference speed of 76 frames per second (FPS) on the Potsdam test dataset, higher than all the state-of-the-art models we tested on the same device.
引用
收藏
页数:20
相关论文
共 50 条
  • [31] High-resolution optical remote sensing imagery change detection through deep transfer learning
    Larabi, Mohammed El Amin
    Chaib, Souleyman
    Bakhti, Khadidja
    Hasni, Kamel
    Bouhlala, Mohammed Amine
    JOURNAL OF APPLIED REMOTE SENSING, 2019, 13 (04)
  • [32] DEEP HIERARCHICAL REPRESENTATION AND SEGMENTATION OF HIGH RESOLUTION REMOTE SENSING IMAGES
    Wang, Jun
    Qin, Qiming
    Li, Zhoujing
    Ye, Xin
    Wang, Jianhua
    Yang, Xiucheng
    Qin, Xuebin
    2015 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2015, : 4320 - 4323
  • [33] MsanlfNet: Semantic Segmentation Network With Multiscale Attention and Nonlocal Filters for High-Resolution Remote Sensing Images
    Bai, Lin
    Lin, Xiangyuan
    Ye, Zhen
    Xue, Dongling
    Yao, Cheng
    Hui, Meng
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [34] SCAttNet: Semantic Segmentation Network With Spatial and Channel Attention Mechanism for High-Resolution Remote Sensing Images
    Li, Haifeng
    Qiu, Kaijian
    Chen, Li
    Mei, Xiaoming
    Hong, Liang
    Tao, Chao
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2021, 18 (05) : 905 - 909
  • [35] Attention-Based Pyramid Network for Segmentation and Classification of High-Resolution and Hyperspectral Remote Sensing Images
    Xu, Qingsong
    Yuan, Xin
    Ouyang, Chaojun
    Zeng, Yue
    REMOTE SENSING, 2020, 12 (21) : 1 - 34
  • [36] We Need to Communicate: Communicating Attention Network for Semantic Segmentation of High-Resolution Remote Sensing Images
    Meng, Xichen
    Zhu, Liqun
    Han, Yilong
    Zhang, Hanchao
    REMOTE SENSING, 2023, 15 (14)
  • [37] Attention-Enhanced Urban Fugitive Dust Source Segmentation in High-Resolution Remote Sensing Images
    He, Xiaoqing
    Wang, Zhibao
    Bai, Lu
    Fan, Meng
    Chen, Yuanlin
    Chen, Liangfu
    REMOTE SENSING, 2024, 16 (20)
  • [38] Identification of shelterbelt width from high-resolution remote sensing imagery
    Deng, Rongxin
    Yang, Gao
    Li, Ying
    Xu, Zhengran
    Zhang, Xing
    Zhang, Lu
    Li, Chunjing
    AGROFORESTRY SYSTEMS, 2022, 96 (08) : 1091 - 1101
  • [39] Automated object recognition in high-resolution optical remote sensing imagery
    Yao, Yazhou
    Chen, Tao
    Bi, Hanbo
    Cai, Xinhao
    Pei, Gensheng
    Yang, Guoye
    Yan, Zhiyuan
    Sun, Xian
    Xu, Xing
    Zhang, Hai
    NATIONAL SCIENCE REVIEW, 2023, 10 (06)
  • [40] Study of various resampling techniques for high-resolution remote sensing imagery
    Gurjar S.B.
    Padmanabhan N.
    Journal of the Indian Society of Remote Sensing, 2005, 33 (1) : 113 - 120