Pixel Representation Augmented through Cross-Attention for High-Resolution Remote Sensing Imagery Segmentation

Cited by: 3
Authors
Luo, Yiyun [1, 2]
Wang, Jinnian [1, 2]
Yang, Xiankun [1, 2]
Yu, Zhenyu [1, 2]
Tan, Zixuan [1, 2]
Affiliations
[1] Guangzhou Univ, Sch Geog & Remote Sensing, Guangzhou 510006, Peoples R China
[2] Guangzhou Univ, Ctr Remote Sensing Big Data Intelligence Applicat, Guangzhou 510006, Peoples R China
Funding
National Key Research and Development Program of China
Keywords
land cover classification; transformer; cross-attention; object embedding queries; LAND-COVER CLASSIFICATION; SEMANTIC SEGMENTATION; NETWORK;
DOI
10.3390/rs14215415
Chinese Library Classification number
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
Segmentation methods developed for natural imagery have been transferred to land cover classification in remote sensing imagery with excellent performance. However, two key issues have been overlooked in the transfer process: (1) some objects are easily overwhelmed by complex backgrounds; (2) interclass information for indistinguishable classes is not fully utilized. The attention mechanism in the transformer is capable of modeling long-range dependencies on each sample for per-pixel context extraction. Notably, per-pixel context from the attention mechanism can aggregate category information. Therefore, we propose a semantic segmentation method based on pixel representation augmentation. In our method, a simplified feature pyramid decodes the hierarchical pixel features from the backbone, and the category representations are then decoded into learnable category object embedding queries by cross-attention in the transformer decoder. Finally, the pixel representation is augmented by an additional cross-attention in the transformer encoder under the supervision of auxiliary segmentation heads. Extensive experiments on the aerial image dataset Potsdam and the satellite image dataset Gaofen Image Dataset with 15 categories (GID-15) demonstrate that the cross-attention is effective: our method achieved a mean intersection over union (mIoU) of 86.2% on the Potsdam test set and 62.5% on the GID-15 validation set. Additionally, we achieved an inference speed of 76 frames per second (FPS) on the Potsdam test dataset, higher than all the state-of-the-art models we tested on the same device.
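The two cross-attention steps described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes plain scaled dot-product attention with no learned projections, no multi-head splitting, and a simple residual addition. The function names (`cross_attention`, `augment_pixels`) and the toy feature values are hypothetical and chosen only for illustration.

```python
import math

def softmax(row):
    """Numerically stable softmax over a list of scores."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]

def cross_attention(queries, keys, values):
    """Scaled dot-product cross-attention: each query attends over all keys."""
    d = len(keys[0])
    outputs, weights = [], []
    for q in queries:
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d) for k in keys]
        w = softmax(scores)
        weights.append(w)
        # Weighted sum of the value vectors for this query.
        outputs.append([sum(wj * v[c] for wj, v in zip(w, values))
                        for c in range(len(values[0]))])
    return outputs, weights

def augment_pixels(pixels, category_queries):
    """Hypothetical two-step augmentation (an assumption, not the paper's code).

    Step 1: category queries gather category representations from the pixels.
    Step 2: pixels attend to those representations and add the result back.
    """
    category_repr, _ = cross_attention(category_queries, pixels, pixels)
    context, weights = cross_attention(pixels, category_repr, category_repr)
    augmented = [[p + c for p, c in zip(px, ctx)]
                 for px, ctx in zip(pixels, context)]
    return augmented, weights

# Toy input: three pixel features (dimension 2) and two category queries.
pixels = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]]
category_queries = [[1.0, 0.0], [0.0, 1.0]]
augmented, weights = augment_pixels(pixels, category_queries)
```

In this toy setup, each row of `weights` is a distribution over the two categories for one pixel, so a pixel whose feature resembles a category query places more weight on that category's representation. In a real model the queries, keys, and values would pass through learned linear projections.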
Pages: 20