Optical Remote Sensing Image Cloud Detection with Self-Attention and Spatial Pyramid Pooling Fusion

被引:14
|
作者
Pu, Weihua [1 ]
Wang, Zhipan [2 ]
Liu, Di [2 ]
Zhang, Qingling [2 ]
机构
[1] Shenzhen Aerosp Dongfanghong Satellite Co Ltd, Shenzhen 518061, Peoples R China
[2] Sun Yat Sen Univ, Sch Aeronaut & Astronaut, Shenzhen Campus, Shenzhen 518100, Peoples R China
关键词
cloud detection; self-attention; pyramid pooling module; semantic segmentation; optical remote sensing image; DETECTION ALGORITHM;
D O I
10.3390/rs14174312
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Cloud detection is a key step in optical remote sensing image processing, and the cloud-free image is of great significance for land use classification, change detection, and long time-series landcover monitoring. Traditional cloud detection methods based on spectral and texture features have acquired certain effects in complex scenarios, such as cloud-snow mixing, but there is still a large room for improvement in terms of generation ability. In recent years, cloud detection with deep-learning methods has significantly improved the accuracy in complex regions such as high-brightness feature mixing areas. However, the existing deep learning-based cloud detection methods still have certain limitations. For instance, a few omission alarms and commission alarms still exist in cloud edge regions. At present, the cloud detection methods based on deep learning are gradually converted from a pure convolutional structure to a global feature extraction perspective, such as attention modules, but the computational burden is also increased, which is difficult to meet for the rapidly developing time-sensitive tasks, such as onboard real-time cloud detection in optical remote sensing imagery. To address the above problems, this manuscript proposes a high-precision cloud detection network fusing a self-attention module and spatial pyramidal pooling. Firstly, we use the DenseNet network as the backbone, then the deep semantic features are extracted by combining a global self-attention module and spatial pyramid pooling module. Secondly, to solve the problem of unbalanced training samples, we design a weighted cross-entropy loss function to optimize it. Finally, cloud detection accuracy is assessed. With the quantitative comparison experiments on different images, such as Landsat8, Landsat9, GF-2, and Beijing-2, the results indicate that, compared with the feature-based methods, the deep learning network can effectively distinguish in the cloud-snow confusion-prone region using only visible three-channel images, which significantly reduces the number of required image bands. Compared with other deep learning methods, the accuracy at the edge of the cloud region is higher and the overall computational efficiency is relatively optimal.
引用
收藏
页数:22
相关论文
共 50 条
  • [31] Remote Sensing Image Target Detection Model Based on Attention and Feature Fusion
    Wang Yani
    Wang Xili
    LASER & OPTOELECTRONICS PROGRESS, 2021, 58 (02)
  • [32] Improving Cloud/Snow Detection in Remote Sensing Image with Spatiotemporal Information Fusion
    Wen, Jianfeng
    Zhang, Hao
    He, Changxian
    Xu, Gang
    SECURITY AND COMMUNICATION NETWORKS, 2022, 2022
  • [33] DBPFNet: a dual-band polarization image fusion network based on the attention mechanism and atrous spatial pyramid pooling
    Wu, Yunan
    Chang, Jun
    Ma, Ning
    Yang, Yining
    Ji, Zhongye
    Huang, Yi
    OPTICS LETTERS, 2023, 48 (19) : 5125 - 5128
  • [34] Vehicle Pedestrian Detection Method Based on Spatial Pyramid Pooling and Attention Mechanism
    Guo, Mingtao
    Xue, Donghui
    Li, Peng
    Xu, He
    INFORMATION, 2020, 11 (12) : 1 - 15
  • [35] GLSANet: Global-Local Self-Attention Network for Remote Sensing Image Semantic Segmentation
    Hu, Xudong
    Zhang, Penglin
    Zhang, Qi
    Yuan, Feng
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [36] Change Detection in Remote-Sensing Images Using Pyramid Pooling Dynamic Sparse Attention Network With Difference Enhancement
    Li, Zhong
    Ouyang, Bin
    Qiu, Shaohua
    Xu, Xinghua
    Cui, Xiaopeng
    Hua, Xia
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 7052 - 7067
  • [37] GLSANet: Global-Local Self-Attention Network for Remote Sensing Image Semantic Segmentation
    Hu, Xudong
    Zhang, Penglin
    Zhang, Qi
    Yuan, Feng
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [38] UAV image object detection based on self-attention guidance and global feature fusion
    Bai, Jing
    Hu, Haiyang
    Liu, Xiaojing
    Zhuang, Shanna
    Wang, Zhengyou
    IMAGE AND VISION COMPUTING, 2024, 151
  • [39] SAFF-SSD: Self-Attention Combined Feature Fusion-Based SSD for Small Object Detection in Remote Sensing
    Huo, Bihan
    Li, Chenglong
    Zhang, Jianwei
    Xue, Yingjian
    Lin, Zhoujin
    REMOTE SENSING, 2023, 15 (12)
  • [40] Multihead Global Attention and Spatial Spectral Information Fusion for Remote Sensing Image Compression
    Shi, Cuiping
    Shi, Kaijie
    Zhu, Fei
    Zeng, Zexin
    Wang, Liguo
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 999 - 1015