SS-MAE: Spatial–Spectral Masked Autoencoder for Multisource Remote Sensing Image Classification

被引:30
|
作者
Lin, Junyan [1 ]
Gao, Feng [1 ]
Shi, Xiaochen [1 ]
Dong, Junyu [1 ]
Du, Qian [2 ]
机构
[1] Ocean Univ China, Sch Comp Sci & Technol, Qingdao 266100, Peoples R China
[2] Mississippi State Univ, Dept Elect & Comp Engn, Starkville, MS 39762 USA
关键词
Image reconstruction; Feature extraction; Transformers; Image classification; Training; Decoding; Self-supervised learning; Deep learning; hyperspectral image (HSI); masked autoencoder (MAE); multisource data; DECISION FUSION;
D O I
10.1109/TGRS.2023.3331717
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Masked image modeling (MIM) is a highly popular and effective self-supervised learning method for image understanding. The existing MIM-based methods mostly focus on spatial feature modeling, neglecting spectral feature modeling. Meanwhile, the existing MIM-based methods use Transformer for feature extraction, and some local or high-frequency information may get lost. To this end, we propose a spatial-spectral masked autoencoder (SS-MAE) for hyperspectral image (HSI) and light detection and ranging (LiDAR)/synthetic aperture radar (SAR) data joint classification. Specifically, SS-MAE consists of a spatialwise branch and a spectralwise branch. The spatialwise branch masks random patches and reconstructs missing pixels, while the spectralwise branch masks random spectral channels and reconstructs missing channels. Our SS-MAE fully exploits the spatial and spectral representations of the input data. Furthermore, to complement local features in the training stage, we add two lightweight convolutional nerual networks (CNNs) for feature extraction. Both global and local features are taken into account for feature modeling. To demonstrate the effectiveness of the proposed SS-MAE, we conduct extensive experiments on three publicly available datasets. Extensive experiments on three multisource datasets verify the superiority of our SS-MAE compared with several state-of-the-art baselines. The source codes are available at https://github.com/summitgao/SS-MAE.
引用
收藏
页码:1 / 14
页数:14
相关论文
共 50 条
  • [41] Spectral–spatial multi-feature-based deep learning for hyperspectral remote sensing image classification
    Lizhe Wang
    Jiabin Zhang
    Peng Liu
    Kim-Kwang Raymond Choo
    Fang Huang
    Soft Computing, 2017, 21 : 213 - 221
  • [42] Spectral and Multi-spatial-feature based deep learning for hyperspectral remote sensing image classification
    Chen, Chen
    Zhang, JingJing
    Li, Teng
    Yan, Qing
    Xun, LiNa
    PROCEEDINGS OF 2018 IEEE INTERNATIONAL CONFERENCE ON REAL-TIME COMPUTING AND ROBOTICS (IEEE RCAR), 2018, : 421 - 426
  • [43] 3D residual spatial–spectral convolution network for hyperspectral remote sensing image classification
    Hüseyin Firat
    Mehmet Emin Asker
    Mehmet İlyas Bayindir
    Davut Hanbay
    Neural Computing and Applications, 2023, 35 : 4479 - 4497
  • [44] A Universal Representation Mechanism for Multisource Remote Sensing Image
    Kong, Weili
    Liu, Baisen
    Bi, Xiaojun
    He, Yihan
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21
  • [45] Remote Sensing Image Classification Based on Ensemble Extreme Learning Machine With Stacked Autoencoder
    Lv, Fei
    Han, Min
    Qiu, Tie
    IEEE ACCESS, 2017, 5 : 9021 - 9031
  • [46] A spectral-spatial classification algorithm for multispectral remote sensing data
    Karakahya, H
    Yazgan, B
    Ersoy, OK
    ARTIFICAIL NEURAL NETWORKS AND NEURAL INFORMATION PROCESSING - ICAN/ICONIP 2003, 2003, 2714 : 1011 - 1017
  • [47] Representation-Enhanced Status Replay Network for Multisource Remote-Sensing Image Classification
    Wang, Junjie
    Li, Wei
    Wang, Yinjian
    Tao, Ran
    Du, Qian
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (11) : 15346 - 15358
  • [48] MSFMamba: Multiscale Feature Fusion State Space Model for Multisource Remote Sensing Image Classification
    Gao, Feng
    Jin, Xuepeng
    Zhou, Xiaowei
    Dong, Junyu
    Du, Qian
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
  • [49] Representation-Enhanced Status Replay Network for Multisource Remote-Sensing Image Classification
    Wang, Junjie
    Li, Wei
    Wang, Yinjian
    Tao, Ran
    Du, Qian
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (11) : 15346 - 15358
  • [50] Project Gradient Descent Adversarial Attack against Multisource Remote Sensing Image Scene Classification
    Jiang, Yan
    Yin, Guisheng
    Yuan, Ye
    Da, Qingan
    SECURITY AND COMMUNICATION NETWORKS, 2021, 2021