SS-MAE: Spatial–Spectral Masked Autoencoder for Multisource Remote Sensing Image Classification

被引:30
|
作者
Lin, Junyan [1 ]
Gao, Feng [1 ]
Shi, Xiaochen [1 ]
Dong, Junyu [1 ]
Du, Qian [2 ]
机构
[1] Ocean Univ China, Sch Comp Sci & Technol, Qingdao 266100, Peoples R China
[2] Mississippi State Univ, Dept Elect & Comp Engn, Starkville, MS 39762 USA
关键词
Image reconstruction; Feature extraction; Transformers; Image classification; Training; Decoding; Self-supervised learning; Deep learning; hyperspectral image (HSI); masked autoencoder (MAE); multisource data; DECISION FUSION;
D O I
10.1109/TGRS.2023.3331717
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Masked image modeling (MIM) is a highly popular and effective self-supervised learning method for image understanding. The existing MIM-based methods mostly focus on spatial feature modeling, neglecting spectral feature modeling. Meanwhile, the existing MIM-based methods use Transformer for feature extraction, and some local or high-frequency information may get lost. To this end, we propose a spatial-spectral masked autoencoder (SS-MAE) for hyperspectral image (HSI) and light detection and ranging (LiDAR)/synthetic aperture radar (SAR) data joint classification. Specifically, SS-MAE consists of a spatialwise branch and a spectralwise branch. The spatialwise branch masks random patches and reconstructs missing pixels, while the spectralwise branch masks random spectral channels and reconstructs missing channels. Our SS-MAE fully exploits the spatial and spectral representations of the input data. Furthermore, to complement local features in the training stage, we add two lightweight convolutional nerual networks (CNNs) for feature extraction. Both global and local features are taken into account for feature modeling. To demonstrate the effectiveness of the proposed SS-MAE, we conduct extensive experiments on three publicly available datasets. Extensive experiments on three multisource datasets verify the superiority of our SS-MAE compared with several state-of-the-art baselines. The source codes are available at https://github.com/summitgao/SS-MAE.
引用
收藏
页码:1 / 14
页数:14
相关论文
共 50 条
  • [21] Remote Sensing Image Classification Using the Spectral-Spatial Distance Based on Information Content
    Chen, Siya
    Zhang, Hongyan
    Sun, Tieli
    Zhao, Jianjun
    Guo, Xiaoyi
    SENSORS, 2018, 18 (10)
  • [22] SUPER PIXEL BASED REMOTE SENSING IMAGE CLASSIFICATION WITH HISTOGRAM DESCRIPTORS ON SPECTRAL AND SPATIAL DATA
    Zhang, Guangyun
    Jia, Xiuping
    Kwok, Ngai M.
    2012 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2012, : 4335 - 4338
  • [23] Land-use Classification with Remote Sensing Image Based on Stacked Autoencoder
    Ding, Anzi
    Zhou, Xinmin
    2016 2ND INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS - COMPUTING TECHNOLOGY, INTELLIGENT TECHNOLOGY, INDUSTRIAL INFORMATION INTEGRATION (ICIICII), 2016, : 145 - 149
  • [24] S2MAE: A Spatial-Spectral Pretraining Foundation Model for Spectral Remote Sensing Data
    Li, Xuyang
    Hong, Danfeng
    Chanussot, Jocelyn
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 27696 - 27705
  • [25] Cross-Domain Classification of Multisource Remote Sensing Data Using Fractional Fusion and Spatial-Spectral Domain Adaptation
    Zhao, Xudong
    Zhang, Mengmeng
    Tao, Ran
    Li, Wei
    Liao, Wenzhi
    Philips, Wilfried
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 : 5721 - 5733
  • [26] GO-MAE: Self-supervised pre-training via masked autoencoder for OCT image classification of gynecology
    Wang, Haoran
    Guo, Xinyu
    Song, Kaiwen
    Sun, Mingyang
    Shao, Yanbin
    Xue, Songfeng
    Zhang, Hongwei
    Zhang, Tianyu
    NEURAL NETWORKS, 2025, 181
  • [27] Multisource Collaborative Domain Generalization for Cross-Scene Remote Sensing Image Classification
    Han, Zhu
    Zhang, Ce
    Gao, Lianru
    Zeng, Zhiqiang
    Ng, Michael K.
    Zhang, Bing
    Chanussot, Jocelyn
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [28] A neural-statistical approach to multitemporal and multisource remote-sensing image classification
    Bruzzone, L
    Prieto, DF
    Serpico, SB
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 1999, 37 (03): : 1350 - 1359
  • [29] Research on Multisource Remote Sensing Image Classification Algorithms Based on Image Fusion and the EM-HMRF
    He, Guiqing
    Peng, Jinye
    Feng, Xiaoyi
    Wang, Jun
    2012 6TH INTERNATIONAL CONFERENCE ON NEW TRENDS IN INFORMATION SCIENCE, SERVICE SCIENCE AND DATA MINING (ISSDM2012), 2012, : 185 - 192
  • [30] Flow-MAE: Leveraging Masked AutoEncoder for Accurate, Efficient and Robust Malicious Traffic Classification
    Hang, Zijun
    Lu, Yuliang
    Wang, Yongjie
    Xie, Yi
    PROCEEDINGS OF THE 26TH INTERNATIONAL SYMPOSIUM ON RESEARCH IN ATTACKS, INTRUSIONS AND DEFENSES, RAID 2023, 2023, : 297 - 314