Hyperspectral Image Classification Using Multi-Scale Lightweight Transformer

被引:2
|
作者
Gu, Quan [1 ]
Luan, Hongkang [1 ]
Huang, Kaixuan [1 ]
Sun, Yubao [1 ]
机构
[1] Nanjing Univ Informat Sci & Technol, Engn Res Ctr Digital Forens, Jiangsu Collaborat Innovat Ctr Atmospher Environm, Minist Educ, Nanjing 210044, Peoples R China
基金
中国国家自然科学基金;
关键词
hyperspectral image classification; multi-scale spectral attention; Transformer; long-range spectral dependence; SPARSE REPRESENTATION;
D O I
10.3390/electronics13050949
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The distinctive feature of hyperspectral images (HSIs) is their large number of spectral bands, which allows us to identify categories of ground objects by capturing discrepancies in spectral information. Convolutional neural networks (CNN) with attention modules effectively improve the classification accuracy of HSI. However, CNNs are not successful in capturing long-range spectral-spatial dependence. In recent years, Vision Transformer (VIT) has received widespread attention due to its excellent performance in acquiring long-range features. However, it requires calculating the pairwise correlation between token embeddings and has the complexity of the square of the number of tokens, which leads to an increase in the computational complexity of the network. In order to cope with this issue, this paper proposes a multi-scale spectral-spatial attention network with frequency-domain lightweight Transformer (MSA-LWFormer) for HSI classification. This method synergistically integrates CNN, attention mechanisms, and Transformer into the spectral-spatial feature extraction module and frequency-domain fused classification module. Specifically, the spectral-spatial feature extraction module employs a multi-scale 2D-CNN with multi-scale spectral attention (MS-SA) to extract the shallow spectral-spatial features and capture the long-range spectral dependence. In addition, The frequency-domain fused classification module designs a frequency-domain lightweight Transformer that employs the Fast Fourier Transform (FFT) to convert features from the spatial domain to the frequency domain, effectively extracting global information and significantly reducing the time complexity of the network. Experiments on three classic hyperspectral datasets show that MSA-LWFormer has excellent performance.
引用
收藏
页数:21
相关论文
共 50 条
  • [41] Hyperspectral Image Classification Based on Multi-Scale Convolutional Features and Multi-Attention Mechanisms
    Sun, Qian
    Zhao, Guangrui
    Xia, Xinyuan
    Xie, Yu
    Fang, Chenrong
    Sun, Le
    Wu, Zebin
    Pan, Chengsheng
    REMOTE SENSING, 2024, 16 (12)
  • [42] Superpixel-based multi-scale multi-instance learning for hyperspectral image classification
    Huang, Shiluo
    Liu, Zheng
    Jin, Wei
    Mu, Ying
    PATTERN RECOGNITION, 2024, 149
  • [43] Dynamic multi-scale image classification
    Salden, A
    Iacob, S
    INTERNET IMAGING II, 2001, 4311 : 156 - 167
  • [44] Hyperspectral image change detection based on an improved multi-scale and spectral-wise transformer
    Zou, Changzhong
    Liang, Wenfeng
    Liu, Lei
    Zou, Changwu
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2024, 45 (06) : 1903 - 1924
  • [45] A multi-scale residual capsule network for hyperspectral image classification with small training samples
    Shi, Meilin
    Zeng, Xilong
    Ren, Jiansi
    Shi, Yichang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (26) : 40473 - 40501
  • [46] Multi-Scale Spatial-Spectral Residual Attention Network for Hyperspectral Image Classification
    Wu, Qinggang
    He, Mengkun
    Liu, Zhongchi
    Liu, Yanyan
    ELECTRONICS, 2024, 13 (02)
  • [47] Spectral Segmentation Multi-Scale Feature Extraction Residual Networks for Hyperspectral Image Classification
    Wang, Jiamei
    Ren, Jiansi
    Peng, Yinbin
    Shi, Meilin
    REMOTE SENSING, 2023, 15 (17)
  • [48] A multi-scale residual capsule network for hyperspectral image classification with small training samples
    Meilin Shi
    Xilong Zeng
    Jiansi Ren
    Yichang Shi
    Multimedia Tools and Applications, 2023, 82 : 40473 - 40501
  • [49] Hyperspectral image classification based on clustering dimensionality reduction and multi-scale feature fusion
    Cailing Wang
    Xiaonan Song
    Jing Zhang
    Machine Vision and Applications, 2022, 33
  • [50] Hyperspectral image classification based on clustering dimensionality reduction and multi-scale feature fusion
    Wang, Cailing
    Song, Xiaonan
    Zhang, Jing
    MACHINE VISION AND APPLICATIONS, 2022, 33 (06)