A Novel Transformer Network with a CNN-Enhanced Cross-Attention Mechanism for Hyperspectral Image Classification

被引:7
|
作者
Wang, Xinyu [1 ]
Sun, Le [1 ,2 ]
Lu, Chuhan [3 ]
Li, Baozhu [4 ]
机构
[1] Nanjing Univ Informat Sci & Technol, Sch Comp Sci, Nanjing 210044, Peoples R China
[2] Nanjing Univ Informat Sci & Technol, Jiangsu Collaborat Innovat Ctr Atmospher Environm, Nanjing 210044, Peoples R China
[3] Nanjing Univ Informat Sci & Technol, Sch Atmospher Sci, Nanjing 210044, Peoples R China
[4] Zhuhai Fudan Innovat Inst, Internet Things & Smart City Innovat Platform, Zhuhai 519031, Peoples R China
关键词
convolutional neural network (CNN); hyperspectral image classification; transformer; multi-scale feature;
D O I
10.3390/rs16071180
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Recently, with the remarkable advancements of deep learning in the field of image processing, convolutional neural networks (CNNs) have garnered widespread attention from researchers in the domain of hyperspectral image (HSI) classification. Moreover, due to the high performance demonstrated by the transformer architecture in classification tasks, there has been a proliferation of neural networks combining CNNs and transformers for HSI classification. However, the majority of the current methods focus on extracting spatial-spectral features from the HSI data of a single size for a pixel, overlooking the rich multi-scale feature information inherent to the data. To address this problem, we designed a novel transformer network with a CNN-enhanced cross-attention (TNCCA) mechanism for HSI classification. It is a dual-branch network that utilizes different scales of HSI input data to extract shallow spatial-spectral features using a multi-scale 3D and 2D hybrid convolutional neural network. After converting the feature maps into tokens, a series of 2D convolutions and dilated convolutions are employed to generate two sets of Q (queries), K (keys), and V (values) at different scales in a cross-attention module. This transformer with CNN-enhanced cross-attention explores multi-scale CNN-enhanced features and fuses them from both branches. Experimental evaluations conducted on three widely used hyperspectral image (HSI) datasets, under the constraint of limited sample size, demonstrate excellent classification performance of the proposed network.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] LiDAR-Guided Cross-Attention Fusion for Hyperspectral Band Selection and Image Classification
    Yang, Judy X.
    Zhou, Jun
    Wang, Jing
    Tian, Hui
    Liew, Alan Wee-Chung
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 15
  • [22] Deformable Cross-Attention Transformer for Medical Image Registration
    Chen, Junyu
    Liu, Yihao
    He, Yufan
    Du, Yong
    MACHINE LEARNING IN MEDICAL IMAGING, MLMI 2023, PT I, 2024, 14348 : 115 - 125
  • [23] Double Attention Transformer for Hyperspectral Image Classification
    Tang, Ping
    Zhang, Meng
    Liu, Zhihui
    Song, Rong
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [24] Hierarchical Attention Transformer for Hyperspectral Image Classification
    Arshad, Tahir
    Zhang, Junping
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21 : 1 - 5
  • [25] A Novel Transformer Network With Shifted Window Cross-Attention for Spatiotemporal Weather Forecasting
    Bojesomo, Alabi
    Almarzouqi, Hasan
    Liatsis, Panos
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 45 - 55
  • [26] A Multiscale Cross Interaction Attention Network for Hyperspectral Image Classification
    Liu, Dongxu
    Wang, Yirui
    Liu, Peixun
    Li, Qingqing
    Yang, Hang
    Chen, Dianbing
    Liu, Zhichao
    Han, Guangliang
    REMOTE SENSING, 2023, 15 (02)
  • [27] CROSS-DOMAIN ATTENTION NETWORK FOR HYPERSPECTRAL IMAGE CLASSIFICATION
    Wang, Chenglong
    Ye, Minchao
    Lei, Ling
    Xiong, Fengchao
    Qian, Yuntao
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 1564 - 1567
  • [28] Hybrid Dense Network With Attention Mechanism for Hyperspectral Image Classification
    Ahmad, Muhammad
    Khan, Adil Mehmood
    Mazzara, Manuel
    Distefano, Salvatore
    Roy, Swalpa Kumar
    Wu, Xin
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 : 3948 - 3957
  • [29] Remote sensing image change detection based on swin transformer and cross-attention mechanism
    Yan, Weidong
    Cao, Li
    Yan, Pei
    Zhu, Chaosheng
    Wang, Mengtian
    EARTH SCIENCE INFORMATICS, 2025, 18 (01)
  • [30] Spatial-Spectral Middle Cross-Attention Fusion Network for Hyperspectral Image Superresolution
    Lang, Xiujuan
    Lu, Tao
    Zhang, Yanduo
    Jiang, Junjun
    Xiong, Zixiang
    PHOTOGRAMMETRIC ENGINEERING AND REMOTE SENSING, 2024, 90 (11): : 675 - 686