A Novel Transformer Network with a CNN-Enhanced Cross-Attention Mechanism for Hyperspectral Image Classification

Cited by: 7
Authors
Wang, Xinyu [1]
Sun, Le [1, 2]
Lu, Chuhan [3]
Li, Baozhu [4]
Affiliations
[1] Nanjing Univ Informat Sci & Technol, Sch Comp Sci, Nanjing 210044, Peoples R China
[2] Nanjing Univ Informat Sci & Technol, Jiangsu Collaborat Innovat Ctr Atmospher Environm, Nanjing 210044, Peoples R China
[3] Nanjing Univ Informat Sci & Technol, Sch Atmospher Sci, Nanjing 210044, Peoples R China
[4] Zhuhai Fudan Innovat Inst, Internet Things & Smart City Innovat Platform, Zhuhai 519031, Peoples R China
Keywords
convolutional neural network (CNN); hyperspectral image classification; transformer; multi-scale feature
DOI
10.3390/rs16071180
Chinese Library Classification number
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
Recently, with the remarkable advances of deep learning in image processing, convolutional neural networks (CNNs) have attracted widespread attention for hyperspectral image (HSI) classification. Moreover, because the transformer architecture performs well in classification tasks, many networks combining CNNs and transformers have been proposed for HSI classification. However, most current methods extract spatial-spectral features from HSI data of a single size for each pixel, overlooking the rich multi-scale feature information inherent in the data. To address this problem, we designed a novel transformer network with a CNN-enhanced cross-attention (TNCCA) mechanism for HSI classification. It is a dual-branch network that takes HSI input data at different scales and extracts shallow spatial-spectral features with a multi-scale 3D and 2D hybrid convolutional neural network. After the feature maps are converted into tokens, a series of 2D convolutions and dilated convolutions generates two sets of Q (queries), K (keys), and V (values) at different scales in a cross-attention module. This transformer with CNN-enhanced cross-attention explores multi-scale CNN-enhanced features and fuses them from both branches. Experiments on three widely used HSI datasets, under the constraint of a limited sample size, demonstrate the excellent classification performance of the proposed network.
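The cross-attention step described in the abstract can be illustrated with a minimal sketch. This is generic scaled dot-product cross-attention, not the authors' implementation: the abstract says Q, K, and V come from 2D and dilated convolutions, whereas the sketch below swaps in plain linear projections (identity matrices in the toy example) and uses dependency-free Python lists instead of a tensor library. All function names, variable names, and token values are hypothetical.

```python
import math

def matmul(a, b):
    # Multiply two matrices stored as lists of rows.
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def transpose(m):
    return [list(col) for col in zip(*m)]

def softmax(row):
    # Subtract the row maximum for numerical stability.
    peak = max(row)
    exps = [math.exp(v - peak) for v in row]
    total = sum(exps)
    return [e / total for e in exps]

def cross_attention(tokens_q, tokens_kv, w_q, w_k, w_v):
    # Queries come from one branch; keys and values come from the other.
    # ASSUMPTION: linear projections stand in for the convolutional
    # Q/K/V generation mentioned in the abstract.
    q = matmul(tokens_q, w_q)
    k = matmul(tokens_kv, w_k)
    v = matmul(tokens_kv, w_v)
    scale = math.sqrt(len(q[0]))
    scores = matmul(q, transpose(k))
    weights = [softmax([s / scale for s in row]) for row in scores]
    return matmul(weights, v), weights

# Toy tokens for two branches (hypothetical values, embedding size 2).
identity = [[1.0, 0.0], [0.0, 1.0]]
small_scale_tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]  # 3 tokens
large_scale_tokens = [[0.5, 0.5], [2.0, 0.0]]              # 2 tokens

# Each branch attends to the other, giving two fused token sets.
fused_small, weights_small = cross_attention(
    small_scale_tokens, large_scale_tokens, identity, identity, identity)
fused_large, weights_large = cross_attention(
    large_scale_tokens, small_scale_tokens, identity, identity, identity)
```

Each output keeps the token count of the branch that supplied the queries, so `fused_small` has three tokens and `fused_large` has two. How the paper combines the two fused sets for classification is not stated in the abstract and is left out here.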
Pages: 18