HyperSINet: A Synergetic Interaction Network Combined With Convolution and Transformer for Hyperspectral Image Classification

被引:8
|
作者
Yu, Qixing [1 ]
Wei, Weibo [1 ]
Li, Dantong [2 ]
Pan, Zhenkuan [1 ]
Li, Chenyu [3 ,4 ]
Hong, Danfeng [4 ,5 ]
机构
[1] Qingdao Univ, Coll Comp Sci & Technol, Qingdao 266071, Peoples R China
[2] Cardiff Univ, Cardiff Sch Comp Sci & Informat, Cardiff CF24 4AG, Wales
[3] Southeast Univ, Sch Math & Stat, Nanjing 211189, Peoples R China
[4] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100094, Peoples R China
[5] Univ Chinese Acad Sci, Sch Elect Elect & Commun Engn, Beijing 100049, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷
基金
中国国家自然科学基金;
关键词
Convolutional neural network (CNN); hyperspectral image (HIS) classification; interactors; synergetic interaction; vision transformer (VIT);
D O I
10.1109/TGRS.2024.3362471
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
In hyperspectral images (HSIs), both local and nonlocal features play crucial roles in classification tasks. Vision transformers (VITs) can extract nonlocal features through attention mechanisms, while convolutional neural networks (CNNs) excel at handling local components. However, in traditional dual-branch models based on VIT and CNN, there is a lack of interaction during feature processing, leading to potential compatibility issues when merging the two types of features. In this article, we propose HyperSINet, a synergetic interaction network that combines VIT and CNN to establish interaction between the two branches, enabling mutual compensation between local and nonlocal features during the training process and ultimately enhancing the performance of classification tasks. Specifically, we devise a pair of interactors, namely, Conv2Trans and Trans2Conv, which serve as intermediaries between the two branches, enabling the VIT branch to refine its local details, while allowing the CNN branch to process larger receptive field nonlocal features. Typical feature maps are implemented to visualize the function of the interactors. Furthermore, within the VIT branch, a VIT encoder with the local mask is developed to strike a balance between emphasizing nonlocal features and preserving local details, while a lightweight CNN block is designed to process spectral and spatial features in the CNN branch. Extensive experiments conducted on four real-world datasets demonstrate that, under a reasonable count of parameters, HyperSINet surpasses several current state-of-the-art methods.
引用
收藏
页码:1 / 18
页数:18
相关论文
共 50 条
  • [41] Hyperspectral image classification with multi-scale graph convolution network
    Zhao, Wenzhi
    Wu, Dinghui
    Liu, Yuanlin
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2021, 42 (21) : 8380 - 8397
  • [42] Graph Neural Network via Edge Convolution for Hyperspectral Image Classification
    Hu, Haojie
    Yao, Minli
    He, Fang
    Zhang, Fenggan
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [43] SHORT AND LONG RANGE GRAPH CONVOLUTION NETWORK FOR HYPERSPECTRAL IMAGE CLASSIFICATION
    Zhu, Wenxiang
    Zhao, Chunhui
    Qin, Boao
    Feng, Shou
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 3564 - 3567
  • [44] Center Weighted Convolution and GraphSAGE Cooperative Network for Hyperspectral Image Classification
    Cui, Ying
    Shao, Chao
    Luo, Li
    Wang, Liguo
    Gao, Shan
    Chen, Liwei
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [45] Multiscale Residual Network With Mixed Depthwise Convolution for Hyperspectral Image Classification
    Gao, Hongmin
    Yang, Yao
    Li, Chenming
    Gao, Lianru
    Zhang, Bing
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (04): : 3396 - 3408
  • [46] Faster Multiscale Capsule Network With Octave Convolution for Hyperspectral Image Classification
    Xu, Qin
    Wang, Dongyue
    Luo, Bin
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2021, 18 (02) : 361 - 365
  • [47] Hyperspectral Image Classification Based on Convolution Neural Network with Attention Mechanism
    Chen Wenhao
    Jing, He
    Gang, Liu
    LASER & OPTOELECTRONICS PROGRESS, 2022, 59 (18)
  • [48] Synergistic spectral and spatial feature analysis with transformer and convolution networks for hyperspectral image classification
    Dhirendra Prasad Yadav
    Deepak Kumar
    Anand Singh Jalal
    Ankit Kumar
    B. Kada
    Signal, Image and Video Processing, 2024, 18 : 2975 - 2990
  • [49] Generative Adversarial Networks Based on Transformer Encoder and Convolution Block for Hyperspectral Image Classification
    Bai, Jing
    Lu, Jiawei
    Xiao, Zhu
    Chen, Zheng
    Jiao, Licheng
    REMOTE SENSING, 2022, 14 (14)
  • [50] Synergistic spectral and spatial feature analysis with transformer and convolution networks for hyperspectral image classification
    Yadav, Dhirendra Prasad
    Kumar, Deepak
    Jalal, Anand Singh
    Kumar, Ankit
    Kada, B.
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (04) : 2975 - 2990