HyperSINet: A Synergetic Interaction Network Combined With Convolution and Transformer for Hyperspectral Image Classification

被引:8
|
作者
Yu, Qixing [1 ]
Wei, Weibo [1 ]
Li, Dantong [2 ]
Pan, Zhenkuan [1 ]
Li, Chenyu [3 ,4 ]
Hong, Danfeng [4 ,5 ]
机构
[1] Qingdao Univ, Coll Comp Sci & Technol, Qingdao 266071, Peoples R China
[2] Cardiff Univ, Cardiff Sch Comp Sci & Informat, Cardiff CF24 4AG, Wales
[3] Southeast Univ, Sch Math & Stat, Nanjing 211189, Peoples R China
[4] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100094, Peoples R China
[5] Univ Chinese Acad Sci, Sch Elect Elect & Commun Engn, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
Convolutional neural network (CNN); hyperspectral image (HIS) classification; interactors; synergetic interaction; vision transformer (VIT);
D O I
10.1109/TGRS.2024.3362471
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
In hyperspectral images (HSIs), both local and nonlocal features play crucial roles in classification tasks. Vision transformers (VITs) can extract nonlocal features through attention mechanisms, while convolutional neural networks (CNNs) excel at handling local components. However, in traditional dual-branch models based on VIT and CNN, there is a lack of interaction during feature processing, leading to potential compatibility issues when merging the two types of features. In this article, we propose HyperSINet, a synergetic interaction network that combines VIT and CNN to establish interaction between the two branches, enabling mutual compensation between local and nonlocal features during the training process and ultimately enhancing the performance of classification tasks. Specifically, we devise a pair of interactors, namely, Conv2Trans and Trans2Conv, which serve as intermediaries between the two branches, enabling the VIT branch to refine its local details, while allowing the CNN branch to process larger receptive field nonlocal features. Typical feature maps are implemented to visualize the function of the interactors. Furthermore, within the VIT branch, a VIT encoder with the local mask is developed to strike a balance between emphasizing nonlocal features and preserving local details, while a lightweight CNN block is designed to process spectral and spatial features in the CNN branch. Extensive experiments conducted on four real-world datasets demonstrate that, under a reasonable count of parameters, HyperSINet surpasses several current state-of-the-art methods.
引用
收藏
页码:1 / 18
页数:18
相关论文
共 50 条
  • [1] CITNet: Convolution Interaction Transformer Network for Hyperspectral and LiDAR Image Classification
    Wang, Minhui
    Sun, Yaxiu
    Xiang, Jianhong
    Zhong, Yu
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [2] Convolution Transformer Fusion Splicing Network for Hyperspectral Image Classification
    Zhao, Feng
    Li, Shijie
    Zhang, Junjie
    Liu, Hanqiang
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [3] Convolution Transformer Fusion Splicing Network for Hyperspectral Image Classification
    Zhao, Feng
    Li, Shijie
    Zhang, Junjie
    Liu, Hanqiang
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [4] CNN and Transformer interaction network for hyperspectral image classification
    Li, Zhongwei
    Huang, Wenhao
    Wang, Leiquan
    Xin, Ziqi
    Meng, Qiao
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2023, 44 (18) : 5548 - 5573
  • [5] FusionNet: A Convolution-Transformer Fusion Network for Hyperspectral Image Classification
    Yang, Liming
    Yang, Yihang
    Yang, Jinghui
    Zhao, Ningyuan
    Wu, Ling
    Wang, Liguo
    Wang, Tianrui
    REMOTE SENSING, 2022, 14 (16)
  • [6] Convolution-Transformer Adaptive Fusion Network for Hyperspectral Image Classification
    Li, Jiaju
    Xing, Hanfa
    Ao, Zurui
    Wang, Hefeng
    Liu, Wenkai
    Zhang, Anbing
    APPLIED SCIENCES-BASEL, 2023, 13 (01):
  • [7] Convolution Transformer Mixer for Hyperspectral Image Classification
    Zhang, Junjie
    Meng, Zhe
    Zhao, Feng
    Liu, Hanqiang
    Chang, Zhenhui
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [8] A hybrid convolution transformer for hyperspectral image classification
    Arshad, Tahir
    Zhang, Junping
    Ullah, Inam
    EUROPEAN JOURNAL OF REMOTE SENSING, 2024, 57 (01)
  • [9] DBCTNet: Double Branch Convolution-Transformer Network for Hyperspectral Image Classification
    Xu, Rui
    Dong, Xue-Mei
    Li, Weijie
    Peng, Jiangtao
    Sun, Weiwei
    Xu, Yi
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 15
  • [10] CONVOLUTION ENHANCED SPATIAL-SPECTRAL UNIFIED TRANSFORMER NETWORK FOR HYPERSPECTRAL IMAGE CLASSIFICATION
    Xin, Ziqi
    Li, Zhongwei
    Xu, Mingming
    Wang, Leiquan
    Zhu, Xue
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 2267 - 2270