HyperSINet: A Synergetic Interaction Network Combined With Convolution and Transformer for Hyperspectral Image Classification

被引：8

作者：

Yu, Qixing ^{[1
]}

Wei, Weibo ^{[1
]}

Li, Dantong ^{[2
]}

Pan, Zhenkuan ^{[1
]}

Li, Chenyu ^{[3
,4
]}

Hong, Danfeng ^{[4
,5
]}

机构：

[1] Qingdao Univ, Coll Comp Sci & Technol, Qingdao 266071, Peoples R China

[2] Cardiff Univ, Cardiff Sch Comp Sci & Informat, Cardiff CF24 4AG, Wales

[3] Southeast Univ, Sch Math & Stat, Nanjing 211189, Peoples R China

[4] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100094, Peoples R China

[5] Univ Chinese Acad Sci, Sch Elect Elect & Commun Engn, Beijing 100049, Peoples R China

来源：

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷

基金：

中国国家自然科学基金;

关键词：

Convolutional neural network (CNN); hyperspectral image (HIS) classification; interactors; synergetic interaction; vision transformer (VIT);

D O I：

10.1109/TGRS.2024.3362471

中图分类号：

P3 [地球物理学]; P59 [地球化学];

学科分类号：

0708 ; 070902 ;

摘要：

In hyperspectral images (HSIs), both local and nonlocal features play crucial roles in classification tasks. Vision transformers (VITs) can extract nonlocal features through attention mechanisms, while convolutional neural networks (CNNs) excel at handling local components. However, in traditional dual-branch models based on VIT and CNN, there is a lack of interaction during feature processing, leading to potential compatibility issues when merging the two types of features. In this article, we propose HyperSINet, a synergetic interaction network that combines VIT and CNN to establish interaction between the two branches, enabling mutual compensation between local and nonlocal features during the training process and ultimately enhancing the performance of classification tasks. Specifically, we devise a pair of interactors, namely, Conv2Trans and Trans2Conv, which serve as intermediaries between the two branches, enabling the VIT branch to refine its local details, while allowing the CNN branch to process larger receptive field nonlocal features. Typical feature maps are implemented to visualize the function of the interactors. Furthermore, within the VIT branch, a VIT encoder with the local mask is developed to strike a balance between emphasizing nonlocal features and preserving local details, while a lightweight CNN block is designed to process spectral and spatial features in the CNN branch. Extensive experiments conducted on four real-world datasets demonstrate that, under a reasonable count of parameters, HyperSINet surpasses several current state-of-the-art methods.

引用

页码：1 / 18

页数：18

共 50 条

[41] Hyperspectral image classification with multi-scale graph convolution network
Zhao, Wenzhi
Wu, Dinghui
Liu, Yuanlin
INTERNATIONAL JOURNAL OF REMOTE SENSING, 2021, 42 (21) : 8380 - 8397
[42] Graph Neural Network via Edge Convolution for Hyperspectral Image Classification
Hu, Haojie
Yao, Minli
He, Fang
Zhang, Fenggan
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
[43] SHORT AND LONG RANGE GRAPH CONVOLUTION NETWORK FOR HYPERSPECTRAL IMAGE CLASSIFICATION
Zhu, Wenxiang
Zhao, Chunhui
Qin, Boao
Feng, Shou
2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 3564 - 3567
[44] Center Weighted Convolution and GraphSAGE Cooperative Network for Hyperspectral Image Classification
Cui, Ying
Shao, Chao
Luo, Li
Wang, Liguo
Gao, Shan
Chen, Liwei
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
[45] Multiscale Residual Network With Mixed Depthwise Convolution for Hyperspectral Image Classification
Gao, Hongmin
Yang, Yao
Li, Chenming
Gao, Lianru
Zhang, Bing
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (04): : 3396 - 3408
[46] Faster Multiscale Capsule Network With Octave Convolution for Hyperspectral Image Classification
Xu, Qin
Wang, Dongyue
Luo, Bin
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2021, 18 (02) : 361 - 365
[47] Hyperspectral Image Classification Based on Convolution Neural Network with Attention Mechanism
Chen Wenhao
Jing, He
Gang, Liu
LASER & OPTOELECTRONICS PROGRESS, 2022, 59 (18)
[48] Synergistic spectral and spatial feature analysis with transformer and convolution networks for hyperspectral image classification
Dhirendra Prasad Yadav
Deepak Kumar
Anand Singh Jalal
Ankit Kumar
B. Kada
Signal, Image and Video Processing, 2024, 18 : 2975 - 2990
[49] Generative Adversarial Networks Based on Transformer Encoder and Convolution Block for Hyperspectral Image Classification
Bai, Jing
Lu, Jiawei
Xiao, Zhu
Chen, Zheng
Jiao, Licheng
REMOTE SENSING, 2022, 14 (14)
[50] Synergistic spectral and spatial feature analysis with transformer and convolution networks for hyperspectral image classification
Yadav, Dhirendra Prasad
Kumar, Deepak
Jalal, Anand Singh
Kumar, Ankit
Kada, B.
SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (04) : 2975 - 2990

← 1 2 3 4 5 →