AMS: A hyperspectral image classification method based on SVM and multi-modal attention network

被引:0
|
作者
Chen, Yingxia [1 ,2 ]
Liu, Zhaoheng [1 ]
Chen, Zeqiang [3 ]
机构
[1] Yangtze Univ, Sch Comp Sci, Jingzhou 432023, Peoples R China
[2] East China Normal Univ, Shanghai Key Lab Multidimens Informat Proc, Shanghai 200241, Peoples R China
[3] China Univ Geosci, Natl Engn Res Ctr Geog Informat Syst, Wuhan 430074, Peoples R China
关键词
Hyperspectral image classification; Convolutional neural network; Attention mechanism; Cross-layer adaptive fusion; Support vector machine; FEATURE-EXTRACTION;
D O I
10.1016/j.knosys.2025.113236
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hyperspectral (HS) image classification technology is increasingly used for identifying land cover categories. However, spectral aliasing restricts its ability to accurately and completely capture land cover features. To overcome this issue, herein, we introduce a classification method that integrates three modules, namely, a convolutional neural network with an attention mechanism (AMCNN), multi-modal cross-layer adaptive fusion encoder (MCAFE) and support vector machine (SVM), which is referred to as attention-based multi-modal crosslayer fusion network with SVM (AMS). In particular, AMCNN integrates convolution and attention mechanisms to overcome the limitations of a single CNN structure in dynamically allocating attention. MCAFE is proposed to overcome the issues of ineffective inter-layer information interaction and gradient vanishing commonly observed in stacked encoder layers structures. Furthermore, SVM is used to obtain the decision boundaries because of its better performance on linearly separable data than on traditional fully connected (FC) layers. Experimental results demonstrate that AMS considerably enhances the overall accuracy (OA), average accuracy (AA) and Kappa metrics on the Houston and MUUFL datasets, outperforming other state-of-the-art methods.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] Multi-Modal Retinal Image Classification With Modality-Specific Attention Network
    He, Xingxin
    Deng, Ying
    Fang, Leyuan
    Peng, Qinghua
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2021, 40 (06) : 1591 - 1602
  • [2] Multi-modal tree-based SVM classification
    Freeman, Cecille
    Kulic, Dana
    Basir, Otman
    2013 12TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2013), VOL 1, 2013, : 65 - 71
  • [3] Cross-modal attention network for retinal disease classification based on multi-modal images
    Liu, Zirong
    Hu, Yan
    Qiu, Zhongxi
    Niu, Yanyan
    Zhou, Dan
    Li, Xiaoling
    Shen, Junyong
    Jiang, Hongyang
    Li, Heng
    Liu, Jiang
    BIOMEDICAL OPTICS EXPRESS, 2024, 15 (06): : 3699 - 3714
  • [4] A Stepwise Matching Method for Multi-modal Image based on Cascaded Network
    Mu, Jinming
    Gou, Shuiping
    Mao, Shasha
    Zheng, Shankui
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 1284 - 1292
  • [5] PolSAR Image Classification Based on Multi-Modal Contrastive Fully Convolutional Network
    Hua, Wenqiang
    Wang, Yi
    Yang, Sijia
    Jin, Xiaomin
    REMOTE SENSING, 2024, 16 (02)
  • [6] MMST: A Multi-Modal Ground-Based Cloud Image Classification Method
    Wei, Liang
    Zhu, Tingting
    Guo, Yiren
    Ni, Chao
    SENSORS, 2023, 23 (09)
  • [7] Single-shot hyperspectral imaging based on dual attention neural network with multi-modal learning
    He, Tianyue
    Zhang, Qican
    Zhou, Mingwei
    Kou, Tingdong
    Shen, Junfei
    OPTICS EXPRESS, 2022, 30 (06) : 9790 - 9813
  • [8] Emotion classification with multi-modal physiological signals using multi-attention-based neural network
    Zou, Chengsheng
    Deng, Zhen
    He, Bingwei
    Yan, Maosong
    Wu, Jie
    Zhu, Zhaoju
    COGNITIVE COMPUTATION AND SYSTEMS, 2024, 6 (1-3) : 1 - 11
  • [9] Hyperspectral Image Classification Based on Multi-Scale Residual Network with Attention Mechanism
    Qing, Yuhao
    Liu, Wenyi
    REMOTE SENSING, 2021, 13 (03) : 1 - 18
  • [10] Cross-modal attention for multi-modal image registration
    Song, Xinrui
    Chao, Hanqing
    Xu, Xuanang
    Guo, Hengtao
    Xu, Sheng
    Turkbey, Baris
    Wood, Bradford J.
    Sanford, Thomas
    Wang, Ge
    Yan, Pingkun
    MEDICAL IMAGE ANALYSIS, 2022, 82