AMS: A hyperspectral image classification method based on SVM and multi-modal attention network

Cited by: 0
Authors
Chen, Yingxia [1,2]
Liu, Zhaoheng [1]
Chen, Zeqiang [3]
Affiliations
[1] Yangtze Univ, Sch Comp Sci, Jingzhou 432023, Peoples R China
[2] East China Normal Univ, Shanghai Key Lab Multidimens Informat Proc, Shanghai 200241, Peoples R China
[3] China Univ Geosci, Natl Engn Res Ctr Geog Informat Syst, Wuhan 430074, Peoples R China
Keywords
Hyperspectral image classification; Convolutional neural network; Attention mechanism; Cross-layer adaptive fusion; Support vector machine; Feature extraction
DOI
10.1016/j.knosys.2025.113236
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Hyperspectral (HS) image classification technology is increasingly used for identifying land cover categories. However, spectral aliasing restricts its ability to accurately and completely capture land cover features. To overcome this issue, we introduce a classification method, the attention-based multi-modal cross-layer fusion network with SVM (AMS), that integrates three modules: a convolutional neural network with an attention mechanism (AMCNN), a multi-modal cross-layer adaptive fusion encoder (MCAFE), and a support vector machine (SVM). AMCNN combines convolution and attention mechanisms to overcome the limitation of a single CNN structure, which cannot allocate attention dynamically. MCAFE addresses the ineffective inter-layer information interaction and vanishing gradients commonly observed in stacked encoder-layer structures. Furthermore, an SVM is used to obtain the decision boundaries because it performs better on linearly separable data than traditional fully connected (FC) layers. Experimental results show that AMS considerably improves overall accuracy (OA), average accuracy (AA), and Kappa on the Houston and MUUFL datasets, outperforming other state-of-the-art methods.
Pages: 19
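The abstract reports overall accuracy (OA), average accuracy (AA), and Kappa. As a rough illustration of how these three metrics are conventionally derived from a confusion matrix, here is a minimal pure-Python sketch. The function names `confusion_matrix` and `oa_aa_kappa` and the toy labels are hypothetical and are not taken from the paper; the paper's own evaluation code may differ.

```python
from typing import List, Sequence, Tuple


def confusion_matrix(y_true: Sequence[int], y_pred: Sequence[int],
                     num_classes: int) -> List[List[int]]:
    """Rows are true classes, columns are predicted classes."""
    cm = [[0] * num_classes for _ in range(num_classes)]
    for t, p in zip(y_true, y_pred):
        cm[t][p] += 1
    return cm


def oa_aa_kappa(cm: List[List[int]]) -> Tuple[float, float, float]:
    """Return (OA, AA, Cohen's kappa) for a square confusion matrix."""
    k = len(cm)
    n = sum(sum(row) for row in cm)
    row_sums = [sum(cm[i]) for i in range(k)]
    col_sums = [sum(cm[r][c] for r in range(k)) for c in range(k)]

    # OA: fraction of all samples that are classified correctly.
    oa = sum(cm[i][i] for i in range(k)) / n

    # AA: mean of per-class recall, skipping classes with no true samples.
    recalls = [cm[i][i] / row_sums[i] for i in range(k) if row_sums[i] > 0]
    aa = sum(recalls) / len(recalls)

    # Kappa: agreement corrected for the agreement expected by chance.
    pe = sum(row_sums[i] * col_sums[i] for i in range(k)) / (n * n)
    kappa = (oa - pe) / (1 - pe) if pe != 1 else 1.0
    return oa, aa, kappa


# Toy labels for three hypothetical land cover classes (illustrative only).
y_true = [0, 0, 0, 0, 1, 1, 2, 2, 2, 2]
y_pred = [0, 0, 0, 1, 1, 1, 2, 2, 0, 2]
cm = confusion_matrix(y_true, y_pred, num_classes=3)
oa, aa, kappa = oa_aa_kappa(cm)
print(f"OA={oa:.4f} AA={aa:.4f} Kappa={kappa:.4f}")
```

OA weights every sample equally, so large classes dominate it, whereas AA weights every class equally. That is why hyperspectral benchmarks with imbalanced classes typically report both alongside Kappa.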