Cross Attention-Based Multi-Scale Convolutional Fusion Network for Hyperspectral and LiDAR Joint Classification

被引:0
|
作者
Ge, Haimiao [1 ,2 ]
Wang, Liguo [3 ]
Pan, Haizhu [1 ,2 ]
Liu, Yanzhong [1 ,2 ]
Li, Cheng [1 ,2 ]
Lv, Dan [1 ,2 ]
Ma, Huiyu [1 ,2 ]
机构
[1] Qiqihar Univ, Coll Comp & Control Engn, Qiqihar 161000, Peoples R China
[2] Qiqihar Univ, Heilongjiang Key Lab Big Data Network Secur Detect, Qiqihar 161000, Peoples R China
[3] Dalian Minzu Univ, Coll Informat & Commun Engn, Dalian 116600, Peoples R China
基金
中国国家自然科学基金;
关键词
HSI and LiDAR fusion classification; convolutional neural network; multi-scale feature extraction; cross attention;
D O I
10.3390/rs16214073
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
In recent years, deep learning-based multi-source data fusion, e.g., hyperspectral image (HSI) and light detection and ranging (LiDAR) data fusion, has gained significant attention in the field of remote sensing. However, the traditional convolutional neural network fusion techniques always provide poor extraction of discriminative spatial-spectral features from diversified land covers and overlook the correlation and complementarity between different data sources. Furthermore, the mere act of stacking multi-source feature embeddings fails to represent the deep semantic relationships among them. In this paper, we propose a cross attention-based multi-scale convolutional fusion network for HSI-LiDAR joint classification. It contains three major modules: spatial-elevation-spectral convolutional feature extraction module (SESM), cross attention fusion module (CAFM), and classification module. In the SESM, improved multi-scale convolutional blocks are utilized to extract features from HSI and LiDAR to ensure discriminability and comprehensiveness in diversified land cover conditions. Spatial and spectral pseudo-3D convolutions, pointwise convolutions, residual aggregation, one-shot aggregation, and parameter-sharing techniques are implemented in the module. In the CAFM, a self-designed local-global cross attention block is utilized to collect and integrate relationships of the feature embeddings and generate joint semantic representations. In the classification module, average polling, dropout, and linear layers are used to map the fused semantic representations to the final classification results. The experimental evaluations on three public HSI-LiDAR datasets demonstrate the competitiveness of the proposed network in comparison with state-of-the-art methods.
引用
收藏
页数:33
相关论文
共 50 条
  • [21] Hyperspectral Image Classification Based on Multi-Scale Residual Network with Attention Mechanism
    Qing, Yuhao
    Liu, Wenyi
    REMOTE SENSING, 2021, 13 (03) : 1 - 18
  • [22] Mosquito swarm counting via attention-based multi-scale convolutional neural network
    Chen, Huahua
    Ren, Junhao
    Sun, Wensheng
    Hou, Juan
    Miao, Ziping
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [23] Mosquito swarm counting via attention-based multi-scale convolutional neural network
    Huahua Chen
    Junhao Ren
    Wensheng Sun
    Juan Hou
    Ziping Miao
    Scientific Reports, 13
  • [24] Hyperspectral Image Classification Based on Multi-Scale Feature Fusion Residual Network
    Deng Ziqing
    Wang Yang
    Zhang Bing
    Ding Zhao
    Bian Lifeng
    Yang Chen
    LASER & OPTOELECTRONICS PROGRESS, 2022, 59 (18)
  • [25] HAMNet: hyperspectral image classification based on hybrid neural network with attention mechanism and multi-scale feature fusion
    Shen, Jinyue
    Zheng, Zhouzhou
    Sun, Yingwei
    Zhao, Mengmeng
    Chang, Yankang
    Shao, Yuyi
    Zhang, Yan
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2022, 43 (11) : 4233 - 4258
  • [26] Convolutional Neural Network and Vision Transformer-driven Cross-layer Multi-scale Fusion Network for Hyperspectral Image Classification
    Zhao F.
    Geng M.
    Liu H.
    Zhang J.
    Yu J.
    Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2024, 46 (05): : 2237 - 2248
  • [27] Multi-Scale Dilated Convolutional Neural Network for Hyperspectral Image Classification
    Shanshan Zheng
    Wen Liu
    Rui Shan
    Jingyi Zhao
    Guoqian Jiang
    Zhi Zhang
    JournalofHarbinInstituteofTechnology(NewSeries), 2021, 28 (04) : 25 - 32
  • [28] Deep Multi-scale Convolutional Neural Network for Hyperspectral Image Classification
    Zhang Feng-zhe
    Yang Xia
    NINTH INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2017), 2018, 10615
  • [29] Multi-scale Convolutional Attention Fuzzy Broad Network for Few-Shot Hyperspectral Image Classification
    Hu, Xiaopei
    Zhao, Guixin
    Yuan, Lu
    Dong, Xiangjun
    Dong, Aimei
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING-ICANN 2024, PT II, 2024, 15017 : 46 - 60
  • [30] Gaze Estimation with Multi-scale Attention-based Convolutional Neural Networks
    Zhang, Yuanyuan
    Li, Jing
    Ouyang, Gaoxiang
    2023 29TH INTERNATIONAL CONFERENCE ON MECHATRONICS AND MACHINE VISION IN PRACTICE, M2VIP 2023, 2023,