Joint Classification of Hyperspectral and LiDAR Data Using Hierarchical Multimodal Feature Aggregation-Based Multihead Axial Attention Transformer

被引:0
|
作者
Zhu, Fei [1 ]
Shi, Cuiping [2 ]
Shi, Kaijie [1 ]
Wang, Liguo [3 ]
机构
[1] Qiqihar Univ, Dept Commun Engn, Qiqihar 161000, Peoples R China
[2] Huzhou Univ, Coll Informat Engn, Huzhou 313000, Peoples R China
[3] Dalian Nationalities Univ, Coll Informat & Commun Engn, Dalian 116000, Peoples R China
基金
中国国家自然科学基金;
关键词
Axial attention; convolutional neural networks (CNNs); feature aggregation; hyperspectral; light detection and ranging (LiDAR); multimodal; transformer; REMOTE-SENSING DATA; EXTINCTION PROFILES; FUSION;
D O I
10.1109/TGRS.2025.3533475
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
The rapid development of sensor and multimodal technology has provided more possibilities for multisource remote sensing image classification. However, some existing joint classification methods are limited to single-level feature fusion and fail to fully explore the deep correlation between cross-level features, thus limiting the effective interaction and complementarity of information between different modal data. To alleviate this issue, this article proposes a hierarchical multimodal feature aggregation-based multihead axial attention transformer (HMAT) for joint classification of hyperspectral and light detection and ranging (LiDAR) data. First, a hierarchical multimodal feature aggregation module (HMFA) is proposed to more effectively fuse spatial-spectral features of hyperspectral images (HSIs) and elevation features of LiDAR data and generate more discriminative low-dimensional feature representations. Second, a pyramid-inverted pyramid convolution module (PIP) is designed. Through the complementary feature extraction structure, PIP can more fully capture the multiscale local features in the fused feature map of hyperspectral and LiDAR data. Finally, a multihead axial attention (MHAA) component is constructed to capture information at different scales in the fused feature maps, thereby accurately modeling global dependencies. The proposed HMAT has been extensively tested on three publicly available datasets. The experimental results demonstrate that the classification performance of the proposed method outperforms that of several state-of-the-art methods.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] Multi-Scale Feature Extraction for Joint Classification of Hyperspectral and LiDAR Data
    Xi Y.
    Ye Z.
    Journal of Beijing Institute of Technology (English Edition), 2023, 32 (01): : 13 - 22
  • [22] Multi-Scale Feature Extraction for Joint Classification of Hyperspectral and LiDAR Data
    Yongqiang Xi
    Zhen Ye
    Journal of Beijing Institute of Technology, 2023, 32 (01) : 13 - 22
  • [23] MULTI-SCALE FEATURE FUSION FOR HYPERSPECTRAL AND LIDAR DATA JOINT CLASSIFICATION
    Zhang, Maqun
    Gao, Feng
    Dong, Junyu
    Qi, Lin
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 2856 - 2859
  • [24] Multimodal Attention-Aware Convolutional Neural Networks for Classification of Hyperspectral and LiDAR Data
    Zhang, Haotian
    Yao, Jing
    Ni, Li
    Gao, Lianru
    Huang, Min
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 : 3635 - 3644
  • [25] Multimodal Attention-Aware Convolutional Neural Networks for Classification of Hyperspectral and LiDAR Data
    Zhang, Haotian
    Yao, Jing
    Ni, Li
    Gao, Lianru
    Huang, Min
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 : 3635 - 3644
  • [26] Feature based classification using hierarchical fuzzy rules for the analysis of hyperspectral data
    Yoon, CR
    Kim, HG
    Kim, KO
    IGARSS 2005: IEEE International Geoscience and Remote Sensing Symposium, Vols 1-8, Proceedings, 2005, : 3856 - 3859
  • [27] Collaborative classification of hyperspectral and LiDAR data based on CNN-transformer
    Wu H.
    Dai S.
    Wang A.
    Yuji I.
    Yu X.
    Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2024, 32 (07): : 1087 - 1100
  • [28] Sound Event Detection Using Attention and Aggregation-Based Feature Pyramid Network
    Kim, Ji Won
    Lee, Geon Woo
    Kim, Hong Kook
    Kim, Nam Kyun
    2022 27TH ASIA PACIFIC CONFERENCE ON COMMUNICATIONS (APCC 2022): CREATING INNOVATIVE COMMUNICATION TECHNOLOGIES FOR POST-PANDEMIC ERA, 2022, : 496 - 497
  • [29] Classification of hyperspectral and LIDAR data using extinction profiles with feature fusion
    Zhang, Mengmeng
    Ghamisi, Pedram
    Li, Wei
    REMOTE SENSING LETTERS, 2017, 8 (10) : 957 - 966
  • [30] FusAtNet: Dual Attention based SpectroSpatial Multimodal Fusion Network for Hyperspectral and LiDAR Classification
    Mohla, Satyam
    Pande, Shivam
    Banerjee, Biplab
    Chaudhuri, Subhasis
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 416 - 425