MCFT: Multimodal Contrastive Fusion Transformer for Classification of Hyperspectral Image and LiDAR Data

Cited by: 0
Authors
Feng, Yining [1]
Jin, Jiarui [2]
Yin, Yin [2]
Song, Chuanming [3]
Wang, Xianghai [1, 2]
Affiliations
[1] Liaoning Normal Univ, Sch Geog, Dalian 116029, Peoples R China
[2] Liaoning Normal Univ, Sch Comp Sci & Artificial Intelligence, Dalian 116029, Peoples R China
[3] Dalian Univ, Sch Informat Engn, Dalian 116622, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Feature extraction; Transformers; Laser radar; Data mining; Convolutional neural networks; Computer vision; Accuracy; Head; Electronic mail; Data models; Contrastive learning; deep learning (DL); feature alignment; feature matching; HS-LiDAR fusion and classification; vision transformer (ViT);
DOI
10.1109/TGRS.2024.3490752
Chinese Library Classification (CLC)
P3 [Geophysics]; P59 [Geochemistry];
Discipline classification code
0708; 070902;
Abstract
Multisource remote sensing (RS) image fusion leverages data from various sensors to enhance the accuracy and comprehensiveness of Earth observation. Notably, the fusion of hyperspectral (HS) images and light detection and ranging (LiDAR) data has garnered significant attention due to their complementary features. However, current methods predominantly rely on simplistic techniques such as weight sharing, feature superposition, or feature products, which accumulate features rather than integrate them and therefore often fall short of true feature fusion. The transformer framework, with its self-attention mechanisms, offers potential for effective multimodal data fusion. However, the simple linear transformations it uses for feature extraction may not adequately capture all relevant information. To address these challenges, we propose a novel multimodal contrastive fusion transformer (MCFT). Our approach employs convolutional neural networks (CNNs) for feature extraction from the different modalities and leverages transformer networks for advanced fusion. We modify the basic transformer architecture and propose a double position embedding mode to make it more suitable for RS image processing tasks. We introduce two novel modules, a feature alignment module and a feature matching module, designed to exploit both paired and unpaired samples. These modules facilitate more effective cross-modal learning by emphasizing the commonalities within the same features and the differences between features from distinct modalities. Experimental evaluations on several publicly available HS-LiDAR datasets demonstrate that the proposed method consistently outperforms existing advanced methods. The source code for our approach is available at: https://github.com/SYFYN0317/MCFT.
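The abstract does not state the exact loss used by the feature alignment module. As a rough illustration of what contrastive alignment between paired HS and LiDAR features can look like, the sketch below uses a symmetric InfoNCE loss, which is a common choice in contrastive learning and is an assumption here, not the authors' confirmed formulation. The function name `symmetric_info_nce`, the `temperature` value, and the feature shapes are all hypothetical; see the linked repository for the actual implementation.

```python
import numpy as np


def symmetric_info_nce(hs_feats, lidar_feats, temperature=0.1):
    """Symmetric InfoNCE over a batch of paired feature vectors.

    Row i of hs_feats and row i of lidar_feats are assumed to come from
    the same spatial patch (a paired sample); all other rows in the
    batch act as negatives. This is an illustrative stand-in, not the
    loss defined in the MCFT paper.
    """
    # L2-normalise so the dot product is a cosine similarity.
    hs = hs_feats / np.linalg.norm(hs_feats, axis=1, keepdims=True)
    ld = lidar_feats / np.linalg.norm(lidar_feats, axis=1, keepdims=True)
    logits = hs @ ld.T / temperature  # shape (N, N)

    def cross_entropy_on_diagonal(z):
        # Numerically stable log-softmax; the correct class of row i is column i.
        z = z - z.max(axis=1, keepdims=True)
        log_probs = z - np.log(np.exp(z).sum(axis=1, keepdims=True))
        return -np.mean(np.diag(log_probs))

    # Average the HS-to-LiDAR and LiDAR-to-HS directions.
    return 0.5 * (cross_entropy_on_diagonal(logits)
                  + cross_entropy_on_diagonal(logits.T))


# Toy usage: random vectors stand in for CNN outputs of the two branches.
rng = np.random.default_rng(0)
hs_batch = rng.normal(size=(8, 16))
lidar_batch = hs_batch + 0.05 * rng.normal(size=(8, 16))
loss_paired = symmetric_info_nce(hs_batch, lidar_batch)
loss_mismatched = symmetric_info_nce(hs_batch, np.roll(lidar_batch, 1, axis=0))
```

In this toy setup the loss is small when row i of each modality describes the same patch and grows when the pairing is broken, which is the behaviour an alignment objective of this kind relies on.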
Pages: 17