A multimodal hyper-fusion transformer for remote sensing image classification

被引:23
|
作者
Ma, Mengru [1 ]
Ma, Wenping [1 ]
Jiao, Licheng [1 ]
Liu, Xu [1 ]
Li, Lingling [1 ]
Feng, Zhixi [1 ]
Liu, Fang [1 ]
Yang, Shuyuan [1 ]
机构
[1] Xidian Univ, Sch Artificial Intelligence, Int Res Ctr Intelligent Percept & Computat, Joint Int Res Lab Intelligent Percept & Computat,K, Xian 710071, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep learning; Multi-modal remote sensing; Transformer; Gist feature; Fusion classification; PAN; NETWORK; MS;
D O I
10.1016/j.inffus.2023.03.005
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The multispectral (MS) and the panchromatic (PAN) images represent complementary and synergistic spatial spectral information, how to make optimal use of the advantages of them has become a hot research topic. This paper proposes a selectable Transformer and Gist CNN network (STGC-Net). It designs a subspace similar recombination module (SSR-Module) based on non-negative matrix factorization (NMF) and the self-attention mechanism for feature decomposition. This can alleviate the redundant information of multi-modal data and extract their own singular and common features. Considering that the MS and the PAN images exhibit different advantageous properties, a selectable self-attention spectral feature extraction module (S3FE-Module) and a multi-stream Gist spatial feature extraction module (MGSFE-Module) are proposed for the different singular features. The former can refine the Transformer's input and simultaneously characterize the sequence information between channels for the MS image. The latter introduces the positional relationship between local features while extracting spatial features for the PAN image, thereby improving the accuracy of scene classification. Experimental results indicate that the proposed method performs better than the other methods. The relevant code of this paper is provided at: https://github.com/ru-willow/ST-GC-Net.
引用
收藏
页码:66 / 79
页数:14
相关论文
共 50 条
  • [31] Multimodal Image Fusion Framework for End-to-End Remote Sensing Image Registration
    Li, Liangzhi
    Han, Ling
    Ding, Mingtao
    Cao, Hongye
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [32] Multimodal Remote Sensing Image Classification with Small Sample Size Based on High-Level Feature Fusion
    He Qi
    Li Yao
    Song Wei
    Huang Dongmei
    He Shengqi
    Du Yanling
    LASER & OPTOELECTRONICS PROGRESS, 2019, 56 (11)
  • [33] Unified multimodal fusion transformer for few shot object detection for remote sensing images
    Azeem, Abdullah
    Li, Zhengzhou
    Siddique, Abubakar
    Zhang, Yuting
    Zhou, Shangbo
    INFORMATION FUSION, 2024, 111
  • [34] MCFT: Multimodal Contrastive Fusion Transformer for Classification of Hyperspectral Image and LiDAR Data
    Feng, Yining
    Jin, Jiarui
    Yin, Yin
    Song, Chuanming
    Wang, Xianghai
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [35] Hierarchical Feature Fusion of Transformer With Patch Dilating for Remote Sensing Scene Classification
    Chen, Xiaoning
    Ma, Mingyang
    Li, Yong
    Mei, Shaohui
    Han, Zonghao
    Zhao, Jian
    Cheng, Wei
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61 : 1 - 16
  • [36] Transfer Representation Learning Meets Multimodal Fusion Classification for Remote Sensing Images
    Ma, Mengru
    Ma, Wenping
    Jiao, Licheng
    Liu, Xu
    Liu, Fang
    Li, Lingling
    Yang, Shuyuan
    Hou, Biao
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [37] Hybrid Attention Fusion Embedded in Transformer for Remote Sensing Image Semantic Segmentation
    Chen, Yan
    Dong, Quan
    Wang, Xiaofeng
    Zhang, Qianchuan
    Kang, Menglei
    Jiang, Wenxiang
    Wang, Mengyuan
    Xu, Lixiang
    Zhang, Chen
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 4421 - 4435
  • [38] TransRA: transformer and residual attention fusion for single remote sensing image dehazing
    Pengwei Dong
    Bo Wang
    Multidimensional Systems and Signal Processing, 2022, 33 : 1119 - 1138
  • [39] TransRA: transformer and residual attention fusion for single remote sensing image dehazing
    Dong, Pengwei
    Wang, Bo
    MULTIDIMENSIONAL SYSTEMS AND SIGNAL PROCESSING, 2022, 33 (04) : 1119 - 1138
  • [40] A Unified Generative Adversarial Network With Convolution and Transformer for Remote Sensing Image Fusion
    Wu, Yuanyuan
    Huang, Mengxing
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62