A multimodal hyper-fusion transformer for remote sensing image classification

Cited by: 23
Authors
Ma, Mengru [1]
Ma, Wenping [1]
Jiao, Licheng [1]
Liu, Xu [1]
Li, Lingling [1]
Feng, Zhixi [1]
Liu, Fang [1]
Yang, Shuyuan [1]
Affiliations
[1] Xidian Univ, Sch Artificial Intelligence, Int Res Ctr Intelligent Percept & Computat, Joint Int Res Lab Intelligent Percept & Computat,K, Xian 710071, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Deep learning; Multi-modal remote sensing; Transformer; Gist feature; Fusion classification; PAN; NETWORK; MS;
DOI
10.1016/j.inffus.2023.03.005
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Multispectral (MS) and panchromatic (PAN) images carry complementary and synergistic spatial and spectral information, and how to make the best use of their respective advantages has become a hot research topic. This paper proposes a selectable Transformer and Gist CNN network (STGC-Net). A subspace similar recombination module (SSR-Module), based on non-negative matrix factorization (NMF) and the self-attention mechanism, performs feature decomposition; it reduces redundant information in the multi-modal data and extracts the singular and common features of each modality. Because the MS and PAN images have different strengths, a selectable self-attention spectral feature extraction module (S3FE-Module) and a multi-stream Gist spatial feature extraction module (MGSFE-Module) are proposed for their respective singular features. The former refines the Transformer's input while characterizing the sequence information between channels of the MS image. The latter introduces the positional relationship between local features while extracting spatial features from the PAN image, thereby improving the accuracy of scene classification. Experimental results indicate that the proposed method outperforms the compared methods. The code for this paper is available at: https://github.com/ru-willow/ST-GC-Net.
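The abstract names non-negative matrix factorization (NMF) as one ingredient of the SSR-Module. As a rough illustration of NMF alone, the sketch below factorizes a non-negative matrix V into non-negative factors W and H using the standard multiplicative update rules. It is not the authors' SSR-Module: the function name `nmf`, the rank, the iteration count, and the toy data are all assumptions made for this example, and the self-attention part is not shown.

```python
import numpy as np


def nmf(V, rank, n_iter=200, eps=1e-9, seed=0):
    """Plain NMF with multiplicative updates (illustrative, hypothetical helper).

    Approximates a non-negative matrix V (n x m) as W @ H, where
    W is (n x rank) and H is (rank x m), both non-negative.
    """
    rng = np.random.default_rng(seed)
    n, m = V.shape
    # Strictly positive random initialization keeps the updates well defined.
    W = rng.random((n, rank)) + eps
    H = rng.random((rank, m)) + eps
    errors = []
    for _ in range(n_iter):
        # Multiplicative updates for the Frobenius-norm objective;
        # eps guards against division by zero.
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
        errors.append(float(np.linalg.norm(V - W @ H)))
    return W, H, errors


# Toy data (assumption): a 6 x 8 non-negative matrix of rank at most 3.
data_rng = np.random.default_rng(1)
V = data_rng.random((6, 3)) @ data_rng.random((3, 8))
W, H, errors = nmf(V, rank=3)
```

Here the columns of W can be read as non-negative basis vectors and the columns of H as their mixing coefficients; how the paper combines such a decomposition with self-attention to separate singular and common features is described only in the paper itself and its repository.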
Pages: 66-79
Page count: 14
Related papers
50 in total
  • [1] Multimodal Fusion Transformer for Remote Sensing Image Classification
    Roy, Swalpa Kumar
    Deria, Ankur
    Hong, Danfeng
    Rasti, Behnood
    Plaza, Antonio
    Chanussot, Jocelyn
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [2] Deep Symmetric Fusion Transformer for Multimodal Remote Sensing Data Classification
    Chang, Honghao
    Bi, Haixia
    Li, Fan
    Xu, Chen
    Chanussot, Jocelyn
    Hong, Danfeng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [3] Fractional Fourier Image Transformer for Multimodal Remote Sensing Data Classification
    Zhao, Xudong
    Zhang, Mengmeng
    Tao, Ran
    Li, Wei
    Liao, Wenzhi
    Tian, Lianfang
    Philips, Wilfried
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (02) : 2314 - 2326
  • [4] Remote Sensing Image Classification Method Based on Fusion of CNN and Transformer
    Jin Chuan
    Tong Changqing
    LASER & OPTOELECTRONICS PROGRESS, 2023, 60 (20)
  • [5] A Multilevel Multimodal Fusion Transformer for Remote Sensing Semantic Segmentation
    Ma, Xianping
    Zhang, Xiaokang
    Pun, Man-On
    Liu, Ming
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 15
  • [6] RsMmFormer: Multimodal Transformer Using Multiscale Self-attention for Remote Sensing Image Classification
    Zhang, Bo
    Ming, Zuheng
    Liu, Yaqian
    Feng, Wei
    He, Liang
    Zhao, Kaixing
    ARTIFICIAL INTELLIGENCE, CICAI 2023, PT I, 2024, 14473 : 329 - 339
  • [7] MHFNet: An Improved HGR Multimodal Network for Informative Correlation Fusion in Remote Sensing Image Classification
    Zhang, Hongkang
    Huang, Shao-Lun
    Kuruoglu, Ercan Engin
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 15052 - 15066
  • [8] IFF-Net: Irregular Feature Fusion Network for Multimodal Remote Sensing Image Classification
    Wang, Huiqing
    Wang, Huajun
    Wu, Linfeng
    APPLIED SCIENCES-BASEL, 2024, 14 (12)
  • [9] Multimodal Fusion Remote Sensing Image-Audio Retrieval
    Yang, Rui
    Wang, Shuang
    Sun, Yingzhi
    Zhang, Huan
    Liao, Yu
    Gu, Yu
    Hou, Biao
    Jiao, Licheng
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 : 6220 - 6235
  • [10] CNN and Transformer Fusion for Remote Sensing Image Semantic Segmentation
    Chen, Xin
    Li, Dongfen
    Liu, Mingzhe
    Jia, Jiaru
    REMOTE SENSING, 2023, 15 (18)