MIMFormer: Multiscale Inception Mixer Transformer for Hyperspectral and Multispectral Image Fusion

被引:2
|
作者
Li, Rumei [1 ]
Zhang, Liyan [1 ,2 ]
Wang, Zun [1 ]
Li, Xiaojuan [1 ,2 ]
机构
[1] Capital Normal Univ, Coll Resource Environm & Tourism, Beijing 100048, Peoples R China
[2] Capital Normal Univ, Key Lab 3 D Informat Acquisit & Applicat, MOE, Beijing 100048, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Transformers; Hyperspectral imaging; Spatial resolution; Tensors; Image resolution; Deep learning; hyperspectral image (HSI); image fusion; multispectral image (MSI); transformer; WAVELET TRANSFORM; CLASSIFICATION; MODEL;
D O I
10.1109/JSTARS.2024.3447648
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The fusion of low-spatial-resolution hyperspectral image and high-spatial-resolution multispectral image provides an effective method to obtain high-spatial-resolution hyperspectral image. However, existing hybrid fusion architectures combining convolutional neural networks (CNNs) and transformers face significant challenges. Sequential approaches struggle with simultaneous local and global modeling, while parallel approaches often result in information redundancy. In this article, to meet diverse information demands at different layers, we propose a novel multiscale inception mixer transformer network (MIMFormer), a multiscale hybrid network based on the Inception structure integrating CNN and transformer. The core of this network is the multiscale spatial transformer (MST) structure, which enhances the detail richness of fused images by integrating local and global information at various scales. The inception spatial-spectral mixer (ISSM) module within the MST leverages an Inception architecture and employs a spectral splitting mechanism to regulate spectral channel counts across different branches. This design allows the ISSM module to efficiently extract local spatial-spectral features through convolution and max pooling, while global features are captured using a self-attention mechanism, ensuring comprehensive feature fusion across spectral groups. Experimental results on three benchmark datasets and one real remote sensing dataset demonstrate that MIMFormer outperforms ten advanced fusion methods.
引用
收藏
页码:15122 / 15135
页数:14
相关论文
共 50 条
  • [1] Multiscale spatial-spectral transformer network for hyperspectral and multispectral image fusion
    Jia, Sen
    Min, Zhichao
    Fu, Xiyou
    INFORMATION FUSION, 2023, 96 : 117 - 129
  • [2] UMTF-Net: An Unsupervised Multiscale Transformer Fusion Network for Hyperspectral and Multispectral Image Fusion
    Liu, Shuaiqi
    Zhang, Shichong
    Liu, Siyuan
    Li, Bing
    Zhang, Yu-Dong
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 17221 - 17238
  • [3] Reciprocal transformer for hyperspectral and multispectral image fusion
    Ma, Qing
    Jiang, Junjun
    Liu, Xianming
    Ma, Jiayi
    INFORMATION FUSION, 2024, 104
  • [4] Bidirectional Dilation Transformer for Multispectral and Hyperspectral Image Fusion
    Deng, Shangqi
    Deng, Liang-Jian
    Wu, Xiao
    Ran, Ran
    Wen, Rui
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 3633 - 3641
  • [5] MDC-FusFormer: Multiscale Deep Cross-Fusion Transformer Network for Hyperspectral and Multispectral Image Fusion
    Sun, Le
    Zhou, Jianxiao
    Ye, Qiaolin
    Wu, Zebin
    Chen, Qiao
    Xu, Zhongqi
    Fu, Liyong
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [6] An Asymptotic Multiscale Symmetric Fusion Network for Hyperspectral and Multispectral Image Fusion
    Liu, Shuaiqi
    Shao, Tingting
    Liu, Siyuan
    Li, Bing
    Zhang, Yu-Dong
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
  • [7] Multiscale Fusion Transformer Network for Hyperspectral Image Classification
    Yuquan Gan
    Hao Zhang
    Chen Yi
    Journal of Beijing Institute of Technology, 2024, (03) : 255 - 270
  • [8] Multiscale Fusion Transformer Network for Hyperspectral Image Classification
    Gan, Yuquan
    Zhang, Hao
    Yi, Chen
    Journal of Beijing Institute of Technology (English Edition), 2024, 33 (03): : 255 - 270
  • [9] Sparse Mix-Attention Transformer for Multispectral Image and Hyperspectral Image Fusion
    Yu, Shihai
    Zhang, Xu
    Song, Huihui
    REMOTE SENSING, 2024, 16 (01)
  • [10] Hierarchical Spectral-Spatial Transformer for Hyperspectral and Multispectral Image Fusion
    Zhu, Tianxing
    Liu, Qin
    Zhang, Lixiang
    REMOTE SENSING, 2024, 16 (22)