MATR: Multimodal Medical Image Fusion via Multiscale Adaptive Transformer

被引:0
|
作者
Tang, Wei [1 ]
He, Fazhi [1 ]
Liu, Yu [2 ]
Duan, Yansong [3 ]
机构
[1] Wuhan University, School of Computer Science, Wuhan,430072, China
[2] Hefei University of Technology, Department of Biomedical Engineering, Hefei,230009, China
[3] Wuhan University, School of Remote Sensing and Information Engineering, Wuhan,430079, China
关键词
Convolution - Deep learning - Diagnosis - Image fusion - Medical imaging - Network architecture - Particle beams - Quality control - Semantics;
D O I
暂无
中图分类号
学科分类号
摘要
Owing to the limitations of imaging sensors, it is challenging to obtain a medical image that simultaneously contains functional metabolic information and structural tissue details. Multimodal medical image fusion, an effective way to merge the complementary information in different modalities, has become a significant technique to facilitate clinical diagnosis and surgical navigation. With powerful feature representation ability, deep learning (DL)-based methods have improved such fusion results but still have not achieved satisfactory performance. Specifically, existing DL-based methods generally depend on convolutional operations, which can well extract local patterns but have limited capability in preserving global context information. To compensate for this defect and achieve accurate fusion, we propose a novel unsupervised method to fuse multimodal medical images via a multiscale adaptive Transformer termed MATR. In the proposed method, instead of directly employing vanilla convolution, we introduce an adaptive convolution for adaptively modulating the convolutional kernel based on the global complementary context. To further model long-range dependencies, an adaptive Transformer is employed to enhance the global semantic extraction capability. Our network architecture is designed in a multiscale fashion so that useful multimodal information can be adequately acquired from the perspective of different scales. Moreover, an objective function composed of a structural loss and a region mutual information loss is devised to construct constraints for information preservation at both the structural-level and the feature-level. Extensive experiments on a mainstream database demonstrate that the proposed method outperforms other representative and state-of-the-art methods in terms of both visual quality and quantitative evaluation. We also extend the proposed method to address other biomedical image fusion issues, and the pleasing fusion results illustrate that MATR has good generalization capability. The code of the proposed method is available at https://github.com/tthinking/MATR. © 1992-2012 IEEE.
引用
收藏
页码:5134 / 5149
相关论文
共 50 条
  • [31] Multimodal Fusion for Human Action Recognition via Spatial Transformer
    Sun, Yaohui
    Xu, Weiyao
    Gao, Ju
    Yu, Xiaoyi
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 1638 - 1641
  • [32] Bilateral Adaptive Evolution Transformer for Multispectral Image Fusion
    Hou, Junming
    Chen, Xiaoyu
    Wu, Chenxu
    Zhou, Man
    Li, Junling
    Hong, Danfeng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
  • [33] Multimodal image fusion with adaptive joint sparsity model
    Zhang, Chengfang
    Yi, Liangzhong
    Feng, Ziliang
    Gao, Zhisheng
    Jin, Xin
    Yan, Dan
    JOURNAL OF ELECTRONIC IMAGING, 2019, 28 (01)
  • [34] Medical Image Description Based on Multimodal Auxiliary Signals and Transformer
    Tan, Yun
    Li, Chunzhi
    Qin, Jiaohua
    Xue, Youyuan
    Xiang, Xuyu
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2024, 2024
  • [35] MIMFormer: Multiscale Inception Mixer Transformer for Hyperspectral and Multispectral Image Fusion
    Li, Rumei
    Zhang, Liyan
    Wang, Zun
    Li, Xiaojuan
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 15122 - 15135
  • [36] FATFusion: A functional–anatomical transformer for medical image fusion
    Tang, Wei
    He, Fazhi
    Information Processing and Management, 2024, 61 (04):
  • [37] Perceptual quality assessment for multimodal medical image fusion
    Tang, Lu
    Tian, Chuangeng
    Li, Leida
    Hu, Bo
    Yu, Wei
    Xu, Kai
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2020, 85 (85)
  • [38] Multimodal medical image fusion by cloud model theory
    Li, Weisheng
    Zhao, Jia
    Xiao, Bin
    SIGNAL IMAGE AND VIDEO PROCESSING, 2018, 12 (03) : 437 - 444
  • [39] A Siamese Network with a Multiscale Window-Based Transformer via an Adaptive Fusion Strategy for High-Resolution Remote Sensing Image Change Detection
    Tao, Chao
    Kuang, Dongsheng
    Wu, Kai
    Zhao, Xiaomei
    Zhao, Chunyan
    Du, Xin
    Zhang, Yunsheng
    REMOTE SENSING, 2023, 15 (09)
  • [40] Comparison of Registered Multimodal Medical Image fusion Techniques
    Kuruvilla, Sonia
    Anitha, J.
    2014 INTERNATIONAL CONFERENCE ON ELECTRONICS AND COMMUNICATION SYSTEMS (ICECS), 2014,