MATR: Multimodal Medical Image Fusion via Multiscale Adaptive Transformer

被引:0
|
作者
Tang, Wei [1 ]
He, Fazhi [1 ]
Liu, Yu [2 ]
Duan, Yansong [3 ]
机构
[1] Wuhan University, School of Computer Science, Wuhan,430072, China
[2] Hefei University of Technology, Department of Biomedical Engineering, Hefei,230009, China
[3] Wuhan University, School of Remote Sensing and Information Engineering, Wuhan,430079, China
关键词
Convolution - Deep learning - Diagnosis - Image fusion - Medical imaging - Network architecture - Particle beams - Quality control - Semantics;
D O I
暂无
中图分类号
学科分类号
摘要
Owing to the limitations of imaging sensors, it is challenging to obtain a medical image that simultaneously contains functional metabolic information and structural tissue details. Multimodal medical image fusion, an effective way to merge the complementary information in different modalities, has become a significant technique to facilitate clinical diagnosis and surgical navigation. With powerful feature representation ability, deep learning (DL)-based methods have improved such fusion results but still have not achieved satisfactory performance. Specifically, existing DL-based methods generally depend on convolutional operations, which can well extract local patterns but have limited capability in preserving global context information. To compensate for this defect and achieve accurate fusion, we propose a novel unsupervised method to fuse multimodal medical images via a multiscale adaptive Transformer termed MATR. In the proposed method, instead of directly employing vanilla convolution, we introduce an adaptive convolution for adaptively modulating the convolutional kernel based on the global complementary context. To further model long-range dependencies, an adaptive Transformer is employed to enhance the global semantic extraction capability. Our network architecture is designed in a multiscale fashion so that useful multimodal information can be adequately acquired from the perspective of different scales. Moreover, an objective function composed of a structural loss and a region mutual information loss is devised to construct constraints for information preservation at both the structural-level and the feature-level. Extensive experiments on a mainstream database demonstrate that the proposed method outperforms other representative and state-of-the-art methods in terms of both visual quality and quantitative evaluation. We also extend the proposed method to address other biomedical image fusion issues, and the pleasing fusion results illustrate that MATR has good generalization capability. The code of the proposed method is available at https://github.com/tthinking/MATR. © 1992-2012 IEEE.
引用
收藏
页码:5134 / 5149
相关论文
共 50 条
  • [21] Advancing multimodal medical image fusion: an adaptive image decomposition approach based on multilevel Guided filtering
    Moghtaderi, Shiva
    Einlou, Mokarrameh
    Wahid, Khan A.
    Lukong, Kiven Erique
    ROYAL SOCIETY OPEN SCIENCE, 2024, 11 (04):
  • [22] A novel approach for multimodal medical image fusion
    Liu, Zhaodong
    Yin, Hongpeng
    Chai, Yi
    Yang, Simon X.
    EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (16) : 7425 - 7435
  • [23] Underwater Image Enhancement via Adaptive Group Attention-Based Multiscale Cascade Transformer
    Huang, Zhixiong
    Li, Jinjiang
    Hua, Zhen
    Fan, Linwei
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
  • [24] Laplacian Redecomposition for Multimodal Medical Image Fusion
    Li, Xiaoxiao
    Guo, Xiaopeng
    Han, Pengfei
    Wang, Xiang
    Li, Huaguang
    Luo, Tao
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2020, 69 (09) : 6880 - 6890
  • [25] A Review of Multimodal Medical Image Fusion Techniques
    Huang, Bing
    Yang, Feng
    Yin, Mengxiao
    Mo, Xiaoying
    Zhong, Cheng
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2020, 2020
  • [26] MRFormer: Multiscale retractable transformer for medical image progressive denoising via noise level estimation
    Bai, Can
    Han, Xianjun
    IMAGE AND VISION COMPUTING, 2024, 144
  • [27] Multiscale Adaptive Fusion Network for Hyperspectral Image Denoising
    Pan, Haodong
    Gao, Feng
    Dong, Junyu
    Du, Qian
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 : 3045 - 3059
  • [28] Multimodal Medical Image Fusion Utilizing Two-scale Image Decomposition via Saliency Detection
    Kaur, Harmanpreet
    Vig, Renu
    Kumar, Naresh
    Sharma, Apoorav
    Dogra, Ayush
    Goyal, Bhawna
    CURRENT MEDICAL IMAGING, 2024, 20
  • [29] Enhanced multimodal medical image fusion via modified DWT with arithmetic optimization algorithm
    Alzahrani, Ahmad A.
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [30] Block-Matching Based Multimodal Medical Image Fusion via PCNN with SML
    Hu Shaohai
    Yang Dongsheng
    Liu Shuaiqi
    Ma Xiaole
    PROCEEDINGS OF 2016 IEEE 13TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP 2016), 2016, : 13 - 18