MATR: Multimodal Medical Image Fusion via Multiscale Adaptive Transformer

被引:0
|
作者
Tang, Wei [1 ]
He, Fazhi [1 ]
Liu, Yu [2 ]
Duan, Yansong [3 ]
机构
[1] Wuhan University, School of Computer Science, Wuhan,430072, China
[2] Hefei University of Technology, Department of Biomedical Engineering, Hefei,230009, China
[3] Wuhan University, School of Remote Sensing and Information Engineering, Wuhan,430079, China
关键词
Convolution - Deep learning - Diagnosis - Image fusion - Medical imaging - Network architecture - Particle beams - Quality control - Semantics;
D O I
暂无
中图分类号
学科分类号
摘要
Owing to the limitations of imaging sensors, it is challenging to obtain a medical image that simultaneously contains functional metabolic information and structural tissue details. Multimodal medical image fusion, an effective way to merge the complementary information in different modalities, has become a significant technique to facilitate clinical diagnosis and surgical navigation. With powerful feature representation ability, deep learning (DL)-based methods have improved such fusion results but still have not achieved satisfactory performance. Specifically, existing DL-based methods generally depend on convolutional operations, which can well extract local patterns but have limited capability in preserving global context information. To compensate for this defect and achieve accurate fusion, we propose a novel unsupervised method to fuse multimodal medical images via a multiscale adaptive Transformer termed MATR. In the proposed method, instead of directly employing vanilla convolution, we introduce an adaptive convolution for adaptively modulating the convolutional kernel based on the global complementary context. To further model long-range dependencies, an adaptive Transformer is employed to enhance the global semantic extraction capability. Our network architecture is designed in a multiscale fashion so that useful multimodal information can be adequately acquired from the perspective of different scales. Moreover, an objective function composed of a structural loss and a region mutual information loss is devised to construct constraints for information preservation at both the structural-level and the feature-level. Extensive experiments on a mainstream database demonstrate that the proposed method outperforms other representative and state-of-the-art methods in terms of both visual quality and quantitative evaluation. We also extend the proposed method to address other biomedical image fusion issues, and the pleasing fusion results illustrate that MATR has good generalization capability. The code of the proposed method is available at https://github.com/tthinking/MATR. © 1992-2012 IEEE.
引用
收藏
页码:5134 / 5149
相关论文
共 50 条
  • [41] A Brief Analysis of Multimodal Medical Image Fusion Techniques
    Saleh, Mohammed Ali
    Ali, AbdElmgeid A. A.
    Ahmed, Kareem
    Sarhan, Abeer M. M.
    ELECTRONICS, 2023, 12 (01)
  • [42] MULTIMODAL MEDICAL IMAGE FUSION USING HYBRID DOMAINS
    Naidu, A. Rajesh
    Bhavana, D.
    SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2022, 23 (04): : 225 - 232
  • [43] A Systematic Literature Review on Multimodal Medical Image Fusion
    Shatabdi Basu
    Sunita Singhal
    Dilbag Singh
    Multimedia Tools and Applications, 2024, 83 : 15845 - 15913
  • [44] Multimodal Medical Image Registration and Fusion for Quality Enhancement
    Azam, Muhammad Adeel
    Khan, Khan Bahadar
    Ahmad, Muhammad
    Mazzara, Manuel
    CMC-COMPUTERS MATERIALS & CONTINUA, 2021, 68 (01): : 821 - 840
  • [45] DEPSO With DTCWT Algorithm for Multimodal Medical Image Fusion
    Talbi, Hassiba
    Kholladi, Mohamed-Khireddine
    INTERNATIONAL JOURNAL OF APPLIED METAHEURISTIC COMPUTING, 2021, 12 (04) : 78 - 97
  • [46] Detail-enhanced multimodal medical image fusion
    Yang, Guocheng
    Chen, Leiting
    Qiu, Hang
    2014 IEEE 17TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (CSE), 2014, : 1611 - 1615
  • [47] Multimodal Medical Supervised Image Fusion Method by CNN
    Li, Yi
    Zhao, Junli
    Lv, Zhihan
    Pan, Zhenkuan
    FRONTIERS IN NEUROSCIENCE, 2021, 15
  • [48] Multimodal medical image fusion by cloud model theory
    Weisheng Li
    Jia Zhao
    Bin Xiao
    Signal, Image and Video Processing, 2018, 12 : 437 - 444
  • [49] A Systematic Literature Review on Multimodal Medical Image Fusion
    Basu, Shatabdi
    Singhal, Sunita
    Singh, Dilbag
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (06) : 15845 - 15913
  • [50] Multimodal medical image fusion based on IHS and PCA
    He, Changtao
    Liu, Quanxi
    Li, Hongliang
    Wang, Haixu
    2010 SYMPOSIUM ON SECURITY DETECTION AND INFORMATION PROCESSING, 2010, 7 : 280 - 285