PEMMA: Parameter-Efficient Multi-Modal Adaptation for Medical Image Segmentation

Cited by: 0
Authors
Saadi, Nada [1]
Saeed, Numan [1]
Yaqub, Mohammad [1]
Nandakumar, Karthik [1]
Affiliations
[1] Mohamed bin Zayed University of Artificial Intelligence, Abu Dhabi, United Arab Emirates
Keywords
Multi-modal Adaptation; Low-rank Adaptation; Parameter-Efficiency; Cross-modal Entanglement; 3D Medical Image Segmentation
DOI
10.1007/978-3-031-72390-2_25
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Imaging modalities such as Computed Tomography (CT) and Positron Emission Tomography (PET) are key in cancer detection, inspiring deep neural network (DNN) models that merge these scans for tumor segmentation. When both CT and PET scans are available, it is common to combine them as two channels of the input to the segmentation model. However, this method requires both scan types during training and inference, which is a challenge because PET scans have limited availability, so the process is sometimes restricted to CT scans only. Hence, there is a need for a flexible DNN architecture that can be trained or updated using only CT scans but can effectively utilize PET scans when they become available. In this work, we propose a parameter-efficient multi-modal adaptation (PEMMA) framework for lightweight upgrading of a transformer-based segmentation model trained only on CT scans so that it also incorporates PET scans. The benefits of the proposed approach are two-fold. First, we leverage the inherent modularity of the transformer architecture and perform low-rank adaptation (LoRA) of the attention weights to achieve parameter-efficient adaptation. Second, since the PEMMA framework attempts to minimize cross-modal entanglement, it is possible to subsequently update the combined model using only one modality without causing catastrophic forgetting of the other modality. Our proposed method achieves performance comparable to early fusion techniques with just 8% of the trainable parameters, along with a +28% improvement in the average Dice score on PET scans when trained on a single modality.
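The abstract's low-rank adaptation of attention weights can be illustrated with a generic LoRA layer. The following is a minimal NumPy sketch of the general technique, not the authors' implementation: the class name `LoRALinear`, the rank, the `alpha` scaling, and the initialization are all assumptions chosen for illustration. The pretrained weight stays frozen and only the two small factors `A` and `B` would be trained.

```python
import numpy as np


class LoRALinear:
    """Frozen linear map plus a low-rank update: W_eff = W + (alpha / rank) * B @ A.

    Hypothetical illustration of generic LoRA; not taken from the PEMMA code.
    """

    def __init__(self, weight, rank=4, alpha=8.0, seed=0):
        rng = np.random.default_rng(seed)
        d_out, d_in = weight.shape
        self.weight = weight                               # frozen pretrained weight
        self.A = rng.normal(0.0, 0.01, size=(rank, d_in))  # trainable down-projection
        self.B = np.zeros((d_out, rank))                   # trainable up-projection, zero at start
        self.scale = alpha / rank

    def __call__(self, x):
        # Frozen path plus the scaled low-rank correction.
        return x @ self.weight.T + self.scale * (x @ self.A.T) @ self.B.T

    def trainable_fraction(self):
        # Share of this layer's parameters that would receive gradients.
        lora_params = self.A.size + self.B.size
        return lora_params / (lora_params + self.weight.size)


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    W = rng.normal(size=(768, 768))   # stand-in for one attention projection
    layer = LoRALinear(W, rank=4)
    x = rng.normal(size=(2, 768))
    # With B initialized to zero, the adapted layer reproduces the frozen layer.
    print(np.allclose(layer(x), x @ W.T))
    print(f"trainable fraction: {layer.trainable_fraction():.4f}")
```

Because `B` starts at zero, the adapted layer initially behaves exactly like the frozen one, so adaptation starts from the pretrained CT-only behavior. The per-layer fraction printed here depends on the assumed rank and layer size and is not meant to reproduce the 8% figure reported in the abstract.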
Pages: 262-271
Number of pages: 10