PEMMA: Parameter-Efficient Multi-Modal Adaptation for Medical Image Segmentation

被引:0
|
作者
Saadi, Nada [1 ]
Saeed, Numan [1 ]
Yaqub, Mohammad [1 ]
Nandakumar, Karthik [1 ]
机构
[1] Mohamed Bin Zayed Univ Artificial Intelligence, Abu Dhabi, U Arab Emirates
关键词
Multi-modal Adaptation; Low-rank Adaptation; Parameter-Efficiency; Cross-modal Entanglement; 3D Medical Image Segmentation;
D O I
10.1007/978-3-031-72390-2_25
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Imaging modalities such as Computed Tomography (CT) and Positron Emission Tomography (PET) are key in cancer detection, inspiring Deep Neural Networks (DNN) models that merge these scans for tumor segmentation. When both CT and PET scans are available, it is common to combine them as two channels of the input to the segmentation model. However, this method requires both scan types during training and inference, posing a challenge due to the limited availability of PET scans, thereby sometimes limiting the process to CT scans only. Hence, there is a need to develop a flexible DNN architecture that can be trained/updated using only CT scans but can effectively utilize PET scans when they become available. In this work, we propose a parameter-efficient multi-modal adaptation (PEMMA) framework for lightweight upgrading of a transformer-based segmentation model trained only on CT scans to also incorporate PET scans. The benefits of the proposed approach are two-fold. Firstly, we leverage the inherent modularity of the transformer architecture and perform low-rank adaptation (LoRA) of the attention weights to achieve parameter-efficient adaptation. Secondly, since the PEMMA framework attempts to minimize cross-modal entanglement, it is possible to subsequently update the combined model using only one modality, without causing catastrophic forgetting of the other modality. Our proposed method achieves comparable results with the performance of early fusion techniques with just 8% of the trainable parameters, especially with a remarkable +28% improvement on the average dice score on PET scans when trained on a single modality.
引用
收藏
页码:262 / 271
页数:10
相关论文
共 50 条
  • [21] Quaternion Cross-Modality Spatial Learning for Multi-Modal Medical Image Segmentation
    Chen, Junyang
    Huang, Guoheng
    Yuan, Xiaochen
    Zhong, Guo
    Zheng, Zewen
    Pun, Chi-Man
    Zhu, Jian
    Huang, Zhixin
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2024, 28 (03) : 1412 - 1423
  • [22] Dual-Attention Deep Fusion Network for Multi-modal Medical Image Segmentation
    Zheng, Shenhai
    Ye, Xin
    Tan, Jiaxin
    Yang, Yifei
    Li, Laquan
    FOURTEENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING, ICGIP 2022, 2022, 12705
  • [23] MEDICAL IMAGE SEGMENTATION BASED ON MULTI-MODAL CONVOLUTIONAL NEURAL NETWORK: STUDY ON IMAGE FUSION SCHEMES
    Guo, Zhe
    Li, Xiang
    Huang, Heng
    Guo, Ning
    Li, Quanzheng
    2018 IEEE 15TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI 2018), 2018, : 903 - 907
  • [24] Multi-modal anchor adaptation learning for multi-modal summarization
    Chen, Zhongfeng
    Lu, Zhenyu
    Rong, Huan
    Zhao, Chuanjun
    Xu, Fan
    NEUROCOMPUTING, 2024, 570
  • [25] Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation
    Xu, Zunnan
    Chen, Zhihong
    Zhang, Yong
    Song, Yibing
    Wan, Xiang
    Li, Guanbin
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 17457 - 17466
  • [26] Split Learning of Multi-Modal Medical Image Classification
    Ghosh, Bishwamittra
    Wang, Yuan
    Fu, Huazhu
    Wei, Qingsong
    Liu, Yong
    Goh, Rick Siow Mong
    2024 IEEE CONFERENCE ON ARTIFICIAL INTELLIGENCE, CAI 2024, 2024, : 1326 - 1331
  • [27] A novel multi-modal medical image fusion algorithm
    Li, Xinhua
    Zhao, Jing
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 12 (02) : 1995 - 2002
  • [28] A novel multi-modal medical image fusion algorithm
    Xinhua Li
    Jing Zhao
    Journal of Ambient Intelligence and Humanized Computing, 2021, 12 : 1995 - 2002
  • [29] Multi-Modal Medical Image Elastic Registration Algorithm
    Zhao, Zhichao
    Wu, T. F.
    INDIAN JOURNAL OF PHARMACEUTICAL SCIENCES, 2019, 81 (01) : S61 - S61
  • [30] Mutual Query Network for Multi-Modal Product Image Segmentation
    Guo, Yun
    Feng, Wei
    Zhang, Zheng
    Ren, Xiancong
    Li, Yaoyu
    Lv, Jingjing
    Zhu, Xin
    Lin, Zhangang
    Shao, Jingping
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 2273 - 2278