MultiTrans: Multi-branch transformer network for medical image segmentation

被引:1
|
作者
Zhang, Yanhua [1 ,2 ]
Balestra, Gabriella [1 ]
Zhang, Ke [2 ]
Wang, Jingyu [2 ]
Rosati, Samanta [1 ]
Giannini, Valentina [3 ,4 ]
机构
[1] Politecn Torino, Dept Elect & Telecommun, Corso Duca Abruzzi 24, I-10129 Turin, Italy
[2] Northwestern Polytech Univ, Sch Astronaut, 127 West Youyi Rd, Xian 710072, Peoples R China
[3] Univ Turin, Dept Surg Sci, I-10124 Turin, Italy
[4] FPO IRCCS, Candiolo Canc Inst, Radiol Unit, I-10060 Candiolo, Italy
关键词
Medical image segmentation; Abdominal multi-organ segmentation; Cardiac segmentation; Deep learning; Efficient self-attention; Multi-branch transformers; ATTENTION; NET;
D O I
10.1016/j.cmpb.2024.108280
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Background and Objective: Transformer, which is notable for its ability of global context modeling, has been used to remedy the shortcomings of Convolutional neural networks (CNN) and break its dominance in medical image segmentation. However, the self -attention module is both memory and computational inefficient, so many methods have to build their Transformer branch upon largely downsampled feature maps or adopt the tokenized image patches to fit their model into accessible GPUs. This patch -wise operation restricts the network in extracting pixel -level intrinsic structural or dependencies inside each patch, hurting the performance of pixel -level classification tasks. Methods: To tackle these issues, we propose a memory- and computation -efficient self -attention module to enable reasoning on relatively high -resolution features, promoting the efficiency of learning global information while effective grasping fine spatial details. Furthermore, we design a novel Multi -Branch Transformer (MultiTrans) architecture to provide hierarchical features for handling objects with variable shapes and sizes in medical images. By building four parallel Transformer branches on different levels of CNN, our hybrid network aggregates both multi -scale global contexts and multi -scale local features. Results: MultiTrans achieves the highest segmentation accuracy on three medical image datasets with different modalities: Synapse, ACDC and M&Ms. Compared to the Standard Self -Attention (SSA), the proposed Efficient Self -Attention (ESA) can largely reduce the training memory and computational complexity while even slightly improve the accuracy. Specifically, the training memory cost, FLOPs and Params of our ESA are 18.77%, 20.68% and 74.07% of the SSA. Conclusions: Experiments on three medical image datasets demonstrate the generality and robustness of the designed network. The ablation study shows the efficiency and effectiveness of our proposed ESA. Code is available at: https://github.com/Yanhua-Zhang/MultiTrans-extension.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] MIINet: a multi-branch information interaction network for few-shot segmentation
    Zhang, Zhaopeng
    Xu, Zhijie
    Zhang, Jianqin
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (12) : 9081 - 9094
  • [42] A Medical Image Segmentation Network with Multi-Scale and Dual-Branch Attention
    Zhu, Cancan
    Cheng, Ke
    Hua, Xuecheng
    APPLIED SCIENCES-BASEL, 2024, 14 (14):
  • [43] Multi-Branch and Progressive Network for Low-Light Image Enhancement
    Zhang, Kaibing
    Yuan, Cheng
    Li, Jie
    Gao, Xinbo
    Li, Minqi
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 2295 - 2308
  • [44] A Novel Multi-Branch Channel Expansion Network for Garbage Image Classification
    Shi, Cuiping
    Xia, Ruiyang
    Wang, Liguo
    IEEE ACCESS, 2020, 8 : 154436 - 154452
  • [45] FDB-Net: Fusion double branch network combining CNN and transformer for medical image segmentation
    Jiang, Zhongchuan
    Wu, Yun
    Huang, Lei
    Gu, Maohua
    JOURNAL OF X-RAY SCIENCE AND TECHNOLOGY, 2024, 32 (04) : 931 - 951
  • [46] Feature ensemble network for medical image segmentation with multi-scale atrous transformer
    Gai, Di
    Geng, Yuhan
    Huang, Xia
    Huang, Zheng
    Xiong, Xin
    Zhou, Ruihua
    Wang, Qi
    IET IMAGE PROCESSING, 2024, 18 (11) : 3082 - 3092
  • [47] Research on road crack segmentation based on deep convolution and transformer with multi-branch feature fusion
    Lai, Yuebo
    Liu, Bing
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2024, 35 (11)
  • [48] CTRANS: A MULTI-RESOLUTION CONVOLUTION-TRANSFORMER NETWORK FOR MEDICAL IMAGE SEGMENTATION
    Gong, Zhendi
    French, Andrew P.
    Qiu, Guoping
    Chen, Xin
    IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI 2024, 2024,
  • [49] A novel multi-task semi-supervised medical image segmentation method based on multi-branch cross pseudo supervision
    Yueyue Xiao
    Chunxiao Chen
    Xue Fu
    Liang Wang
    Jie Yu
    Yuan Zou
    Applied Intelligence, 2023, 53 : 30343 - 30358
  • [50] A novel multi-task semi-supervised medical image segmentation method based on multi-branch cross pseudo supervision
    Xiao, Yueyue
    Chen, Chunxiao
    Fu, Xue
    Wang, Liang
    Yu, Jie
    Zou, Yuan
    APPLIED INTELLIGENCE, 2023, 53 (24) : 30359 - 30383