D-former: a U-shaped Dilated Transformer for 3D medical image segmentation

被引:0
|
作者
Wu, Yixuan [1 ]
Liao, Kuanlun [2 ]
Chen, Jintai [2 ]
Wang, Jinhong [2 ]
Chen, Danny Z. [3 ]
Gao, Honghao [4 ,5 ]
Wu, Jian [6 ,7 ]
机构
[1] Zhejiang Univ, Sch Med, Hangzhou 310030, Peoples R China
[2] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310058, Peoples R China
[3] Univ Notre Dame, Dept Comp Sci & Engn, Notre Dame, IN 46556 USA
[4] Gachon Univ, Coll Future Ind, Seongnam 13120, South Korea
[5] Shanghai Univ, Sch Comp Engn & Sci, Shanghai 200444, Peoples R China
[6] Zhejiang Univ, Affiliated Hosp 2, Sch Med, Hangzhou 310058, Peoples R China
[7] Zhejiang Univ, Sch Publ Hlth, Hangzhou 310058, Peoples R China
来源
NEURAL COMPUTING & APPLICATIONS | 2023年 / 35卷 / 02期
基金
中国国家自然科学基金;
关键词
Medical image analysis; Segmentation; Transformer; Long-range dependency; Position encoding; NETWORKS; ATTENTION;
D O I
10.1007/s00521-022-07859-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Computer-aided medical image segmentation has been applied widely in diagnosis and treatment to obtain clinically useful information of shapes and volumes of target organs and tissues. In the past several years, convolutional neural network (CNN)-based methods (e.g., U-Net) have dominated this area, but still suffered from inadequate long-range information capturing. Hence, recent work presented computer vision Transformer variants for medical image segmentation tasks and obtained promising performances. Such Transformers modeled long-range dependency by computing pair-wise patch relations. However, they incurred prohibitive computational costs, especially on 3D medical images (e.g., CT and MRI). In this paper, we propose a new method called Dilated Transformer, which conducts self-attention alternately in local and global scopes for pair-wise patch relations capturing. Inspired by dilated convolution kernels, we conduct the global self-attention in a dilated manner, enlarging receptive fields without increasing the patches involved and thus reducing computational costs. Based on this design of Dilated Transformer, we construct a U-shaped encoder-decoder hierarchical architecture called D-Former for 3D medical image segmentation. Experiments on the Synapse and ACDC datasets show that our D-Former model, trained from scratch, outperforms various competitive CNN-based or Transformer-based segmentation models at a low computational cost without time-consuming per-training process.
引用
收藏
页码:1931 / 1944
页数:14
相关论文
共 50 条
  • [21] A Transformer-Based Network for Anisotropic 3D Medical Image Segmentation
    Guo, Danfeng
    Terzopoulos, Demetri
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 8857 - 8861
  • [22] CoTr: Efficiently Bridging CNN and Transformer for 3D Medical Image Segmentation
    Xie, Yutong
    Zhang, Jianpeng
    Shen, Chunhua
    Xia, Yong
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT III, 2021, 12903 : 171 - 180
  • [23] MBUTransNet: multi-branch U-shaped network fusion transformer architecture for medical image segmentation
    JunBo Qiao
    Xing Wang
    Ji Chen
    MingTao Liu
    International Journal of Computer Assisted Radiology and Surgery, 2023, 18 : 1895 - 1902
  • [24] MBUTransNet: multi-branch U-shaped network fusion transformer architecture for medical image segmentation
    Qiao, JunBo
    Wang, Xing
    Chen, Ji
    Liu, MingTao
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2023, 18 (10) : 1895 - 1902
  • [25] U-shaped Densely Connected Convolutional Networks for Automatic 3D Cardiovascular MR Segmentation
    Ran, Chongyang
    Liu, Ping
    Qian, Yinling
    He, Yucheng
    Wang, Qiong
    2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO), 2018, : 1010 - 1015
  • [26] A 3D Liver Semantic Segmentation Method Based on U-shaped Feature Fusion Enhancement
    Jiang, Daoran
    Zhang, Xiaolong
    Lin, Xiaoli
    Deng, He
    Ren, Hongwei
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT II, ICIC 2024, 2024, 14863 : 15 - 27
  • [27] 3D Medical Axial Transformer: A Lightweight Transformer Model for 3D Brain Tumor Segmentation
    Liu, Cheng
    Kiryu, Hisanori
    MEDICAL IMAGING WITH DEEP LEARNING, VOL 227, 2023, 227 : 799 - 813
  • [28] TU-Former: A Hybrid U-Shaped Transformer Network for SAR Image Denoising
    Tian, Shikang
    Liu, Shuaiqi
    Zhao, Yuhang
    Liu, Siyuan
    Zhao, Shuhuan
    Zhao, Jie
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT XI, 2024, 14435 : 377 - 389
  • [29] 3D medical image segmentation technique
    El-said, Shaimaa Ahmed
    INTERNATIONAL JOURNAL OF BIOMEDICAL ENGINEERING AND TECHNOLOGY, 2015, 17 (03) : 232 - 251
  • [30] TT-Net: Tensorized Transformer Network for 3D medical image segmentation
    Wang, Jing
    Qu, Aixi
    Wang, Qing
    Zhao, Qibin
    Liu, Ju
    Wu, Qiang
    COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2023, 107