D-former: a U-shaped Dilated Transformer for 3D medical image segmentation

被引:0
|
作者
Wu, Yixuan [1 ]
Liao, Kuanlun [2 ]
Chen, Jintai [2 ]
Wang, Jinhong [2 ]
Chen, Danny Z. [3 ]
Gao, Honghao [4 ,5 ]
Wu, Jian [6 ,7 ]
机构
[1] Zhejiang Univ, Sch Med, Hangzhou 310030, Peoples R China
[2] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310058, Peoples R China
[3] Univ Notre Dame, Dept Comp Sci & Engn, Notre Dame, IN 46556 USA
[4] Gachon Univ, Coll Future Ind, Seongnam 13120, South Korea
[5] Shanghai Univ, Sch Comp Engn & Sci, Shanghai 200444, Peoples R China
[6] Zhejiang Univ, Affiliated Hosp 2, Sch Med, Hangzhou 310058, Peoples R China
[7] Zhejiang Univ, Sch Publ Hlth, Hangzhou 310058, Peoples R China
来源
NEURAL COMPUTING & APPLICATIONS | 2023年 / 35卷 / 02期
基金
中国国家自然科学基金;
关键词
Medical image analysis; Segmentation; Transformer; Long-range dependency; Position encoding; NETWORKS; ATTENTION;
D O I
10.1007/s00521-022-07859-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Computer-aided medical image segmentation has been applied widely in diagnosis and treatment to obtain clinically useful information of shapes and volumes of target organs and tissues. In the past several years, convolutional neural network (CNN)-based methods (e.g., U-Net) have dominated this area, but still suffered from inadequate long-range information capturing. Hence, recent work presented computer vision Transformer variants for medical image segmentation tasks and obtained promising performances. Such Transformers modeled long-range dependency by computing pair-wise patch relations. However, they incurred prohibitive computational costs, especially on 3D medical images (e.g., CT and MRI). In this paper, we propose a new method called Dilated Transformer, which conducts self-attention alternately in local and global scopes for pair-wise patch relations capturing. Inspired by dilated convolution kernels, we conduct the global self-attention in a dilated manner, enlarging receptive fields without increasing the patches involved and thus reducing computational costs. Based on this design of Dilated Transformer, we construct a U-shaped encoder-decoder hierarchical architecture called D-Former for 3D medical image segmentation. Experiments on the Synapse and ACDC datasets show that our D-Former model, trained from scratch, outperforms various competitive CNN-based or Transformer-based segmentation models at a low computational cost without time-consuming per-training process.
引用
收藏
页码:1931 / 1944
页数:14
相关论文
共 50 条
  • [41] LMU-Net: lightweight U-shaped network for medical image segmentation
    Ting Ma
    Ke Wang
    Feng Hu
    Medical & Biological Engineering & Computing, 2024, 62 : 61 - 70
  • [42] Two-dimensional medical image segmentation based on U-shaped structure
    Cai, Sijing
    Xiao, Yuwei
    Wang, Yanyu
    INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2024, 34 (01)
  • [43] FTUNet: A Feature-Enhanced Network for Medical Image Segmentation Based on the Combination of U-Shaped Network and Vision Transformer
    Wang, Yuefei
    Yu, Xi
    Yang, Yixi
    Zeng, Shijie
    Xu, Yuquan
    Feng, Ronghui
    NEURAL PROCESSING LETTERS, 2024, 56 (02)
  • [44] STM-UNet: An Efficient U-shaped Architecture Based on Swin Transformer and Multiscale MLP for Medical Image Segmentation
    Shi, Lei
    Gao, Tianyu
    Zhang, Zheng
    Zhang, Junxing
    IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 2003 - 2008
  • [45] FTUNet: A Feature-Enhanced Network for Medical Image Segmentation Based on the Combination of U-Shaped Network and Vision Transformer
    Yuefei Wang
    Xi Yu
    Yixi Yang
    Shijie Zeng
    Yuquan Xu
    Ronghui Feng
    Neural Processing Letters, 56
  • [46] Uformer: A General U-Shaped Transformer for Image Restoration
    Wang, Zhendong
    Cun, Xiaodong
    Bao, Jianmin
    Zhou, Wengang
    Liu, Jianzhuang
    Li, Houqiang
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 17662 - 17672
  • [47] MSCT-UNET: multi-scale contrastive transformer within U-shaped network for medical image segmentation
    Xi, Heran
    Dong, Haoji
    Sheng, Yue
    Cui, Hui
    Huang, Chengying
    Li, Jinbao
    Zhu, Jinghua
    PHYSICS IN MEDICINE AND BIOLOGY, 2024, 69 (01):
  • [48] A hybrid framework for 3D medical image segmentation
    Chen, T
    Metaxas, D
    MEDICAL IMAGE ANALYSIS, 2005, 9 (06) : 547 - 565
  • [49] UNETR: Transformers for 3D Medical Image Segmentation
    Hatamizadeh, Ali
    Tang, Yucheng
    Nath, Vishwesh
    Yang, Dong
    Myronenko, Andriy
    Landman, Bennett
    Roth, Holger R.
    Xu, Daguang
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 1748 - 1758
  • [50] Comparison of 3D and 2D method to study the propagation in a U-shaped valley
    Hamel, Pierrick
    Adam, Jean-Pierre
    Beniguel, Yannick
    Joly, Jean-Christophe
    2015 9th European Conference on Antennas and Propagation (EuCAP), 2015,