DENSELY CONNECTED SWIN-UNET FOR MULTISCALE INFORMATION AGGREGATION IN MEDICAL IMAGE SEGMENTATION

Cited by: 5
Authors
Wang, Ziyang [1]
Su, Meiwen [2]
Zheng, Jian-Qing [3]
Liu, Yang [4]
Affiliations
[1] Univ Oxford, Dept Comp Sci, Oxford, England
[2] Univ Hong Kong, Dept Stat & Actuarial Sci, Hong Kong, Peoples R China
[3] Univ Oxford, Kennedy Inst Rheumatol, Oxford, England
[4] Univ Plymouth, Dept Comp Sci, Plymouth, Devon, England
Keywords
Semantic Segmentation; UNet; Vision Transformer;
DOI
10.1109/ICIP49359.2023.10222451
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Image semantic segmentation is a dense prediction task in computer vision that has been dominated by deep learning techniques in recent years. UNet, a symmetric encoder-decoder end-to-end Convolutional Neural Network (CNN) with skip connections, has shown promising performance. To process multiscale feature information efficiently, we propose a new Densely Connected Swin-UNet (DCS-UNet) with multiscale information aggregation for medical image segmentation. First, inspired by the Swin Transformer, which models long-range dependencies via shifted-window-based self-attention, this work uses fully ViT-based network blocks with a shifted-window approach, resulting in a purely self-attention-based U-shaped segmentation network. The relevant layers, including feature sampling and image tokenization, are redesigned to align with the ViT fashion. Second, a full-scale deep supervision scheme is developed to process the aggregated feature maps of various resolutions generated by different levels of decoders. Third, dense skip connections are proposed that allow semantic feature information to be thoroughly transferred from different levels of encoders to lower-level decoders. The proposed method is validated on a public benchmark MRI cardiac segmentation dataset with comprehensive validation metrics, showing competitive performance against other encoder-decoder network variants. The code is available at https://github.com/ziyangwang007/VIT4UNet.
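The shifted-window idea mentioned in the abstract can be illustrated with a toy sketch. It is not taken from the authors' repository: the helper names `cyclic_shift` and `window_partition`, the 4x4 grid, and the window size of 2 are all assumptions chosen for illustration. The sketch only shows how a cyclic shift changes which tokens share a window, which is what lets attention computed inside windows mix information across the previous window boundaries.

```python
# Toy illustration of shifted-window partitioning (Swin-style).
# All names and sizes here are hypothetical, not the authors' implementation.

def cyclic_shift(grid, shift):
    """Roll a 2D grid up and left by `shift` positions, wrapping around."""
    h, w = len(grid), len(grid[0])
    return [[grid[(r + shift) % h][(c + shift) % w] for c in range(w)]
            for r in range(h)]

def window_partition(grid, window):
    """Split a 2D grid into non-overlapping window x window groups of tokens."""
    h, w = len(grid), len(grid[0])
    assert h % window == 0 and w % window == 0, "grid must tile evenly"
    windows = []
    for r0 in range(0, h, window):
        for c0 in range(0, w, window):
            windows.append([grid[r][c]
                            for r in range(r0, r0 + window)
                            for c in range(c0, c0 + window)])
    return windows

# 4x4 grid of token ids 0..15, window size 2, shift of window // 2 = 1.
grid = [[r * 4 + c for c in range(4)] for r in range(4)]
regular = window_partition(grid, 2)
shifted = window_partition(cyclic_shift(grid, 1), 2)

print(regular[0])  # tokens grouped by the regular partition
print(shifted[0])  # tokens grouped after the cyclic shift
```

In this toy case the first shifted window groups one token from each of the four regular windows, so self-attention restricted to that window connects tokens that the regular partition kept apart. A real implementation would also mask attention between tokens that are only adjacent because of the wrap-around.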
Pages: 940-944
Page count: 5