DENSELY CONNECTED SWIN-UNET FOR MULTISCALE INFORMATION AGGREGATION IN MEDICAL IMAGE SEGMENTATION

被引:5
|
作者
Wang, Ziyang [1 ]
Su, Meiwen [2 ]
Zheng, Jian-Qing [3 ]
Liu, Yang [4 ]
机构
[1] Univ Oxford, Dept Comp Sci, Oxford, England
[2] Univ Hong Kong, Dept Stat & Actuarial Sci, Hong Kong, Peoples R China
[3] Univ Oxford, Kennedy Inst Rheumatol, Oxford, England
[4] Univ Plymouth, Dept Comp Sci, Plymouth, Devon, England
关键词
Semantic Segmentation; UNet; Vision Transformer;
D O I
10.1109/ICIP49359.2023.10222451
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image semantic segmentation is a dense prediction task in computer vision that is dominated by deep learning techniques in recent years. UNet, which is a symmetric encoder-decoder end-to-end Convolutional Neural Network (CNN) with skip connections, has shown promising performance. Aiming to process the multiscale feature information efficiently, we propose a new Densely Connected Swin-UNet (DCS-UNet) with multiscale information aggregation for medical image segmentation. Firstly, inspired by Swin-Transformer to model long-range dependencies via shift-window-based self-attention, this work proposes the use of fully ViT-based network blocks with a shift-window approach, resulting in a purely self-attention-based U-shape segmentation network. The relevant layers including feature sampling and image tokenization are re-designed to align with the ViT fashion. Secondly, a full-scale deep supervision scheme is developed to process the aggregated feature map with various resolutions generated by different levels of decoders. Thirdly, dense skip connections are proposed that allow the semantic feature information to be thoroughly transferred from different levels of encoders to lower level decoders. Our proposed method is validated on a public benchmark MRI Cardiac segmentation data set with comprehensive validation metrics showing competitive performance against other variant encoder-decoder networks. The code is available at https://github.com/ziyangwang007/VIT4UNet.
引用
收藏
页码:940 / 944
页数:5
相关论文
共 50 条
  • [1] A Spinal MRI Image Segmentation Method Based on Improved Swin-UNet
    Cao, Jie
    Fan, Jiacheng
    Chen, Chin-Ling
    Wu, Zhenyu
    Jiang, Qingxuan
    Li, Shikai
    NETWORK-COMPUTATION IN NEURAL SYSTEMS, 2024,
  • [2] A Siamese Swin-Unet for image change detection
    Tang, Yizhuo
    Cao, Zhengtao
    Guo, Ningbo
    Jiang, Mingyong
    SCIENTIFIC REPORTS, 2024, 14 (01)
  • [3] A Siamese Swin-Unet for image change detection
    Yizhuo Tang
    Zhengtao Cao
    Ningbo Guo
    Mingyong Jiang
    Scientific Reports, 14
  • [4] TD Swin-UNet: Texture-Driven Swin-UNet with Enhanced Boundary-Wise Perception for Retinal Vessel Segmentation
    Li, Angran
    Sun, Mingzhu
    Wang, Zengshuo
    BIOENGINEERING-BASEL, 2024, 11 (05):
  • [5] DENSE SWIN-UNET: DENSE SWIN TRANSFORMERS FOR SEMANTIC SEGMENTATION OF PNEUMOTHORAX IN CT IMAGES
    Tang, Zhixian
    Zhang, Jinyang
    Bai, Chulin
    Zhang, Yan
    Liang, Kaiyi
    Yao, Xufeng
    JOURNAL OF MECHANICS IN MEDICINE AND BIOLOGY, 2023, 23 (08)
  • [6] DenseUNet: densely connected UNet for electron microscopy image segmentation
    Cao, Yue
    Liu, Shigang
    Peng, Yali
    Li, Jun
    IET IMAGE PROCESSING, 2020, 14 (12) : 2682 - 2689
  • [7] DENSE SWIN-UNET: DENSE SWIN TRANSFORMERS FOR SEMANTIC SEGMENTATION OF PNEUMOTHORAX IN CT IMAGES
    Tang, Zhixian
    Zhang, Jinyang
    Bai, Chulin
    Zhang, Yan
    Liang, Kaiyi
    Yao, Xufeng
    JOURNAL OF MECHANICS IN MEDICINE AND BIOLOGY, 2023,
  • [8] Csswin-unet: a Swin-unet network for semantic segmentation of remote sensing images by aggregating contextual information and extracting spatial information
    Xiao, Dong
    Kang, Zhihao
    Fu, Yanhua
    Li, Zhenni
    Ran, Mengying
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2023, 44 (23) : 7598 - 7625
  • [9] A Fabric Defect Segmentation Model Based on Improved Swin-Unet with Gabor Filter
    Xu, Haitao
    Liu, Chengming
    Duan, Shuya
    Ren, Liangpin
    Cheng, Guozhen
    Hao, Bing
    APPLIED SCIENCES-BASEL, 2023, 13 (20):
  • [10] Swin-HAUnet: A Swin-Hierarchical Attention Unet For Enhanced Medical Image Segmentation
    Chen, Jiarong
    Zhang, Xuyang
    Li, Rongwen
    Zhou, Peng
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT XIV, 2025, 15044 : 371 - 385