DENSELY CONNECTED SWIN-UNET FOR MULTISCALE INFORMATION AGGREGATION IN MEDICAL IMAGE SEGMENTATION

Cited by: 5
Authors
Wang, Ziyang [1]
Su, Meiwen [2]
Zheng, Jian-Qing [3]
Liu, Yang [4]
Affiliations
[1] Univ Oxford, Dept Comp Sci, Oxford, England
[2] Univ Hong Kong, Dept Stat & Actuarial Sci, Hong Kong, Peoples R China
[3] Univ Oxford, Kennedy Inst Rheumatol, Oxford, England
[4] Univ Plymouth, Dept Comp Sci, Plymouth, Devon, England
Keywords
Semantic Segmentation; UNet; Vision Transformer
DOI
10.1109/ICIP49359.2023.10222451
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Image semantic segmentation is a dense prediction task in computer vision that has been dominated by deep learning techniques in recent years. UNet, a symmetric encoder-decoder end-to-end Convolutional Neural Network (CNN) with skip connections, has shown promising performance. Aiming to process multiscale feature information efficiently, we propose a new Densely Connected Swin-UNet (DCS-UNet) with multiscale information aggregation for medical image segmentation. Firstly, inspired by Swin-Transformer, which models long-range dependencies via shift-window-based self-attention, this work proposes the use of fully ViT-based network blocks with a shift-window approach, resulting in a purely self-attention-based U-shape segmentation network. The relevant layers, including feature sampling and image tokenization, are re-designed to align with the ViT fashion. Secondly, a full-scale deep supervision scheme is developed to process the aggregated feature maps of various resolutions generated by different levels of decoders. Thirdly, dense skip connections are proposed that allow the semantic feature information to be thoroughly transferred from different levels of encoders to lower-level decoders. Our proposed method is validated on a public benchmark MRI cardiac segmentation data set with comprehensive validation metrics, showing competitive performance against other variant encoder-decoder networks. The code is available at https://github.com/ziyangwang007/VIT4UNet.
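The shift-window idea mentioned in the abstract can be illustrated with a small sketch. The code below is not taken from the authors' repository; it is a minimal pure-Python illustration of Swin-style windowing, assuming a 4 x 4 grid of token indices and a window size of 2. The helper names `cyclic_shift` and `window_partition` are hypothetical. Self-attention would be computed inside each window; shifting the grid between layers lets tokens from neighbouring windows end up in the same window.

```python
def cyclic_shift(grid, shift):
    """Roll a 2D grid up and left by `shift` cells with wrap-around."""
    h, w = len(grid), len(grid[0])
    return [
        [grid[(r + shift) % h][(c + shift) % w] for c in range(w)]
        for r in range(h)
    ]


def window_partition(grid, window):
    """Split a 2D grid into non-overlapping window x window blocks.

    Blocks are returned in row-major order, each flattened to a list.
    """
    h, w = len(grid), len(grid[0])
    if h % window or w % window:
        raise ValueError("grid size must be divisible by the window size")
    blocks = []
    for r0 in range(0, h, window):
        for c0 in range(0, w, window):
            blocks.append([
                grid[r][c]
                for r in range(r0, r0 + window)
                for c in range(c0, c0 + window)
            ])
    return blocks


# Assumed toy input: token indices 0..15 laid out on a 4 x 4 grid.
tokens = [[r * 4 + c for c in range(4)] for r in range(4)]

# Regular windows: attention is restricted to each 2 x 2 block.
regular = window_partition(tokens, 2)

# Shifted windows: shift by half the window size, then partition again.
shifted = window_partition(cyclic_shift(tokens, 1), 2)

print(regular[0])  # [0, 1, 4, 5]
print(shifted[0])  # [5, 6, 9, 10]
```

In the regular partition, tokens 5 and 10 sit in different windows; after the shift they share one, which is how information crosses window borders. In the original Swin Transformer, windows that contain wrapped-around tokens additionally use an attention mask, which this sketch omits.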
Pages: 940-944
Number of pages: 5