Vision Mamba and xLSTM-UNet for medical image segmentation

被引:0
|
作者
Zhong, Xin [1 ]
Lu, Gehao [1 ]
Li, Hao [1 ]
机构
[1] Yunnan Univ, Sch Informat Sci & Engn, Kunming 650504, Yunnan, Peoples R China
来源
SCIENTIFIC REPORTS | 2025年 / 15卷 / 01期
关键词
Deep Learning; Medical Image Segmentation; SSM; XLSTM; LSTM; FRAMEWORK;
D O I
10.1038/s41598-025-88967-5
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Deep learning-based medical image segmentation methods are generally divided into convolutional neural networks (CNNs) and Transformer-based models. Traditional CNNs are limited by their receptive field, making it challenging to capture long-range dependencies. While Transformers excel at modeling global information, their high computational complexity restricts their practical application in clinical scenarios. To address these limitations, this study introduces VMAXL-UNet, a novel segmentation network that integrates Structured State Space Models (SSM) and lightweight LSTMs (xLSTM). The network incorporates Visual State Space (VSS) and ViL modules in the encoder to efficiently fuse local boundary details with global semantic context. The VSS module leverages SSM to capture long-range dependencies and extract critical features from distant regions. Meanwhile, the ViL module employs a gating mechanism to enhance the integration of local and global features, thereby improving segmentation accuracy and robustness. Experiments on datasets such as ISIC17, ISIC18, CVC-ClinicDB, and Kvasir demonstrate that VMAXL-UNet significantly outperforms traditional CNNs and Transformer-based models in capturing lesion boundaries and their distant correlations. These results highlight the model's superior performance and provide a promising approach for efficient segmentation in complex medical imaging scenarios.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] SMESwin Unet: Merging CNN and Transformer for Medical Image Segmentation
    Wang, Ziheng
    Min, Xiongkuo
    Shi, Fangyu
    Jin, Ruinian
    Nawrin, Saida S.
    Yu, Ichen
    Nagatomi, Ryoichi
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT V, 2022, 13435 : 517 - 526
  • [22] Light-UNet: An Efficient Segmentation Network for Medical Image
    Zhang, Yue
    Xu, Chao
    Zhang, Zhifan
    Wang, Jianjun
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT VI, ICIC 2024, 2024, 14867 : 302 - 313
  • [23] Semantic Segmentation in Medical Image Based on Hybrid Dlinknet and Unet
    Samudrala, Suresh
    Mohan, C. Krishna
    3rd IEEE 2022 International Conference on Computing, Communication, and Intelligent Systems, ICCCIS 2022, 2022, : 42 - 47
  • [24] CSWin-UNet: Transformer UNet with cross-shaped windows for medical image segmentation
    Liu, Xiao
    Gao, Peng
    Yu, Tao
    Wang, Fei
    Yuan, Ru-Yue
    INFORMATION FUSION, 2025, 113
  • [25] DEA-UNet: a dense-edge-attention UNet architecture for medical image segmentation
    Zeng, Zhenhuan
    Fan, Chaodong
    Xiao, Leyi
    Qu, Xilong
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (04)
  • [26] MLFA-UNet: A multi-level feature assembly UNet for medical image segmentation
    Garbaz, Anass
    Oukdacha, Yassine
    Charfi, Said
    El Ansari, Mohamed
    Koutti, Lahcen
    Salihoun, Mouna
    METHODS, 2024, 232 : 52 - 64
  • [27] LIT-Unet: a lightweight and effective model for medical image segmentation
    Wang, Ru
    Kou, Qiqi
    Dou, Lina
    RADIOLOGICAL PHYSICS AND TECHNOLOGY, 2024, 17 (04) : 878 - 887
  • [28] NAS-Unet: Neural Architecture Search for Medical Image Segmentation
    Weng, Yu
    Zhou, Tianbao
    Li, Yujie
    Qiu, Xiaoyu
    IEEE ACCESS, 2019, 7 : 44247 - 44257
  • [29] Automatic Medical Image Segmentation with Vision Transformer
    Zhang, Jie
    Li, Fan
    Zhang, Xin
    Wang, Huaijun
    Hei, Xinhong
    APPLIED SCIENCES-BASEL, 2024, 14 (07):
  • [30] EMED-UNet: An Efficient Multi-Encoder-Decoder Based UNet for Medical Image Segmentation
    Shah, Kashish D.
    Patel, Dhaval K.
    Thaker, Minesh P.
    Patel, Harsh A.
    Saikia, Manob Jyoti
    Ranger, Bryan J.
    IEEE ACCESS, 2023, 11 : 95253 - 95266