Medical Image Segmentation Based on Multi-Scale Convolution Modulation

被引:0
|
作者
Zhou, Xin-Min [1 ,2 ]
Xiong, Zhi-Mou [3 ]
Shi, Chang-Fa [4 ,5 ]
Yang, Jian [3 ]
机构
[1] School of Artificial Intelligence and Advanced Computing, Hunan University of Technology and Business, Hunan, Changsha,410205, China
[2] Xiangjiang Laboratory, Hunan, Changsha,410205, China
[3] School of Computer Science, Hunan University of Technology and Business, Hunan, Changsha,410205, China
[4] School of Intelligent Engineering and Intelligent Manufacturing, Hunan University of Technology and Business, Hunan, Changsha,410205, China
[5] Changsha Social Laboratory of Artificial Intelligence, Hunan University of Technology and Business, Hunan, Changsha,410205, China
来源
基金
中国国家自然科学基金;
关键词
Basic structure - Convolutional modulation - Convolutional neural network - Features extraction - Image segmentation model - Input sequence - Medical image segmentation - Multi-scales - Transformer - Transformer modeling;
D O I
10.12263/DZXB.20231068
中图分类号
学科分类号
摘要
Currently, more and more medical image segmentation models are using Transformer as their basic structure. However, the computational complexity of the Transformer model is quadratic with respect to the input sequence, and it requires a large amount of data for pre-training in order to achieve good results. In situations where there is insufficient data, the Transformer's advantages cannot be fully realized. Additionally, the Transformer often fails to effectively extract local information from images. In contrast, convolutional neural networks can effectively avoid these two problems. In order to fully leverage the strengths of both convolutional neural networks and Transformers and further explore the potential of convolutional neural networks, this paper proposes a multi-scale convolution modulation network (MSCMNet) model. This model incorporates the design methodology of visual Transformer models into traditional convolutional networks. By using convolution modulation and multi-scale feature extraction strategies, a feature extraction module based on multi-scale convolution modulation (MSCM) is constructed. Efficient patch combination and patch decomposition strategies are also proposed for downsampling and upsampling of feature maps, respectively, further enhancing the model's representation ability. The mDice scores obtained on four different types and sizes of medical image segmentation datasets - multiple organs in the abdomen, heart, skin cancer, and nucleus - are 0.805 7, 0.923 3, 0.923 9 and 0.854 8, respectively. With lower computational complexity and parameter count, MSCMNet achieves the best segmentation performance, providing a novel and efficient model structure design paradigm for convolutional neural networks and Transformers in the field of medical image segmentation. © 2024 Chinese Institute of Electronics. All rights reserved.
引用
收藏
页码:3159 / 3171
相关论文
共 50 条
  • [1] AMSUnet: A neural network using atrous multi-scale convolution for medical image segmentation
    Yin, Yunchou
    Han, Zhimeng
    Jian, Muwei
    Wang, Gai-Ge
    Chen, Liyan
    Wang, Rui
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 162
  • [2] Research on Medical Image Segmentation based on Multi-scale CLT
    Zhang Cai-qing
    Liu Hui
    FIFTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 2, PROCEEDINGS, 2008, : 192 - +
  • [3] MSLUnet: A Medical Image Segmentation Network Incorporating Multi-Scale Semantics and Large Kernel Convolution
    Zhu, Shijuan
    Cheng, Lingfei
    APPLIED SCIENCES-BASEL, 2024, 14 (15):
  • [4] Automatic segmentation method using FCN with multi-scale dilated convolution for medical ultrasound image
    Qian, Ledan
    Huang, Huiling
    Xia, Xiaonyu
    Li, Yi
    Zhou, Xiao
    VISUAL COMPUTER, 2023, 39 (11): : 5953 - 5969
  • [5] Automatic segmentation method using FCN with multi-scale dilated convolution for medical ultrasound image
    Ledan Qian
    Huiling Huang
    Xiaonyu Xia
    Yi Li
    Xiao Zhou
    The Visual Computer, 2023, 39 : 5953 - 5969
  • [6] Multi-scale Medical Image Segmentation Based on Salient Region Detection
    Wu, Yingxue
    Zhao, Xi
    Xie, Guiyang
    Liang, Yangkexin
    Wang, Wei
    Li, Yue
    BIOMETRIC RECOGNITION, CCBR 2015, 2015, 9428 : 624 - 632
  • [7] Multi-Scale Deep Neural Network Based on Dilated Convolution for Spacecraft Image Segmentation
    Liu, Yuan
    Zhu, Ming
    Wang, Jing
    Guo, Xiangji
    Yang, Yifan
    Wang, Jiarong
    SENSORS, 2022, 22 (11)
  • [8] CMLCNet: medical image segmentation network based on convolution capsule encoder and multi-scale local co-occurrence
    Qin, Chendong
    Wang, Yongxiong
    Zhang, Jiapeng
    MULTIMEDIA SYSTEMS, 2024, 30 (04)
  • [9] DMFC-UFormer: Depthwise multi-scale factorized convolution transformer-based UNet for medical image segmentation
    Garbaz, Anass
    Oukdach, Yassine
    Charfi, Said
    El Ansari, Mohamed
    Koutti, Lahcen
    Salihoun, Mouna
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 101
  • [10] Multi-scale image segmentation based on morphology
    Wang, XP
    Hao, CY
    Fan, YY
    Xi, YL
    CHINESE JOURNAL OF ELECTRONICS, 2005, 14 (01): : 119 - 121