MulA-nnUNet: A Multi-Attention Enhanced nnUNet Framework for 3D Abdominal Multi-Organs Segmentation

被引:0
|
作者
Ding, Jiashuo [1 ]
Ni, Wei [1 ]
Wan, Jiahui [2 ]
Deng, Xiaojun [1 ]
Wan, Lanjun [1 ]
机构
[1] Hunan Univ Technol, Sch Comp Sci, Zhuzhou 412007, Peoples R China
[2] Hunan Agr Univ, Coll Mech & Elect Engn, Changsha 410125, Peoples R China
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Image segmentation; Three-dimensional displays; Semantics; Decoding; Accuracy; Attention mechanisms; Tumors; Abdominal multi-organ image segmentation; attention mechanism; deep learning; nnUNet;
D O I
10.1109/ACCESS.2024.3437652
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the domain of medical image segmentation, the nnUNet framework is highly respected for its excellent performance and wide range of applications. However, the inherent bias of locality and weight sharing introduced by the continuous convolutional operations currently used limits the network's performance in modeling long-term dependencies. Furthermore, in the process of implementing residual links, certain limitations are encountered due to the substantial semantic discrepancy between the encoder's output feature maps and the decoder's. These limitations are seen in the direct application of skip connections for feature fusion and gradient propagation, which are known to impact the model's convergence speed and overall performance. In this paper, a novel framework is presented, namely Multi-Attention nnUNet (MulA-nnUNet), which utilizes nnUNet as the foundational network structure and integrates two key attention mechanisms: large kernel convolutional attention (LKA) and pixel attention (PA). LKA is embedded within the deep encoder, maintaining the effectiveness of shallow feature extraction and enhancing the deep neural networks' ability to understand long-range spatial dependencies. At the same time, the semantic distinction between the encoder and decoder's output map of features is decreased by the PA module, which helps to improve the effect of skip connection feature fusion. The complexity of the model is reduced by replacing the standard convolutions in the encoder and decoder layers with depthwise separable convolutions (DS), which have fewer parameters. The effectiveness of the proposed framework is confirmed by a set of ablation experiments and comparison experiments with current state-of-the-art models on the computed tomography (CT) subset of the multimodal abdominal multi-organ segmentation dataset (AMOS), which includes 500 CT scans, with 350 scans for training, 75 for validation, and 75 for testing. MulA-nnUNet shows improvements of 1.1% in mean dice similarity coefficient (mDSC) and 1.52% in mean intersection over union (mIoU), while the baseline model requires 5 times the floating point operations (FLOPs) and over 7 times the parameters (Params). Additionally, it demonstrates superior accuracy in segmenting organs such as the liver, stomach, aorta, and pancreas, thereby enhancing the accuracy of 3D abdominal multi-organ image segmentation.
引用
收藏
页码:106658 / 106671
页数:14
相关论文
共 50 条
  • [21] A web based 3D model construction tool for abdominal organs segmentation
    Zhang, Xuejun
    Li, Bijiang
    Wu, Dongbo
    BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2018, 124 : 26 - 26
  • [22] REMOVAL OF ABDOMINAL WALL FOR 3D VISUALIZATION AND SEGMENTATION OF ORGANS IN CT VOLUME
    Ding, Feng
    Leow, Wee Kheng
    Venkatesh, Sudhakar
    2009 16TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-6, 2009, : 3373 - +
  • [23] A Multi-stage Framework for 3D Individual Tooth Segmentation in Dental CBCT
    Wang, Chunshi
    Zhao, Bin
    Ding, Shuxue
    SEMI-SUPERVISED TOOTH SEGMENTATION, SEMITOOTHSEG 2023, 2025, 14623 : 36 - 45
  • [24] Hierarchical Multi-Organ Segmentation Without Registration in 3D Abdominal CT Images
    Zografos, Vasileios
    Valentinitsch, Alexander
    Rempfler, Markus
    Tombari, Federico
    Menze, Bjoern
    Medical Computer Vision: Algorithms for Big Data, 2016, 9601 : 37 - 46
  • [25] TDPC-Net: Multi-scale lightweight and efficient 3D segmentation network with a 3D attention mechanism for brain tumor segmentation
    Li, Yixuan
    Kang, Jie
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 99
  • [26] Efficient multiscale spatial attention 3D abdominal multiorgan segmentation model
    Yan, Chenxi
    Hou, Huimin
    Shen, Tongtong
    Xu, Huafei
    Zhai, Chen
    Zheng, Wen
    INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2024, 34 (03)
  • [27] Segmentation of multiple organs in non-contrast 3D abdominal CT images
    Akinobu Shimizu
    Rena Ohno
    Takaya Ikegami
    Hidefumi Kobatake
    Shigeru Nawano
    Daniel Smutek
    International Journal of Computer Assisted Radiology and Surgery, 2007, 2 : 135 - 142
  • [28] Segmentation of multiple organs in non-contrast 3D abdominal CT images
    Shimizu, Akinobu
    Ohno, Rena
    Ikegami, Takaya
    Kobatake, Hidefumi
    Nawano, Shigeru
    Smutek, Daniel
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2007, 2 (3-4) : 135 - 142
  • [29] MSA-Net: Multi-scale feature fusion network with enhanced attention module for 3D medical image segmentation
    Wang, Shuo
    Wang, Yuanhong
    Peng, Yanjun
    Chen, Xue
    COMPUTERS & ELECTRICAL ENGINEERING, 2024, 120
  • [30] Multi-modality self-attention aware deep network for 3D biomedical segmentation
    Jia, Xibin
    Liu, Yunfeng
    Yang, Zhenghan
    Yang, Dawei
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2020, 20 (Suppl 3)