MulA-nnUNet: A Multi-Attention Enhanced nnUNet Framework for 3D Abdominal Multi-Organs Segmentation

Cited by: 0
Authors
Ding, Jiashuo [1]
Ni, Wei [1]
Wan, Jiahui [2]
Deng, Xiaojun [1]
Wan, Lanjun [1]
Affiliations
[1] Hunan Univ Technol, Sch Comp Sci, Zhuzhou 412007, Peoples R China
[2] Hunan Agr Univ, Coll Mech & Elect Engn, Changsha 410125, Peoples R China
Source
IEEE ACCESS | 2024 / Vol. 12
Keywords
Image segmentation; Three-dimensional displays; Semantics; Decoding; Accuracy; Attention mechanisms; Tumors; Abdominal multi-organ image segmentation; attention mechanism; deep learning; nnUNet
DOI
10.1109/ACCESS.2024.3437652
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
In medical image segmentation, the nnUNet framework is highly regarded for its excellent performance and wide range of applications. However, the inductive biases of locality and weight sharing introduced by its stacked convolutional operations limit the network's ability to model long-range dependencies. Furthermore, when residual links are implemented, the substantial semantic discrepancy between the encoder's and the decoder's output feature maps limits the direct application of skip connections for feature fusion and gradient propagation, which affects the model's convergence speed and overall performance. This paper presents a novel framework, Multi-Attention nnUNet (MulA-nnUNet), which uses nnUNet as the foundational network structure and integrates two key attention mechanisms: large kernel convolutional attention (LKA) and pixel attention (PA). LKA is embedded within the deep encoder, preserving the effectiveness of shallow feature extraction while enhancing the network's ability to capture long-range spatial dependencies. At the same time, the PA module reduces the semantic discrepancy between the encoder's and the decoder's output feature maps, which improves feature fusion through the skip connections. Model complexity is reduced by replacing the standard convolutions in the encoder and decoder layers with depthwise separable convolutions (DS), which have fewer parameters. The effectiveness of the proposed framework is confirmed by a set of ablation experiments and by comparison experiments with current state-of-the-art models on the computed tomography (CT) subset of the multimodal abdominal multi-organ segmentation dataset (AMOS), which includes 500 CT scans: 350 for training, 75 for validation, and 75 for testing. Compared with the baseline model, MulA-nnUNet improves the mean Dice similarity coefficient (mDSC) by 1.1% and the mean intersection over union (mIoU) by 1.52%, while the baseline requires 5 times the floating point operations (FLOPs) and over 7 times the parameters (Params). Additionally, it demonstrates superior accuracy in segmenting organs such as the liver, stomach, aorta, and pancreas, thereby enhancing the accuracy of 3D abdominal multi-organ image segmentation.
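The two metrics reported in the abstract, mDSC and mIoU, can be illustrated with a minimal sketch. It uses the standard per-label definitions, Dice = 2|P∩T| / (|P| + |T|) and IoU = |P∩T| / |P∪T|, averaged over labels. The function names, the flat-list input, and the choice to skip labels absent from both prediction and ground truth are assumptions made for illustration; the abstract does not state the paper's exact averaging protocol.

```python
def dice_and_iou(pred, target, label):
    """Return (Dice, IoU) for one label, or None if the label is absent in both.

    pred and target are equal-length flat sequences of integer labels
    (e.g. a flattened 3D segmentation volume). Hypothetical helper.
    """
    if len(pred) != len(target):
        raise ValueError("pred and target must have the same length")
    pred_mask = [v == label for v in pred]
    target_mask = [v == label for v in target]
    intersection = sum(1 for p, t in zip(pred_mask, target_mask) if p and t)
    pred_size = sum(pred_mask)
    target_size = sum(target_mask)
    union = pred_size + target_size - intersection
    if union == 0:
        return None  # assumption: labels missing from both masks are skipped
    dice = 2 * intersection / (pred_size + target_size)
    iou = intersection / union
    return dice, iou


def mean_scores(pred, target, labels):
    """Average Dice and IoU over the given foreground labels (mDSC, mIoU)."""
    scores = [dice_and_iou(pred, target, label) for label in labels]
    scores = [s for s in scores if s is not None]
    if not scores:
        raise ValueError("none of the labels occur in pred or target")
    mdsc = sum(d for d, _ in scores) / len(scores)
    miou = sum(i for _, i in scores) / len(scores)
    return mdsc, miou


# Toy example: label 0 is background, labels 1 and 2 stand in for two organs.
pred = [1, 1, 0, 2, 2, 0]
target = [1, 0, 0, 2, 2, 2]
mdsc, miou = mean_scores(pred, target, labels=[1, 2])
```

Because Dice = 2·IoU / (1 + IoU) for any single label, Dice is never lower than IoU for that label, so a given change in segmentation quality generally shows up as different absolute gains in the two metrics.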
Pages: 106658-106671
Number of pages: 14