Prostate lesion segmentation based on a 3D end-to-end convolution neural network with deep multi-scale attention

Cited by: 11
Authors
Song, Enmin [1]
Long, Jiaosong [1]
Ma, Guangzhi [1]
Liu, Hong [1]
Hung, Chih-Cheng [2]
Jin, Renchao [1]
Wang, Peijun [3]
Wang, Wei [3]
Affiliations
[1] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Wuhan, Peoples R China
[2] Kennesaw State Univ, Coll Comp & Software Engn, Atlanta, GA USA
[3] Tongji Univ, Tongji Hosp, Sch Medicine, Dept Radiol, Shanghai 200065, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Mp-MRI; Prostate cancer segmentation; Convolution neural network; Attention; COMPUTER-AIDED DIAGNOSIS; SUPPORT VECTOR MACHINES; GLEASON SCORE; MR-IMAGES; CANCER;
DOI
10.1016/j.mri.2023.01.015
Chinese Library Classification number
R8 [Special Medicine]; R445 [Diagnostic Imaging];
Discipline classification code
1002 ; 100207 ; 1009 ;
Abstract
Prostate cancer is one of the deadliest cancers in humans. Prostate lesion segmentation is an important step toward better diagnosis, but progress has been slow because prostate lesions are small, irregular in shape, and blurred in contour. Automatic prostate lesion segmentation from multi-parametric MRI (mp-MRI) is therefore both clinically significant and challenging. Most existing multi-step segmentation methods based on voxel-level classification are time-consuming and may introduce errors at each step, which then accumulate. To reduce computation time, exploit richer 3D spatial features, and fuse the multi-level contextual information of mp-MRI, we present an automatic segmentation method in which all steps are optimized jointly as a single end-to-end convolutional neural network. The proposed network, DMSA-V-Net, consists of two parts: (1) a 3D V-Net backbone, which is the first attempt to employ a 3D convolutional neural network for clinically significant (CS) prostate lesion segmentation; and (2) a deep multi-scale attention mechanism introduced into the 3D V-Net, which focuses strongly on the region of interest (ROI) while suppressing redundant background. An advantage of this attention is that it adaptively re-aligns the context information between the feature maps at different scales and the high-level saliency maps. We performed experiments using five-fold cross-validation on data from 97 patients. The Dice coefficient and sensitivity are 0.7014 and 0.8652, respectively, which indicates that our segmentation approach is more accurate than the other methods compared.
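The two metrics reported in the abstract, the Dice coefficient and sensitivity, can be illustrated with a minimal sketch. This is not the authors' evaluation code: the function names, the use of flattened 0/1 voxel labels, and the convention of returning 1.0 when the denominator is empty are all assumptions made for illustration.

```python
from typing import Sequence, Tuple


def confusion_counts(pred: Sequence[int], truth: Sequence[int]) -> Tuple[int, int, int]:
    """Count true positives, false positives, and false negatives.

    `pred` and `truth` are flattened binary masks (1 = lesion voxel,
    0 = background). Flattening a 3D volume first is an assumption of
    this sketch, not something stated in the abstract.
    """
    if len(pred) != len(truth):
        raise ValueError("pred and truth must have the same number of voxels")
    tp = fp = fn = 0
    for p, t in zip(pred, truth):
        if p and t:
            tp += 1
        elif p and not t:
            fp += 1
        elif t and not p:
            fn += 1
    return tp, fp, fn


def dice(pred: Sequence[int], truth: Sequence[int]) -> float:
    """Dice coefficient: 2*TP / (2*TP + FP + FN)."""
    tp, fp, fn = confusion_counts(pred, truth)
    denom = 2 * tp + fp + fn
    # Assumed convention: two empty masks count as a perfect match.
    return 1.0 if denom == 0 else 2 * tp / denom


def sensitivity(pred: Sequence[int], truth: Sequence[int]) -> float:
    """Sensitivity (recall): TP / (TP + FN)."""
    tp, _, fn = confusion_counts(pred, truth)
    denom = tp + fn
    # Assumed convention: no lesion voxels in the ground truth gives 1.0.
    return 1.0 if denom == 0 else tp / denom


if __name__ == "__main__":
    # Toy example with six voxels: TP = 2, FP = 1, FN = 1.
    pred = [1, 1, 0, 0, 1, 0]
    truth = [1, 0, 0, 0, 1, 1]
    print(f"Dice = {dice(pred, truth):.4f}")
    print(f"Sensitivity = {sensitivity(pred, truth):.4f}")
```

Both metrics ignore true-negative background voxels, which is one reason they are commonly used for small lesions that occupy only a small fraction of the volume.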
Pages: 98-109
Number of pages: 12