Multi-Modal Multi-Stream UNET Model for Liver Segmentation

被引:0
|
作者
Elghazy, Hagar Louye [1 ]
Fakhr, Mohamed Waleed [2 ]
机构
[1] Arab Acad Sci Technol & Maritime Transport, Coll Comp & Informat Technol, Comp Sci, Cairo, Egypt
[2] Arab Acad Sci Technol & Maritime Transport, Coll Engn & Technol, Comp Engn, Cairo, Egypt
关键词
medical images; UNET; dual stream; segmentation; NETWORKS;
D O I
10.1109/AIIOT52608.2021.9454216
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Computer segmentation of abdominal organs using CT and MRI images can benefit diagnosis, treatment, and workload management. In recent years, UNETs have been widely used in medical image segmentation for their precise accuracy. Most of the UNETs current solutions rely on the use of single data modality. Recently, it has been shown that learning from more than one modality at a time can significantly enhance the segmentation accuracy, however most of available multi-modal datasets are not large enough for training complex architectures. In this paper, we worked on a small dataset and proposed a multi-modal dual-stream UNET architecture that learns from unpaired MRI and CT image modalities to improve the segmentation accuracy on each individual one. We tested the practicality of the proposed architecture on Task 1 of the CHAOS segmentation challenge. Results showed that multi-modal/multi-stream learning improved accuracy over single modality learning and that using UNET in the dual stream was superior than using a standard FCN. A "Dice" score of 96.78 was achieved on CT images. To the best of our knowledge, this is one of the highest reported scores yet.
引用
收藏
页码:28 / 33
页数:6
相关论文
共 50 条
  • [21] Multi-stream inflation
    Li, Miao
    Wang, Yi
    JOURNAL OF COSMOLOGY AND ASTROPARTICLE PHYSICS, 2009, (07):
  • [22] Multi-task learning とmulti-stream monocular depth estimation using integrated model with multi-task learning and multi-stream
    Takamine, Michiru
    Endo, Satoshi
    Transactions of the Japanese Society for Artificial Intelligence, 2021, 36 (05): : 1 - 9
  • [23] Towards a multi-modal perceptual model
    Hollier, MP
    Voelcker, R
    BT TECHNOLOGY JOURNAL, 1997, 15 (04): : 162 - 171
  • [24] Multi-modal Background Model Initialization
    Bloisi, Domenico D.
    Grillo, Alfonso
    Pennisi, Andrea
    Iocchi, Luca
    Passaretti, Claudio
    NEW TRENDS IN IMAGE ANALYSIS AND PROCESSING - ICIAP 2015 WORKSHOPS, 2015, 9281 : 485 - 492
  • [25] A multi-modal approach to story segmentation for news video
    Chaisorn, L
    Chua, TS
    Lee, CH
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2003, 6 (02): : 187 - 208
  • [26] Unpaired multi-modal tumor segmentation with structure adaptation
    Zhou, Pei
    Chen, Houjin
    Li, Yanfeng
    Peng, Yahui
    APPLIED INTELLIGENCE, 2023, 53 (04) : 3639 - 3651
  • [27] Flexible Multi-modal Graph-Based Segmentation
    Sanberg, Willem P.
    Do, Luat
    de With, Peter H. N.
    ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS, ACIVS 2013, 2013, 8192 : 492 - 503
  • [28] Multi-Modal Glioblastoma Segmentation: Man versus Machine
    Porz, Nicole
    Bauer, Stefan
    Pica, Alessia
    Schucht, Philippe
    Beck, Juergen
    Verma, Rajeev Kumar
    Slotboom, Johannes
    Reyes, Mauricio
    Wiest, Roland
    PLOS ONE, 2014, 9 (05):
  • [29] A framework for unsupervised segmentation of multi-modal medical images
    El-Baz, Ayman
    Farag, Aly
    Ali, Asem
    Gimel'farb, Georgy
    Casanova, Manuel
    COMPUTER VISION APPROACHES TO MEDICAL IMAGE ANALYSIS, 2006, 4241 : 120 - 131
  • [30] A Multi-Modal Approach to Story Segmentation for News Video
    Lekha Chaisorn
    Tat-Seng Chua
    Chin-Hui Lee
    World Wide Web, 2003, 6 : 187 - 208