Hierarchical Decoder with Parallel Transformer and CNN for Medical Image Segmentation

被引:0
|
作者
Li, Shijie [1 ]
Gong, Yu [1 ]
Xiang, Qingyuan [1 ]
Li, Zheng [1 ,2 ]
机构
[1] Sichuan Univ, Coll Comp Sci, Chengdu 610065, Peoples R China
[2] Sichuan Univ, Tianfu Engn Oriented Numercial Simulat & Software, Chengdu 610207, Peoples R China
基金
中国国家自然科学基金;
关键词
Medical image segmentation; Hierarchical decoder; Attention mechanism; PLUS PLUS;
D O I
10.1007/978-981-97-8496-7_10
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the success of Transformers, hybrid Transformer and CNN methods gain considerable popularity in medical image segmentation. These methods utilize a hybrid architecture that combines Transformers and CNNs to fuse global and local information, supplemented by a pyramid structure to facilitate multi-scale interaction. However, they encounter two primary limitations: (i) Transformer struggle to capture complete global information due to the sliding window nature of the convolutional operator, and (ii) the pyramid structure within single decoder fails to provide sufficient multi-scale interaction necessary for restoring detailed features at higher levels. In this paper, we introduce the Hierarchical Decoder with Parallel Transformer and CNN (HiPar), a novel architecture designed to address these limitations. Firstly, we present a parallel structure of Transformer and CNN to maximize the capture of both global and local features. Subsequently, we propose a hierarchical decoder to model multi-scale information and progressively restore spatial details. Additionally, we incorporate lightweight components to enhance the efficiency of feature representation. Extensive experiments demonstrate that our HiPar achieves state-of-the-art results on three popular medical image segmentation benchmarks: Synapse, ACDC and GlaS.
引用
收藏
页码:133 / 147
页数:15
相关论文
共 50 条
  • [41] A NEW CNN OSCILLATOR MODEL FOR PARALLEL IMAGE SEGMENTATION
    Strzelecki, Michal
    Kowalski, Jacek
    Kim, Hyongsuk
    Ko, Soohong
    INTERNATIONAL JOURNAL OF BIFURCATION AND CHAOS, 2008, 18 (07): : 1999 - 2015
  • [42] HybridCTrm: Bridging CNN and Transformer for Multimodal Brain Image Segmentation
    Sun, Qixuan
    Fang, Nianhua
    Liu, Zhuo
    Zhao, Liang
    Wen, Youpeng
    Lin, Hongxiang
    JOURNAL OF HEALTHCARE ENGINEERING, 2021, 2021
  • [43] CNN and Transformer Fusion for Remote Sensing Image Semantic Segmentation
    Chen, Xin
    Li, Dongfen
    Liu, Mingzhe
    Jia, Jiaru
    REMOTE SENSING, 2023, 15 (18)
  • [44] Segmentation Method of Magnetoelectric Brain Image Based on the Transformer and the CNN
    Liu, Xiaoli
    Cheng, Xiaorong
    INFORMATION, 2022, 13 (10)
  • [45] Hybrid Transformer and Convolution for Medical Image Segmentation
    Wang, Fan
    Wang, Bo
    2022 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, COMPUTER VISION AND MACHINE LEARNING (ICICML), 2022, : 156 - 159
  • [46] Medical Image Segmentation Using Transformer Networks
    Karimi, Davood
    Dou, Haoran
    Gholipour, Ali
    IEEE ACCESS, 2022, 10 : 29322 - 29332
  • [47] ATFormer: Advanced transformer for medical image segmentation
    Chen, Yong
    Lu, Xuesong
    Xie, Oinlan
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 85
  • [48] The Fully Convolutional Transformer for Medical Image Segmentation
    Tragakis, Athanasios
    Kaul, Chaitanya
    Murray-Smith, Roderick
    Husmeier, Dirk
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 3649 - 3658
  • [49] Automatic Medical Image Segmentation with Vision Transformer
    Zhang, Jie
    Li, Fan
    Zhang, Xin
    Wang, Huaijun
    Hei, Xinhong
    APPLIED SCIENCES-BASEL, 2024, 14 (07):
  • [50] Coformer: Collaborative Transformer for Medical Image Segmentation
    Gao, Yufei
    Zhang, Shichao
    Zhang, Dandan
    Shi, Yucheng
    Zhao, Guohua
    Shi, Lei
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT III, ICIC 2024, 2024, 14864 : 240 - 250