CTRANSNET: CONVOLUTIONAL NEURAL NETWORK COMBINED WITH TRANSFORMER FOR MEDICAL IMAGE SEGMENTATION

被引:2
|
作者
Zhang, Zhixin [1 ]
Jiang, Shuhao [1 ]
Pan, Xuhua [1 ]
机构
[1] Tianjin Univ Commerce, Informat Engn Dept, Tianjin 300134, Peoples R China
关键词
Medical image segmentation; deep learning; attention mechanism; ATTENTION; CNN;
D O I
10.31577/cai20232392
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Transformer has been widely used for many tasks in NLP before, but there is still much room to explore the application of the Transformer to the image domain. In this paper, we propose a simple and efficient hybrid Transformer framework, CTransNet, which combines self-attention and CNN to improve medi-cal image segmentation performance. Capturing long-range dependencies at differ-ent scales. To this end, this paper proposes an effective self-attention mechanism incorporating relative position information encoding, which can reduce the time complexity of self-attention from O(n2) to O(n), and a new self-attention decoder that can recover fine-grained features in encoder from skip connection. This paper aims to address the current dilemma of Transformer applications: i.e., the need to learn induction bias from large amounts of training data. The hybrid layer in CTransNet allows the Transformer to be initialized as a CNN without pre-training. We have evaluated the performance of CTransNet on several medical segmentation datasets. CTransNet shows superior segmentation performance, robustness, and great promise for generalization to other medical image segmentation tasks.
引用
收藏
页码:392 / 410
页数:19
相关论文
共 50 条
  • [21] STransFuse: Fusing Swin Transformer and Convolutional Neural Network for Remote Sensing Image Semantic Segmentation
    Gao, Liang
    Liu, Hui
    Yang, Minhang
    Chen, Long
    Wan, Yaling
    Xiao, Zhengqing
    Qian, Yurong
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 (14) : 10990 - 11003
  • [22] DECTNet: Dual Encoder Network combined convolution and Transformer architecture for medical image segmentation
    Li, Boliang
    Xu, Yaming
    Wang, Yan
    Zhang, Bo
    PLOS ONE, 2024, 19 (04):
  • [23] CT image segmentation of bone for medical additive manufacturing using a convolutional neural network
    Minnema, Jordi
    van Eijnatten, Maureen
    Kouw, Wouter
    Diblen, Faruk
    Mendrik, Adrienne
    Wolff, Jan
    COMPUTERS IN BIOLOGY AND MEDICINE, 2018, 103 : 130 - 139
  • [24] A combined deformable model and medical transformer algorithm for medical image segmentation
    Tang, Zhixian
    Duan, Jintao
    Sun, Yanming
    Zeng, Yanan
    Zhang, Yile
    Yao, Xufeng
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2023, 61 (01) : 129 - 137
  • [25] AdaResU-Net: Multiobjective adaptive convolutional neural network for medical image segmentation
    Baldeon-Calisto, Maria
    Lai-Yuen, Susana K.
    NEUROCOMPUTING, 2020, 392 : 325 - 340
  • [26] A combined deformable model and medical transformer algorithm for medical image segmentation
    Zhixian Tang
    Jintao Duan
    Yanming Sun
    Yanan Zeng
    Yile Zhang
    Xufeng Yao
    Medical & Biological Engineering & Computing, 2023, 61 : 129 - 137
  • [27] DRU-NET: AN EFFICIENT DEEP CONVOLUTIONAL NEURAL NETWORK FOR MEDICAL IMAGE SEGMENTATION
    Jafari, Mina
    Auer, Dorothee
    Francis, Susan
    Garibaldi, Jonathan
    Chen, Xin
    2020 IEEE 17TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI 2020), 2020, : 1144 - 1148
  • [28] Reviewing 3D convolutional neural network approaches for medical image segmentation
    Ilesanmi, Ademola E.
    Ilesanmi, Taiwo O.
    Ajayi, Babatunde O.
    HELIYON, 2024, 10 (06)
  • [29] Medical Image Classification with Convolutional Neural Network
    Li, Qing
    Cai, Weidong
    Wang, Xiaogang
    Zhou, Yun
    Feng, David Dagan
    Chen, Mei
    2014 13TH INTERNATIONAL CONFERENCE ON CONTROL AUTOMATION ROBOTICS & VISION (ICARCV), 2014, : 844 - 848
  • [30] Analysis of Convolutional Neural Network for Fundus Image Segmentation
    Shirokanev, A. S.
    Ilyasova, N. Yu
    Demin, N. S.
    2019 4TH INTERNATIONAL CONFERENCE ON COMMUNICATION, IMAGE AND SIGNAL PROCESSING (CCISP 2019), 2020, 1438