Medical Transformer: Gated Axial-Attention for Medical Image Segmentation

被引:822
|
作者
Valanarasu, Jeya Maria Jose [1 ]
Oza, Poojan [1 ]
Hacihaliloglu, Ilker [2 ]
Patel, Vishal M. [1 ]
机构
[1] Johns Hopkins Univ, Baltimore, MD 21218 USA
[2] Rutgers State Univ, New Brunswick, NJ USA
基金
美国国家科学基金会;
关键词
Transformers; Medical image segmentation; Self-attention;
D O I
10.1007/978-3-030-87193-2_4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Over the past decade, deep convolutional neural networks have been widely adopted for medical image segmentation and shown to achieve adequate performance. However, due to inherent inductive biases present in convolutional architectures, they lack understanding of long-range dependencies in the image. Recently proposed transformer-based architectures that leverage self-attention mechanism encode long-range dependencies and learn representations that are highly expressive. This motivates us to explore transformer-based solutions and study the feasibility of using transformer-based network architectures for medical image segmentation tasks. Majority of existing transformer-based network architectures proposed for vision applications require large-scale datasets to train properly. However, compared to the datasets for vision applications, in medical imaging the number of data samples is relatively low, making it difficult to efficiently train transformers for medical imaging applications. To this end, we propose a gated axial-attention model which extends the existing architectures by introducing an additional control mechanism in the self-attention module. Furthermore, to train the model effectively on medical images, we propose a Local-Global training strategy (LoGo) which further improves the performance. Specifically, we operate on the whole image and patches to learn global and local features, respectively. The proposed Medical Transformer (MedT) is evaluated on three different medical image segmentation datasets and it is shown that it achieves better performance than the convolutional and other related transformer-based architectures. Code: https://github.com/jeya-maria-jose/Medical-Transformer
引用
收藏
页码:36 / 46
页数:11
相关论文
共 50 条
  • [21] ATFormer: Advanced transformer for medical image segmentation
    Chen, Yong
    Lu, Xuesong
    Xie, Oinlan
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 85
  • [22] The Fully Convolutional Transformer for Medical Image Segmentation
    Tragakis, Athanasios
    Kaul, Chaitanya
    Murray-Smith, Roderick
    Husmeier, Dirk
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 3649 - 3658
  • [23] Automatic Medical Image Segmentation with Vision Transformer
    Zhang, Jie
    Li, Fan
    Zhang, Xin
    Wang, Huaijun
    Hei, Xinhong
    APPLIED SCIENCES-BASEL, 2024, 14 (07):
  • [24] Coformer: Collaborative Transformer for Medical Image Segmentation
    Gao, Yufei
    Zhang, Shichao
    Zhang, Dandan
    Shi, Yucheng
    Zhao, Guohua
    Shi, Lei
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT III, ICIC 2024, 2024, 14864 : 240 - 250
  • [25] Region Attention Transformer for Medical Image Restoration
    Yang, Zhiwen
    Chen, Haowei
    Qian, Ziniu
    Zhou, Yang
    Zhang, Hui
    Zhao, Dan
    Wei, Bingzheng
    Xu, Yan
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT VII, 2024, 15007 : 603 - 613
  • [26] A study of attention information from transformer layers in hybrid medical image segmentation networks
    Hasany, Syed Nouman
    Petitjean, Caroline
    Meriaudeau, Fabrice
    MEDICAL IMAGING 2023, 2023, 12464
  • [27] A combined deformable model and medical transformer algorithm for medical image segmentation
    Tang, Zhixian
    Duan, Jintao
    Sun, Yanming
    Zeng, Yanan
    Zhang, Yile
    Yao, Xufeng
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2023, 61 (01) : 129 - 137
  • [28] A combined deformable model and medical transformer algorithm for medical image segmentation
    Zhixian Tang
    Jintao Duan
    Yanming Sun
    Yanan Zeng
    Yile Zhang
    Xufeng Yao
    Medical & Biological Engineering & Computing, 2023, 61 : 129 - 137
  • [29] Evolutionary Attention Network for Medical Image Segmentation
    Hassanzadeh, Tahereh
    Essam, Daryl
    Sarker, Ruhul
    2020 DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA), 2020,
  • [30] Advances in attention mechanisms for medical image segmentation
    Zhang, Jianpeng
    Chen, Xiaomin
    Yang, Bing
    Guan, Qingbiao
    Chen, Qi
    Chen, Jian
    Wu, Qi
    Xie, Yutong
    Xia, Yong
    COMPUTER SCIENCE REVIEW, 2025, 56