DuAT: Dual-Aggregation Transformer Network for Medical Image Segmentation

被引:26
|
作者
Tang, Feilong [1 ]
Xu, Zhongxing [1 ]
Huang, Qiming [1 ]
Wang, Jinfeng [1 ]
Hou, Xianxu [1 ]
Su, Jionglong [1 ]
Liu, Jingxin [1 ]
机构
[1] Xian Jiaotong Liverpool Univ, Sch AI & Adv Comp, Suzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Polyp segmentation; Dual decoder; Vision Transformers; CANCER;
D O I
10.1007/978-981-99-8469-5_27
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Transformer-based models have been widely demonstrated to be successful in computer vision tasks by modeling long-range dependencies and capturing global representations. However, they are often dominated by features of large patterns leading to the loss of local details (e.g., boundaries and small objects), which are critical in medical image segmentation. To alleviate this problem, we propose a Dual-Aggregation Transformer Network called DuAT, which is characterized by two innovative designs, namely, the Global-to-Local Spatial Aggregation (GLSA) and Selective Boundary Aggregation (SBA) modules. The GLSA has the ability to aggregate and represent both global and local spatial features, which are beneficial for locating large and small objects, respectively. The SBA module aggregates the boundary characteristic from low-level features and semantic information from high-level features for better-preserving boundary details and locating the re-calibration objects. Extensive experiments in six benchmark datasets demonstrate that our proposed model outperforms state-of-the-art methods in the segmentation of skin lesion images and polyps in colonoscopy images. In addition, our approach is more robust than existing methods in various challenging situations, such as small object segmentation and ambiguous object boundaries. The project is available at https://github.com/Barrett-python/DuAT.
引用
收藏
页码:343 / 356
页数:14
相关论文
共 50 条
  • [31] Encoder Activation Diffusion and Decoder Transformer Fusion Network for Medical Image Segmentation
    Li, Xueru
    Xu, Guoxia
    Zhao, Meng
    Shi, Fan
    Wang, Hao
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT XIII, 2024, 14437 : 185 - 197
  • [32] Optimization of U-shaped pure transformer medical image segmentation network
    Dan, Yongping
    Jin, Weishou
    Wang, Zhida
    Sun, Changhao
    PEERJ COMPUTER SCIENCE, 2023, 9
  • [33] LET-Net: locally enhanced transformer network for medical image segmentation
    Na Ta
    Haipeng Chen
    Xianzhu Liu
    Nuo Jin
    Multimedia Systems, 2023, 29 (6) : 3847 - 3861
  • [34] LET-Net: locally enhanced transformer network for medical image segmentation
    Ta, Na
    Chen, Haipeng
    Liu, Xianzhu
    Jin, Nuo
    MULTIMEDIA SYSTEMS, 2023, 29 (06) : 3847 - 3861
  • [35] Medical Image Segmentation Using Transformer Networks
    Karimi, Davood
    Dou, Haoran
    Gholipour, Ali
    IEEE ACCESS, 2022, 10 : 29322 - 29332
  • [36] Hybrid Transformer and Convolution for Medical Image Segmentation
    Wang, Fan
    Wang, Bo
    2022 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, COMPUTER VISION AND MACHINE LEARNING (ICICML), 2022, : 156 - 159
  • [37] ATFormer: Advanced transformer for medical image segmentation
    Chen, Yong
    Lu, Xuesong
    Xie, Oinlan
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 85
  • [38] Alternate encoder and dual decoder CNN-Transformer networks for medical image segmentation
    Zhang, Lin
    Guo, Xinyu
    Sun, Hongkun
    Wang, Weigang
    Yao, Liwei
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [39] RT-Unet: An advanced network based on residual network and transformer for medical image segmentation
    Li, Bo
    Liu, Sikai
    Wu, Fei
    Li, GuangHui
    Zhong, Meiling
    Guan, Xiaohui
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (11) : 8565 - 8582
  • [40] The Fully Convolutional Transformer for Medical Image Segmentation
    Tragakis, Athanasios
    Kaul, Chaitanya
    Murray-Smith, Roderick
    Husmeier, Dirk
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 3649 - 3658