D-TrAttUnet: Toward hybrid CNN-transformer architecture for generic and subtle segmentation in medical images

被引:1
|
作者
Bougourzi F. [1 ]
Dornaika F. [3 ,4 ]
Distante C. [2 ]
Taleb-Ahmed A. [5 ]
机构
[1] Junia, UMR 8520, CNRS, Centrale Lille, University of Polytechnique Hauts-de-France, Lille
[2] Institute of Applied Sciences and Intelligent Systems, National Research Council of Italy, Lecce
[3] University of the Basque Country UPV/EHU, San Sebastian
[4] IKERBASQUE, Basque Foundation for Science, Bilbao
[5] Université Polytechnique Hauts-de-France, Université de Lille, CNRS, Valenciennes, Hauts-de-France
关键词
Bone Metastasis; Convolutional Neural Network; Covid-19; Deep learning; Segmentation; Transformer; Unet;
D O I
10.1016/j.compbiomed.2024.108590
中图分类号
学科分类号
摘要
Over the past two decades, machine analysis of medical imaging has advanced rapidly, opening up significant potential for several important medical applications. As complicated diseases increase and the number of cases rises, the role of machine-based imaging analysis has become indispensable. It serves as both a tool and an assistant to medical experts, providing valuable insights and guidance. A particularly challenging task in this area is lesion segmentation, a task that is challenging even for experienced radiologists. The complexity of this task highlights the urgent need for robust machine learning approaches to support medical staff. In response, we present our novel solution: the D-TrAttUnet architecture. This framework is based on the observation that different diseases often target specific organs. Our architecture includes an encoder–decoder structure with a composite Transformer-CNN encoder and dual decoders. The encoder includes two paths: the Transformer path and the Encoders Fusion Module path. The Dual-Decoder configuration uses two identical decoders, each with attention gates. This allows the model to simultaneously segment lesions and organs and integrate their segmentation losses. To validate our approach, we performed evaluations on the Covid-19 and Bone Metastasis segmentation tasks. We also investigated the adaptability of the model by testing it without the second decoder in the segmentation of glands and nuclei. The results confirmed the superiority of our approach, especially in Covid-19 infections and the segmentation of bone metastases. In addition, the hybrid encoder showed exceptional performance in the segmentation of glands and nuclei, solidifying its role in modern medical image analysis. © 2024 The Author(s)
引用
收藏
相关论文
共 50 条
  • [41] Weak Appearance Aware Pipeline Leak Detection based on CNN-Transformer Hybrid Architecture
    Zhang, Bulin
    Yuan, Haiwen
    Ge, Jie
    Cheng, Li
    Li, Xuan
    Xiao, Changshi
    IEEE Transactions on Instrumentation and Measurement, 2024,
  • [42] Semantic segmentation of terrace image regions based on lightweight CNN-Transformer hybrid networks
    Liu X.
    Yi S.
    Li L.
    Cheng X.
    Wang C.
    Nongye Gongcheng Xuebao/Transactions of the Chinese Society of Agricultural Engineering, 2023, 39 (13): : 171 - 181
  • [43] Add-Vit: CNN-Transformer Hybrid Architecture for Small Data Paradigm Processing
    Chen, Jinhui
    Wu, Peng
    Zhang, Xiaoming
    Xu, Renjie
    Liang, Jia
    NEURAL PROCESSING LETTERS, 2024, 56 (03)
  • [44] CC-TransXNet: a hybrid CNN-transformer network for automatic segmentation of optic cup and optic disk from fundus images
    Yuan, Zhongzheng
    Wang, Jinke
    Xu, Yukun
    Xu, Min
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2024, : 1027 - 1044
  • [45] SEGTRANSVAE: HYBRID CNN - TRANSFORMER WITH REGULARIZATION FOR MEDICAL IMAGE SEGMENTATION
    Quan-Dung Pham
    Hai Nguyen-Truong
    Nam Nguyen Phuong
    Nguyen, Khoa N. A.
    Nguyen, Chanh D. T.
    Bui, Trung
    Truong, Steven Q. H.
    2022 IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (IEEE ISBI 2022), 2022,
  • [46] UTNet: A Hybrid Transformer Architecture for Medical Image Segmentation
    Gao, Yunhe
    Zhou, Mu
    Metaxas, Dimitris N.
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT III, 2021, 12903 : 61 - 71
  • [47] LEFORMER: A HYBRID CNN-TRANSFORMER ARCHITECTURE FOR ACCURATE LAKE EXTRACTION FROM REMOTE SENSING IMAGERY
    Chen, Ben
    Zou, Xuechao
    Zhang, Yu
    Li, Jiayu
    Li, Kai
    Xing, Junliang
    Tao, Pin
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 5710 - 5714
  • [48] Agricultural innovation through deep learning: a hybrid CNN-Transformer architecture for crop disease classification
    Padshetty, Smitha
    Umashetty, Ambika
    JOURNAL OF SPATIAL SCIENCE, 2024,
  • [49] Hybrid CNN-Transformer Architecture for Efficient Large-Scale Video Snapshot Compressive Imaging
    Cao, Miao
    Wang, Lishun
    Zhu, Mingyu
    Yuan, Xin
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (10) : 4521 - 4540
  • [50] Hybrid 3D Medical Image Segmentation Using CNN and Frequency Transformer Fusion
    Labbihi, Ismayl
    Meslouhi, Othmane El
    Elassad, Zouhair Elamrani Abou
    Benaddy, Mohamed
    Kardouchi, Mustapha
    Akhloufi, Moulay
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2024,