DCCAT: Dual-Coordinate Cross-Attention Transformer for thrombus segmentation on coronary OCT

被引:2
|
作者
Chu, Miao [1 ,2 ,3 ]
De Maria, Giovanni Luigi [2 ,3 ]
Dai, Ruobing [1 ]
Benenati, Stefano [2 ,3 ,5 ]
Yu, Wei [1 ]
Zhong, Jiaxin [1 ,6 ]
Kotronias, Rafail [2 ,3 ,4 ]
Walsh, Jason [2 ,3 ,4 ]
Andreaggi, Stefano [2 ,7 ]
Zuccarelli, Vittorio [2 ]
Chai, Jason [2 ,3 ]
Channon, Keith [2 ,3 ,4 ]
Banning, Adrian [2 ,3 ,4 ]
Tu, Shengxian [1 ,3 ]
机构
[1] Shanghai Jiao Tong Univ, Biomed Instrument Inst, Sch Biomed Engn, Shanghai, Peoples R China
[2] Oxford Univ Hosp NHS Trust, Oxford Heart Ctr, Oxford, England
[3] Univ Oxford, Radcliffe Dept Med, Div Cardiovasc Med, Oxford, England
[4] Oxford Biomed Res Ctr, Natl Inst Hlth Res, Oxford, England
[5] Univ Genoa, Genoa, Italy
[6] Fujian Med Univ, Union Hosp, Dept Cardiol, Fuzhou, Fujian, Peoples R China
[7] Univ Verona, Dept Med, Div Cardiol, Verona, Italy
基金
中国国家自然科学基金;
关键词
Acute coronary syndromes; Optical coherence tomography; Thrombus segmentation; Cross-attention; OPTICAL COHERENCE TOMOGRAPHY; PLAQUE EROSION; NEURAL-NETWORK; DIAGNOSIS;
D O I
10.1016/j.media.2024.103265
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Acute coronary syndromes (ACS) are one of the leading causes of mortality worldwide, with atherosclerotic plaque rupture and subsequent thrombus formation as the main underlying substrate. Thrombus burden evaluation is important for tailoring treatment therapy and predicting prognosis. Coronary optical coherence tomography (OCT) enables in-vivo visualization of thrombus that cannot otherwise be achieved by other image modalities. However, automatic quantification of thrombus on OCT has not been implemented. The main challenges are due to the variation in location, size and irregularities of thrombus in addition to the small data set. In this paper, we propose a novel dual-coordinate cross-attention transformer network, termed DCCAT, to overcome the above challenges and achieve the first automatic segmentation of thrombus on OCT. Imaging features from both Cartesian and polar coordinates are encoded and fused based on long-range correspondence via multi-head cross-attention mechanism. The dual-coordinate cross-attention block is hierarchically stacked amid convolutional layers at multiple levels, allowing comprehensive feature enhancement. The model was developed based on 5,649 OCT frames from 339 patients and tested using independent external OCT data from 548 frames of 52 patients. DCCAT achieved Dice similarity score (DSC) of 0.706 in segmenting thrombus, which is significantly higher than the CNN-based (0.656) and Transformer-based (0.584) models. We prove that the additional input of polar image not only leverages discriminative features from another coordinate but also improves model robustness for geometrical transformation.Experiment results show that DCCAT achieves competitive performance with only 10% of the total data, highlighting its data efficiency. The proposed dual- coordinate cross-attention design can be easily integrated into other developed Transformer models to boost performance.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] DuDoCAF: Dual-Domain Cross-Attention Fusion with Recurrent Transformer for Fast Multi-contrast MR Imaging
    Lyu, Jun
    Sui, Bin
    Wang, Chengyan
    Tian, Yapeng
    Dou, Qi
    Qin, Jing
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT VI, 2022, 13436 : 474 - 484
  • [42] A joint object detection and semantic segmentation model with cross-attention and inner-attention mechanisms
    Nan, Zhixiong
    Peng, Jizhi
    Jiang, Jingjing
    Chen, Hui
    Yang, Ben
    Xin, Jingmin
    Zheng, Nanning
    NEUROCOMPUTING, 2021, 463 : 212 - 225
  • [43] RGB-Sonar Tracking Benchmark and Spatial Cross-Attention Transformer Tracker
    Li, Yunfeng
    Wang, Bo
    Sun, Jiuran
    Wu, Xueyi
    Li, Ye
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (03) : 2260 - 2275
  • [44] Cross-attention Based Text-image Transformer for Visual Question Answering
    Rezapour M.
    Recent Advances in Computer Science and Communications, 2024, 17 (04) : 72 - 78
  • [45] Defect Detection Algorithm for Battery Cell Casings Based on Dual-Coordinate Attention and Small Object Loss Feedback
    Li, Tianjian
    Ren, Jiale
    Yang, Qingping
    Chen, Long
    Sun, Xizhi
    PROCESSES, 2024, 12 (03)
  • [46] Graph-to-Text Generation with Bidirectional Dual Cross-Attention and Concatenation
    Jimale, Elias Lemuye
    Chen, Wenyu
    Al-antari, Mugahed A.
    Gu, Yeong Hyeon
    Agbesi, Victor Kwaku
    Feroze, Wasif
    Akmel, Feidu
    Assefa, Juhar Mohammed
    Shahzad, Ali
    MATHEMATICS, 2025, 13 (06)
  • [47] A deep supervised cross-attention strategy for ischemic stroke segmentation in MRI studies
    Gomez, Santiago
    Mantilla, Daniel
    Rangel, Edgar
    Ortiz, Andres
    Vera, Daniela D.
    Martinez, Fabio
    BIOMEDICAL PHYSICS & ENGINEERING EXPRESS, 2023, 9 (03)
  • [48] CsAGP: Detecting Alzheimer's disease from multimodal images via dual-transformer with cross-attention and graph pooling
    Tang, Chaosheng
    Wei, Mingyang
    Sun, Junding
    Wang, Shuihua
    Zhang, Yudong
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2023, 35 (07)
  • [49] Learning a cross-attention whole-brain segmentation for PET/MR images
    Li, Wenbo
    Huang, Zhenxing
    Zhao, Wenjie
    Liu, Haizhou
    Wu, Yaping
    Yuan, Jianmin
    Yang, Yang
    Zhang, Yan
    Yang, Yongfeng
    Zheng, Hairong
    Liang, Dong
    Wang, Meiyun
    Hu, Zhanli
    JOURNAL OF NUCLEAR MEDICINE, 2024, 65
  • [50] Progressive Cross-Attention Network for Flood Segmentation Using Multispectral Satellite Imagery
    Feliren, Vicky
    Khikmah, Fithrothul
    Bhaswara, Irfan Dwiki
    Nasution, Bahrul I.
    Lechner, Alex M.
    Saputra, Muhamad Risqi U.
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2025, 22