DCCAT: Dual-Coordinate Cross-Attention Transformer for thrombus segmentation on coronary OCT

被引:2
|
作者
Chu, Miao [1 ,2 ,3 ]
De Maria, Giovanni Luigi [2 ,3 ]
Dai, Ruobing [1 ]
Benenati, Stefano [2 ,3 ,5 ]
Yu, Wei [1 ]
Zhong, Jiaxin [1 ,6 ]
Kotronias, Rafail [2 ,3 ,4 ]
Walsh, Jason [2 ,3 ,4 ]
Andreaggi, Stefano [2 ,7 ]
Zuccarelli, Vittorio [2 ]
Chai, Jason [2 ,3 ]
Channon, Keith [2 ,3 ,4 ]
Banning, Adrian [2 ,3 ,4 ]
Tu, Shengxian [1 ,3 ]
机构
[1] Shanghai Jiao Tong Univ, Biomed Instrument Inst, Sch Biomed Engn, Shanghai, Peoples R China
[2] Oxford Univ Hosp NHS Trust, Oxford Heart Ctr, Oxford, England
[3] Univ Oxford, Radcliffe Dept Med, Div Cardiovasc Med, Oxford, England
[4] Oxford Biomed Res Ctr, Natl Inst Hlth Res, Oxford, England
[5] Univ Genoa, Genoa, Italy
[6] Fujian Med Univ, Union Hosp, Dept Cardiol, Fuzhou, Fujian, Peoples R China
[7] Univ Verona, Dept Med, Div Cardiol, Verona, Italy
基金
中国国家自然科学基金;
关键词
Acute coronary syndromes; Optical coherence tomography; Thrombus segmentation; Cross-attention; OPTICAL COHERENCE TOMOGRAPHY; PLAQUE EROSION; NEURAL-NETWORK; DIAGNOSIS;
D O I
10.1016/j.media.2024.103265
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Acute coronary syndromes (ACS) are one of the leading causes of mortality worldwide, with atherosclerotic plaque rupture and subsequent thrombus formation as the main underlying substrate. Thrombus burden evaluation is important for tailoring treatment therapy and predicting prognosis. Coronary optical coherence tomography (OCT) enables in-vivo visualization of thrombus that cannot otherwise be achieved by other image modalities. However, automatic quantification of thrombus on OCT has not been implemented. The main challenges are due to the variation in location, size and irregularities of thrombus in addition to the small data set. In this paper, we propose a novel dual-coordinate cross-attention transformer network, termed DCCAT, to overcome the above challenges and achieve the first automatic segmentation of thrombus on OCT. Imaging features from both Cartesian and polar coordinates are encoded and fused based on long-range correspondence via multi-head cross-attention mechanism. The dual-coordinate cross-attention block is hierarchically stacked amid convolutional layers at multiple levels, allowing comprehensive feature enhancement. The model was developed based on 5,649 OCT frames from 339 patients and tested using independent external OCT data from 548 frames of 52 patients. DCCAT achieved Dice similarity score (DSC) of 0.706 in segmenting thrombus, which is significantly higher than the CNN-based (0.656) and Transformer-based (0.584) models. We prove that the additional input of polar image not only leverages discriminative features from another coordinate but also improves model robustness for geometrical transformation.Experiment results show that DCCAT achieves competitive performance with only 10% of the total data, highlighting its data efficiency. The proposed dual- coordinate cross-attention design can be easily integrated into other developed Transformer models to boost performance.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Optimization-Inspired Cross-Attention Transformer for Compressive Sensing
    Song, Jiechong
    Mou, Chong
    Wang, Shiqi
    Ma, Siwei
    Zhang, Jian
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 6174 - 6184
  • [22] DcTr: Noise-robust point cloud completion by dual-channel transformer with cross-attention
    Fei, Ben
    Yang, Weidong
    Ma, Lipeng
    Chen, Wen-Ming
    PATTERN RECOGNITION, 2023, 133
  • [23] CrossFormer: Multi-scale cross-attention for polyp segmentation
    Chen, Lifang
    Ge, Hongze
    Li, Jiawei
    IET IMAGE PROCESSING, 2023, 17 (12) : 3441 - 3452
  • [24] Prototypical Cross-Attention Networks for Multiple Object Tracking and Segmentation
    Ke, Lei
    Li, Xia
    Danelljan, Martin
    Tai, Yu-Wing
    Tang, Chi-Keung
    Yu, Fisher
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [25] Spatial-Spectral Transformer With Cross-Attention for Hyperspectral Image Classification
    Peng, Yishu
    Zhang, Yuwen
    Tu, Bing
    Li, Qianming
    Li, Wujing
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [26] Bidirectional feature fusion via cross-attention transformer for chrysanthemum classification
    Chen, Yifan
    Yang, Xichen
    Yan, Hui
    Liu, Jia
    Jiang, Jian
    Mao, Zhongyuan
    Wang, Tianshu
    PATTERN ANALYSIS AND APPLICATIONS, 2025, 28 (02)
  • [27] DB-DCAFN: dual-branch deformable cross-attention fusion network for bacterial segmentation
    Wang, Jingkun
    Ma, Xinyu
    Cao, Long
    Leng, Yilin
    Li, Zeyi
    Cheng, Zihan
    Cao, Yuzhu
    Huang, Xiaoping
    Zheng, Jian
    VISUAL COMPUTING FOR INDUSTRY BIOMEDICINE AND ART, 2023, 6 (01)
  • [28] DB-DCAFN: dual-branch deformable cross-attention fusion network for bacterial segmentation
    Jingkun Wang
    Xinyu Ma
    Long Cao
    Yilin Leng
    Zeyi Li
    Zihan Cheng
    Yuzhu Cao
    Xiaoping Huang
    Jian Zheng
    Visual Computing for Industry, Biomedicine, and Art, 6
  • [29] Multi-scale cross-attention transformer encoder for event classification
    Hammad, A.
    Moretti, S.
    Nojiri, M.
    JOURNAL OF HIGH ENERGY PHYSICS, 2024, 2024 (03)
  • [30] Accurate Multi-contrast MRI Super-Resolution via a Dual Cross-Attention Transformer Network
    Huang, Shoujin
    Li, Jingyu
    Mei, Lifeng
    Zhang, Tan
    Chen, Ziran
    Dong, Yu
    Dong, Linzheng
    Liu, Shaojun
    Lyu, Mengye
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT X, 2023, 14229 : 313 - 322