Vison transformer adapter-based hyperbolic embeddings for multi-lesion segmentation in diabetic retinopathy

被引:8
|
作者
Wang, Zijian [1 ,3 ]
Lu, Haimei [2 ]
Yan, Haixin [3 ]
Kan, Hongxing [1 ]
Jin, Li [1 ]
机构
[1] Anhui Univ Chinese Med, Sch Med & Informat Engn, Hefei 230012, Peoples R China
[2] Anhui Med Univ, Sch Basic Med Sci, Hefei 230032, Peoples R China
[3] Hefei Univ Technol, Hefei 230009, Peoples R China
关键词
D O I
10.1038/s41598-023-38320-5
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Diabetic Retinopathy (DR) is a major cause of blindness worldwide. Early detection and treatment are crucial to prevent vision loss, making accurate and timely diagnosis critical. Deep learning technology has shown promise in the automated diagnosis of DR, and in particular, multi-lesion segmentation tasks. In this paper, we propose a novel Transformer-based model for DR segmentation that incorporates hyperbolic embeddings and a spatial prior module. The proposed model is primarily built on a traditional Vision Transformer encoder and further enhanced by incorporating a spatial prior module for image convolution and feature continuity, followed by feature interaction processing using the spatial feature injector and extractor. Hyperbolic embeddings are used to classify feature matrices from the model at the pixel level. We evaluated the proposed model's performance on the publicly available datasets and compared it with other widely used DR segmentation models. The results show that our model outperforms these widely used DR segmentation models. The incorporation of hyperbolic embeddings and a spatial prior module into the Vision Transformer-based model significantly improves the accuracy of DR segmentation. The hyperbolic embeddings enable us to better capture the underlying geometric structure of the feature matrices, which is important for accurate segmentation. The spatial prior module improves the continuity of the features and helps to better distinguish between lesions and normal tissues. Overall, our proposed model has potential for clinical use in automated DR diagnosis, improving accuracy and speed of diagnosis. Our study shows that the integration of hyperbolic embeddings and a spatial prior module with a Vision Transformer-based model improves the performance of DR segmentation models. Future research can explore the application of our model to other medical imaging tasks, as well as further optimization and validation in real-world clinical settings.
引用
收藏
页数:13
相关论文
共 37 条
  • [21] Diabetic retinopathy lesion segmentation using deep multi-scale framework
    Guo, Tianjiao
    Yang, Jie
    Yu, Qi
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 88
  • [22] SEGAN-BASED LESION SEGMENTATION AND OPTIMIZED RideNN FOR DIABETIC RETINOPATHY CLASSIFICATION
    Sagvekar, Vidya
    Joshi, Manjusha
    BIOMEDICAL ENGINEERING-APPLICATIONS BASIS COMMUNICATIONS, 2023, 35 (04):
  • [23] A Weakly-Supervised Multi-lesion Segmentation Framework Based on Target-Level Incomplete Annotations
    Ju, Jianguo
    Ren, Shumin
    Qiu, Dandan
    Tu, Huijuan
    Yin, Juanjuan
    Xu, Pengfei
    Guan, Ziyu
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT IX, 2024, 15009 : 44 - 53
  • [24] Diabetic Retinopathy Lesion Segmentation Based on Hierarchical Feature Progressive Fusion in Retinal Fundus Images
    Ding Pengchao
    Li Feng
    CHINESE JOURNAL OF LASERS-ZHONGGUO JIGUANG, 2024, 51 (21):
  • [25] Transformer-based multi-attention hybrid networks for skin lesion segmentation
    Dong, Zhiwei
    Li, Jinjiang
    Hua, Zhen
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 244
  • [26] CMNet: Cascaded context fusion and multi-attention network for multiple lesion segmentation of diabetic retinopathy images
    Guo, Yanfei
    Du, Hangli
    Zhang, Yuanke
    Ma, Fei
    Meng, Jing
    Yuan, Shasha
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 94
  • [27] Scheme Based on Multi-Level Patch Attention and Lesion Localization for Diabetic Retinopathy Grading
    Xia, Zhuoqun
    Hu, Hangyu
    Li, Wenjing
    Jiang, Qisheng
    Pu, Lan
    Shu, Yicong
    Sangaiah, Arun Kumar
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2024, 140 (01): : 409 - 430
  • [28] Multi-modal Automatic Video Segmentation with Sentence Transformer Embeddings and KeyBERT-Based Subtopic Extraction
    Vasuki, M.
    Gangadharan, M. Arun
    Daniel, Jibin Thomas
    Sadashiv, Arjun
    Venugopal, Vivek
    Vekkot, Susmitha
    2024 2ND WORLD CONFERENCE ON COMMUNICATION & COMPUTING, WCONF 2024, 2024,
  • [29] WSRFNet: Wavelet-Based Scale-Specific Recurrent Feedback Network for Diabetic Retinopathy Lesion Segmentation
    Li, Xuan
    Wu, Xiangqian
    PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 1038 - 1046
  • [30] RP squeeze U-SegNet model for lesion segmentation and optimization enabled ShuffleNet based multi-level severity diabetic retinopathy classification
    Sulaiman, Zulaikha Beevi
    NETWORK-COMPUTATION IN NEURAL SYSTEMS, 2024,