Vison transformer adapter-based hyperbolic embeddings for multi-lesion segmentation in diabetic retinopathy

被引:8
|
作者
Wang, Zijian [1 ,3 ]
Lu, Haimei [2 ]
Yan, Haixin [3 ]
Kan, Hongxing [1 ]
Jin, Li [1 ]
机构
[1] Anhui Univ Chinese Med, Sch Med & Informat Engn, Hefei 230012, Peoples R China
[2] Anhui Med Univ, Sch Basic Med Sci, Hefei 230032, Peoples R China
[3] Hefei Univ Technol, Hefei 230009, Peoples R China
关键词
D O I
10.1038/s41598-023-38320-5
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Diabetic Retinopathy (DR) is a major cause of blindness worldwide. Early detection and treatment are crucial to prevent vision loss, making accurate and timely diagnosis critical. Deep learning technology has shown promise in the automated diagnosis of DR, and in particular, multi-lesion segmentation tasks. In this paper, we propose a novel Transformer-based model for DR segmentation that incorporates hyperbolic embeddings and a spatial prior module. The proposed model is primarily built on a traditional Vision Transformer encoder and further enhanced by incorporating a spatial prior module for image convolution and feature continuity, followed by feature interaction processing using the spatial feature injector and extractor. Hyperbolic embeddings are used to classify feature matrices from the model at the pixel level. We evaluated the proposed model's performance on the publicly available datasets and compared it with other widely used DR segmentation models. The results show that our model outperforms these widely used DR segmentation models. The incorporation of hyperbolic embeddings and a spatial prior module into the Vision Transformer-based model significantly improves the accuracy of DR segmentation. The hyperbolic embeddings enable us to better capture the underlying geometric structure of the feature matrices, which is important for accurate segmentation. The spatial prior module improves the continuity of the features and helps to better distinguish between lesions and normal tissues. Overall, our proposed model has potential for clinical use in automated DR diagnosis, improving accuracy and speed of diagnosis. Our study shows that the integration of hyperbolic embeddings and a spatial prior module with a Vision Transformer-based model improves the performance of DR segmentation models. Future research can explore the application of our model to other medical imaging tasks, as well as further optimization and validation in real-world clinical settings.
引用
收藏
页数:13
相关论文
共 37 条
  • [31] A multi model deep net with an explainable AI based framework for diabetic retinopathy segmentation and classification
    Sharma, Neeraj
    Lalwani, Praveen
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [32] LTUNet: A Lightweight Transformer-Based UNet with Multi-scale Mechanism for Skin Lesion Segmentation
    Guo, Huike
    Zhang, Han
    Li, Minghe
    Quan, Xiongwen
    ARTIFICIAL INTELLIGENCE, CICAI 2023, PT II, 2024, 14474 : 147 - 158
  • [33] Transformer-Enhanced Retinal Vessel Segmentation for Diabetic Retinopathy Detection Using Attention Mechanisms and Multi-Scale Fusion
    Kim, Hyung-Joo
    Eesaar, Hassan
    Chong, Kil To
    APPLIED SCIENCES-BASEL, 2024, 14 (22):
  • [34] U-Net-based gannet sine cosine algorithm enabled lesion segmentation and deep CNN for diabetic retinopathy classification
    Mundada, Rupesh Goverdhan
    Nawgaje, Devesh D.
    COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING-IMAGING AND VISUALIZATION, 2023,
  • [35] U-Net-based gannet sine cosine algorithm enabled lesion segmentation and deep CNN for diabetic retinopathy classification
    Mundada, Rupesh Goverdhan
    Nawgaje, Devesh D.
    COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING-IMAGING AND VISUALIZATION, 2023, 11 (06): : 2400 - 2417
  • [36] Threshold segmentation based multi-layer analysis for detecting diabetic retinopathy using convolution neural network
    Shanthini, A.
    Manogaran, Gunasekaran
    Vadivu, G.
    Kottilingam, K.
    Nithyakani, P.
    Fancy, C.
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 15 (Suppl 1) : 183 - 183
  • [37] Entropy Weighted and Kernalized Power K-Means Clustering Based Lesion Segmentation and Optimized Deep Learning for Diabetic Retinopathy Detection
    Elwin, J. Granty Regina
    Kumar, K. Suresh
    Ananth, J. P.
    Kumar, R. Ramesh
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2023, 32 (01)