Vison transformer adapter-based hyperbolic embeddings for multi-lesion segmentation in diabetic retinopathy

被引：8

作者：

Wang, Zijian ^{[1
,3
]}

Lu, Haimei ^{[2
]}

Yan, Haixin ^{[3
]}

Kan, Hongxing ^{[1
]}

Jin, Li ^{[1
]}

机构：

[1] Anhui Univ Chinese Med, Sch Med & Informat Engn, Hefei 230012, Peoples R China

[2] Anhui Med Univ, Sch Basic Med Sci, Hefei 230032, Peoples R China

[3] Hefei Univ Technol, Hefei 230009, Peoples R China

来源：

SCIENTIFIC REPORTS | 2023年 / 13卷 / 01期

关键词：

D O I：

10.1038/s41598-023-38320-5

中图分类号：

O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Diabetic Retinopathy (DR) is a major cause of blindness worldwide. Early detection and treatment are crucial to prevent vision loss, making accurate and timely diagnosis critical. Deep learning technology has shown promise in the automated diagnosis of DR, and in particular, multi-lesion segmentation tasks. In this paper, we propose a novel Transformer-based model for DR segmentation that incorporates hyperbolic embeddings and a spatial prior module. The proposed model is primarily built on a traditional Vision Transformer encoder and further enhanced by incorporating a spatial prior module for image convolution and feature continuity, followed by feature interaction processing using the spatial feature injector and extractor. Hyperbolic embeddings are used to classify feature matrices from the model at the pixel level. We evaluated the proposed model's performance on the publicly available datasets and compared it with other widely used DR segmentation models. The results show that our model outperforms these widely used DR segmentation models. The incorporation of hyperbolic embeddings and a spatial prior module into the Vision Transformer-based model significantly improves the accuracy of DR segmentation. The hyperbolic embeddings enable us to better capture the underlying geometric structure of the feature matrices, which is important for accurate segmentation. The spatial prior module improves the continuity of the features and helps to better distinguish between lesions and normal tissues. Overall, our proposed model has potential for clinical use in automated DR diagnosis, improving accuracy and speed of diagnosis. Our study shows that the integration of hyperbolic embeddings and a spatial prior module with a Vision Transformer-based model improves the performance of DR segmentation models. Future research can explore the application of our model to other medical imaging tasks, as well as further optimization and validation in real-world clinical settings.

引用

页数：13

共 37 条

[31] A multi model deep net with an explainable AI based framework for diabetic retinopathy segmentation and classification
Sharma, Neeraj
Lalwani, Praveen
SCIENTIFIC REPORTS, 2025, 15 (01):
[32] LTUNet: A Lightweight Transformer-Based UNet with Multi-scale Mechanism for Skin Lesion Segmentation
Guo, Huike
Zhang, Han
Li, Minghe
Quan, Xiongwen
ARTIFICIAL INTELLIGENCE, CICAI 2023, PT II, 2024, 14474 : 147 - 158
[33] Transformer-Enhanced Retinal Vessel Segmentation for Diabetic Retinopathy Detection Using Attention Mechanisms and Multi-Scale Fusion
Kim, Hyung-Joo
Eesaar, Hassan
Chong, Kil To
APPLIED SCIENCES-BASEL, 2024, 14 (22):
[34] U-Net-based gannet sine cosine algorithm enabled lesion segmentation and deep CNN for diabetic retinopathy classification
Mundada, Rupesh Goverdhan
Nawgaje, Devesh D.
COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING-IMAGING AND VISUALIZATION, 2023,
[35] U-Net-based gannet sine cosine algorithm enabled lesion segmentation and deep CNN for diabetic retinopathy classification
Mundada, Rupesh Goverdhan
Nawgaje, Devesh D.
COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING-IMAGING AND VISUALIZATION, 2023, 11 (06): : 2400 - 2417
[36] Threshold segmentation based multi-layer analysis for detecting diabetic retinopathy using convolution neural network
Shanthini, A.
Manogaran, Gunasekaran
Vadivu, G.
Kottilingam, K.
Nithyakani, P.
Fancy, C.
JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 15 (Suppl 1) : 183 - 183
[37] Entropy Weighted and Kernalized Power K-Means Clustering Based Lesion Segmentation and Optimized Deep Learning for Diabetic Retinopathy Detection
Elwin, J. Granty Regina
Kumar, K. Suresh
Ananth, J. P.
Kumar, R. Ramesh
INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2023, 32 (01)

← 1 2 3 4 →