Vison transformer adapter-based hyperbolic embeddings for multi-lesion segmentation in diabetic retinopathy

被引：8

作者：

Wang, Zijian ^{[1
,3
]}

Lu, Haimei ^{[2
]}

Yan, Haixin ^{[3
]}

Kan, Hongxing ^{[1
]}

Jin, Li ^{[1
]}

机构：

[1] Anhui Univ Chinese Med, Sch Med & Informat Engn, Hefei 230012, Peoples R China

[2] Anhui Med Univ, Sch Basic Med Sci, Hefei 230032, Peoples R China

[3] Hefei Univ Technol, Hefei 230009, Peoples R China

来源：

SCIENTIFIC REPORTS | 2023年 / 13卷 / 01期

关键词：

D O I：

10.1038/s41598-023-38320-5

中图分类号：

O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Diabetic Retinopathy (DR) is a major cause of blindness worldwide. Early detection and treatment are crucial to prevent vision loss, making accurate and timely diagnosis critical. Deep learning technology has shown promise in the automated diagnosis of DR, and in particular, multi-lesion segmentation tasks. In this paper, we propose a novel Transformer-based model for DR segmentation that incorporates hyperbolic embeddings and a spatial prior module. The proposed model is primarily built on a traditional Vision Transformer encoder and further enhanced by incorporating a spatial prior module for image convolution and feature continuity, followed by feature interaction processing using the spatial feature injector and extractor. Hyperbolic embeddings are used to classify feature matrices from the model at the pixel level. We evaluated the proposed model's performance on the publicly available datasets and compared it with other widely used DR segmentation models. The results show that our model outperforms these widely used DR segmentation models. The incorporation of hyperbolic embeddings and a spatial prior module into the Vision Transformer-based model significantly improves the accuracy of DR segmentation. The hyperbolic embeddings enable us to better capture the underlying geometric structure of the feature matrices, which is important for accurate segmentation. The spatial prior module improves the continuity of the features and helps to better distinguish between lesions and normal tissues. Overall, our proposed model has potential for clinical use in automated DR diagnosis, improving accuracy and speed of diagnosis. Our study shows that the integration of hyperbolic embeddings and a spatial prior module with a Vision Transformer-based model improves the performance of DR segmentation models. Future research can explore the application of our model to other medical imaging tasks, as well as further optimization and validation in real-world clinical settings.

引用

页数：13

共 37 条

[21] Diabetic retinopathy lesion segmentation using deep multi-scale framework
Guo, Tianjiao
Yang, Jie
Yu, Qi
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 88
[22] SEGAN-BASED LESION SEGMENTATION AND OPTIMIZED RideNN FOR DIABETIC RETINOPATHY CLASSIFICATION
Sagvekar, Vidya
Joshi, Manjusha
BIOMEDICAL ENGINEERING-APPLICATIONS BASIS COMMUNICATIONS, 2023, 35 (04):
[23] A Weakly-Supervised Multi-lesion Segmentation Framework Based on Target-Level Incomplete Annotations
Ju, Jianguo
Ren, Shumin
Qiu, Dandan
Tu, Huijuan
Yin, Juanjuan
Xu, Pengfei
Guan, Ziyu
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT IX, 2024, 15009 : 44 - 53
[24] Diabetic Retinopathy Lesion Segmentation Based on Hierarchical Feature Progressive Fusion in Retinal Fundus Images
Ding Pengchao
Li Feng
CHINESE JOURNAL OF LASERS-ZHONGGUO JIGUANG, 2024, 51 (21):
[25] Transformer-based multi-attention hybrid networks for skin lesion segmentation
Dong, Zhiwei
Li, Jinjiang
Hua, Zhen
EXPERT SYSTEMS WITH APPLICATIONS, 2024, 244
[26] CMNet: Cascaded context fusion and multi-attention network for multiple lesion segmentation of diabetic retinopathy images
Guo, Yanfei
Du, Hangli
Zhang, Yuanke
Ma, Fei
Meng, Jing
Yuan, Shasha
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 94
[27] Scheme Based on Multi-Level Patch Attention and Lesion Localization for Diabetic Retinopathy Grading
Xia, Zhuoqun
Hu, Hangyu
Li, Wenjing
Jiang, Qisheng
Pu, Lan
Shu, Yicong
Sangaiah, Arun Kumar
CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2024, 140 (01): : 409 - 430
[28] Multi-modal Automatic Video Segmentation with Sentence Transformer Embeddings and KeyBERT-Based Subtopic Extraction
Vasuki, M.
Gangadharan, M. Arun
Daniel, Jibin Thomas
Sadashiv, Arjun
Venugopal, Vivek
Vekkot, Susmitha
2024 2ND WORLD CONFERENCE ON COMMUNICATION & COMPUTING, WCONF 2024, 2024,
[29] WSRFNet: Wavelet-Based Scale-Specific Recurrent Feedback Network for Diabetic Retinopathy Lesion Segmentation
Li, Xuan
Wu, Xiangqian
PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 1038 - 1046
[30] RP squeeze U-SegNet model for lesion segmentation and optimization enabled ShuffleNet based multi-level severity diabetic retinopathy classification
Sulaiman, Zulaikha Beevi
NETWORK-COMPUTATION IN NEURAL SYSTEMS, 2024,

← 1 2 3 4 →