Diabetic retinopathy grading based on multi-scale residual network and cross-attention module

Authors
Singh, Atul Kumar [1]
Madarapu, Sandeep [1]
Ari, Samit [1]
Affiliation
[1] Natl Inst Technol Rourkela, Dept Elect & Commun, Rourkela 769008, Odisha, India
Keywords
Convolutional neural network; Cross-attention block; Deep learning; Diabetic retinopathy; Multi-scale residual attention block; CLASSIFICATION; CLAHE;
DOI
10.1016/j.dsp.2024.104888
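Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification codes
0808; 0809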
Abstract
Diabetic retinopathy (DR) is a severe complication of diabetes mellitus that mainly affects the retinal tissue and carries a high risk of blindness. Ophthalmologists face challenges in assessing DR severity because of the complexity of the task and time constraints. Consequently, there is an urgent need for automated methods that detect DR from retinal fundus images. This study introduces a novel deep learning architecture that uses a multi-scale residual attention block (MSRAB) and a cross-attention block (CrAB) for DR grading. The proposed MSRAB employs a convolutional neural network (CNN) with diverse dilation rates to expand its field of view. It also incorporates a residual attention network, which lets it concentrate adaptively on pertinent features and prioritize critical characteristics in retinal fundus images, improving grading performance. Similarly, the CrAB integrates channel and spatial attention mechanisms to capture inter-channel interactions and spatial dependencies within the input features. Together, these blocks enable the model to focus more efficiently on critical regions, to separate them from irrelevant features, and to capture interrelations across different channels and spatial regions. The proposed model employs a pre-trained backbone network to extract the local and global features needed to identify DR precisely. Using a pre-trained backbone also improves model efficiency, which is helpful in settings with limited data and limited processing resources. The proposed method surpasses existing methods in accuracy, recall, precision, and area under the curve (AUC).
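The abstract's point that diverse dilation rates expand the field of view can be illustrated with a small toy. The sketch below is not the authors' MSRAB: it works on a 1D list instead of a 2D feature map, omits the attention mechanism entirely, and the kernel size, the dilation rates, and the averaging fusion are all assumptions chosen for illustration. It shows only two things: how the span covered by one kernel grows with the dilation rate, and how parallel dilated branches might be fused and added back to the input through a residual connection.

```python
def effective_kernel_size(kernel_size: int, dilation: int) -> int:
    """Number of input samples spanned by one dilated kernel."""
    return kernel_size + (kernel_size - 1) * (dilation - 1)


def dilated_conv1d(signal, kernel, dilation):
    """Zero-padded, same-length 1D dilated convolution (cross-correlation).

    Assumes an odd-length kernel so that it can be centred on each sample.
    """
    k = len(kernel)
    out = []
    for i in range(len(signal)):
        acc = 0.0
        for j, w in enumerate(kernel):
            # Taps are spaced `dilation` samples apart around position i.
            idx = i + (j - k // 2) * dilation
            if 0 <= idx < len(signal):
                acc += w * signal[idx]
        out.append(acc)
    return out


def multi_scale_residual(signal, kernel, dilations):
    """Average the dilated branches, then add the input back (residual).

    Averaging is a hypothetical fusion rule used here for simplicity.
    """
    branches = [dilated_conv1d(signal, kernel, d) for d in dilations]
    fused = [sum(vals) / len(branches) for vals in zip(*branches)]
    return [x + f for x, f in zip(signal, fused)]


if __name__ == "__main__":
    for d in (1, 2, 4):  # assumed dilation rates
        print(f"dilation={d}: spans {effective_kernel_size(3, d)} samples")
    impulse = [0, 0, 0, 1, 0, 0, 0]
    print(multi_scale_residual(impulse, [1, 1, 1], dilations=[1, 2]))
```

With a 3-tap kernel, raising the dilation rate from 1 to 4 widens the covered span from 3 to 9 samples without adding weights, which is the sense in which dilation enlarges the receptive field.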
Pages: 10