Multi-scale multi-attention network for diabetic retinopathy grading

被引:1
|
作者
Xia, Haiying [1 ]
Long, Jie [1 ]
Song, Shuxiang [1 ]
Tan, Yumei [2 ]
机构
[1] Guangxi Normal Univ, Sch Elect & Informat Engn, Guilin 541004, Peoples R China
[2] Guangxi Normal Univ, Sch Comp Sci & Engn, Guilin 541004, Peoples R China
来源
PHYSICS IN MEDICINE AND BIOLOGY | 2024年 / 69卷 / 01期
基金
中国国家自然科学基金;
关键词
diabetic retinopathy grading; lesions attention module; multi-scale feature fusion module; CONVOLUTIONAL NEURAL-NETWORK; SYSTEM; DIAGNOSIS;
D O I
10.1088/1361-6560/ad111d
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Objective. Diabetic retinopathy (DR) grading plays an important role in clinical diagnosis. However, automatic grading of DR is challenging due to the presence of intra-class variation and small lesions. On the one hand, deep features learned by convolutional neural networks often lose valid information about these small lesions. On the other hand, the great variability of lesion features, including differences in type and quantity, can exhibit considerable divergence even among fundus images of the same grade. To address these issues, we propose a novel multi-scale multi-attention network (MMNet). Approach. Firstly, to focus on different lesion features of fundus images, we propose a lesion attention module, which aims to encode multiple different lesion attention feature maps by combining channel attention and spatial attention, thus extracting global feature information and preserving diverse lesion features. Secondly, we propose a multi-scale feature fusion module to learn more feature information for small lesion regions, which combines complementary relationships between different convolutional layers to capture more detailed feature information. Furthermore, we introduce a Cross-layer Consistency Constraint Loss to overcome semantic differences between multi-scale features. Main results. The proposed MMNet obtains a high accuracy of 86.4% and a high kappa score of 88.4% for multi-class DR grading tasks on the EyePACS dataset, while 98.6% AUC, 95.3% accuracy, 92.7% recall, 95.0% precision, and 93.3% F1-score for referral and non-referral classification on the Messidor-1 dataset. Extensive experiments on two challenging benchmarks demonstrate that our MMNet achieves significant improvements and outperforms other state-of-the-art DR grading methods. Significance. MMNet has improved the diagnostic efficiency and accuracy of diabetes retinopathy and promoted the application of computer-aided medical diagnosis in DR screening.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Person re-identification based on multi-scale feature fusion and multi-attention mechanism
    Pu, Jiacheng
    Zou, Wei
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (01) : 243 - 253
  • [22] MAXFormer: Enhanced transformer for medical image segmentation with multi-attention and multi-scale features fusion
    Liang, Zhiwei
    Zhao, Kui
    Liang, Gang
    Li, Siyu
    Wu, Yifei
    Zhou, Yiping
    KNOWLEDGE-BASED SYSTEMS, 2023, 280
  • [23] Person re-identification based on multi-scale feature fusion and multi-attention mechanism
    Jiacheng Pu
    Wei Zou
    Signal, Image and Video Processing, 2024, 18 : 243 - 253
  • [24] Multi-Attention Network for Sentiment Analysis
    Du, Tingting
    Huang, Yunyin
    Wu, Xian
    Chang, Huiyou
    PROCEEDINGS OF THE 2018 2ND INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND INFORMATION RETRIEVAL (NLPIR 2018), 2018, : 49 - 54
  • [25] Multi-scale Refocusing Attention Siamese Network
    Liu, Guoqiang
    Chen, Zhe
    Shen, Guangze
    2024 5TH INTERNATIONAL CONFERENCE ON GEOLOGY, MAPPING AND REMOTE SENSING, ICGMRS 2024, 2024, : 42 - 46
  • [26] Multi-Scale Attention Network for Image Cropping
    Lian, Tianpei
    Xian, Ke
    Pan, Zhiyu
    Hong, Chaoyi
    Cao, Zhiguo
    Zhong, Weicai
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 2640 - 2645
  • [27] Multi-scale attention network for image inpainting
    Qin, Jia
    Bai, Huihui
    Zhao, Yao
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2021, 204
  • [28] Multi-Attention Network for Stereo Matching
    Yang, Xiaowei
    He, Lin
    Zhao, Yong
    Sang, Haiwei
    Yang, Zuliu
    Cheng, Xianjing
    IEEE ACCESS, 2020, 8 : 113371 - 113382
  • [29] Two-channel deep recursive multi-scale network based on multi-attention for no-reference image quality assessment
    Wang, Changzhong
    Lv, Xiang
    Fan, Xiaodong
    Ding, Weiping
    Jiang, Xiaoli
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (07) : 2421 - 2437
  • [30] Two-channel deep recursive multi-scale network based on multi-attention for no-reference image quality assessment
    Changzhong Wang
    Xiang Lv
    Xiaodong Fan
    Weiping Ding
    Xiaoli Jiang
    International Journal of Machine Learning and Cybernetics, 2023, 14 : 2421 - 2437