Test accuracy of artificial intelligence-based grading of fundus images in diabetic retinopathy screening: A systematic review

被引:6
|
作者
Zhelev, Zhivko [1 ]
Peters, Jaime [1 ]
Rogers, Morwenna [2 ]
Allen, Michael [3 ]
Kijauskaite, Goda [4 ]
Seedat, Farah [4 ]
Wilkinson, Elizabeth [5 ]
Hyde, Christopher [1 ]
机构
[1] Univ Exeter, Univ Exeter Med Sch, Exeter Test Grp, Coll House 1-25,St Lukes Campus, Exeter EX1 2LU, England
[2] Univ Exeter, Univ Exeter Med Sch, NIHR ARC South West Peninsula PenARC, Exeter, England
[3] Univ Exeter, Univ Exeter Med Sch, Exeter, England
[4] UK Natl Screening Comm, London, England
[5] Northern Devon Healthcare NHS Trust, Exeter, Devon, England
关键词
Artificial intelligence; fundus imaging; sensitivity and specificity; systematic review; diabetic retinopathy; screening; AUTOMATED DETECTION; VALIDATION; PHOTOGRAPHY; PROGRESSION; SOFTWARE;
D O I
10.1177/09691413221144382
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Objectives To systematically review the accuracy of artificial intelligence (AI)-based systems for grading of fundus images in diabetic retinopathy (DR) screening. Methods We searched MEDLINE, EMBASE, the Cochrane Library and the ClinicalTrials.gov from 1st January 2000 to 27th August 2021. Accuracy studies published in English were included if they met the pre-specified inclusion criteria. Selection of studies for inclusion, data extraction and quality assessment were conducted by one author with a second reviewer independently screening and checking 20% of titles. Results were analysed narratively. Results Forty-three studies evaluating 15 deep learning (DL) and 4 machine learning (ML) systems were included. Nine systems were evaluated in a single study each. Most studies were judged to be at high or unclear risk of bias in at least one QUADAS-2 domain. Sensitivity for referable DR and higher grades was >= 85% while specificity varied and was <80% for all ML systems and in 6/31 studies evaluating DL systems. Studies reported high accuracy for detection of ungradable images, but the latter were analysed and reported inconsistently. Seven studies reported that AI was more sensitive but less specific than human graders. Conclusions AI-based systems are more sensitive than human graders and could be safe to use in clinical practice but have variable specificity. However, for many systems evidence is limited, at high risk of bias and may not generalise across settings. Therefore, pre-implementation assessment in the target clinical pathway is essential to obtain reliable and applicable accuracy estimates.
引用
收藏
页码:97 / 112
页数:16
相关论文
共 50 条
  • [31] Feasibility and accuracy of the screening for diabetic retinopathy using a fundus camera and an artificial intelligence pre-evaluation application
    Piatti, A.
    Romeo, F.
    Manti, R.
    Doglio, M.
    Tartaglino, B.
    Nada, E.
    Giorda, C. B.
    ACTA DIABETOLOGICA, 2024, 61 (01) : 63 - 68
  • [32] Evaluation of artificial intelligence-based telemedicine screening for retinopathy of prematurity
    Greenwald, Miles F.
    Danford, Ian D.
    Shahrawat, Malika
    Ostmo, Susan
    Brown, James
    Kalpathy-Cramer, Jayashree
    Bradshaw, Kacy
    Schelonka, Robert
    Cohen, Howard S.
    Chan, R. V. Paul
    Chiang, Michael F.
    Campbell, J. Peter
    JOURNAL OF AAPOS, 2020, 24 (03): : 160 - 162
  • [33] Agreement and Diagnostic Test Accuracy on Grading Diabetic Retinopathy Using Fundus Photographs by Allied Medical Personnel at a Community Diabetic Retinopathy Screening Program in Nepal
    Thapa, Raba
    Bajimaya, Sanyam
    Pradhan, Eli
    Sharma, Sanjita
    Kshetri, Bal Bahadur
    Paudel, Manish
    Paudyal, Govinda
    OPHTHALMIC EPIDEMIOLOGY, 2021, 28 (06) : 509 - 515
  • [34] Clinical validation of an artificial intelligence-based diabetic retinopathy screening tool for a national health system
    Arenas-Cavalli, Jose Tomas
    Abarca, Ignacio
    Rojas-Contreras, Maximiliano
    Bernuy, Fernando
    Donoso, Rodrigo
    EYE, 2022, 36 (01) : 78 - 85
  • [35] Diagnostic accuracy and feasibility of artificial intelligence-based diabetic retinopathy screening in primary care settings: a real-world assessment
    Joseph, Sanil
    Shang, Xianwen
    Kiburg, Katerina
    Henwood, Jacqueline
    Zhu, Zhuoting
    Clark, Malcolm
    MacIsaac, Richard
    Ravilla, Thulasiraj D.
    van Wijngaarden, Peter
    He, Minguang
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2024, 65 (07)
  • [36] Planning an Artificial Intelligence-Based Diabetic Retinopathy Screening Program Using Human Centered Design
    Beversluis, Cameron
    Scanzera, Angelica
    Potharazu, Archit
    Leifer, Ariel
    Kalinoski, Lauren
    Cole, Emily
    Du, David
    Musick, Hugh
    Krishnan, Jerry
    Chan, Robison
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2023, 64 (08)
  • [37] Clinical validation of an artificial intelligence-based diabetic retinopathy screening tool for a national health system
    José Tomás Arenas-Cavalli
    Ignacio Abarca
    Maximiliano Rojas-Contreras
    Fernando Bernuy
    Rodrigo Donoso
    Eye, 2022, 36 : 78 - 85
  • [38] Efficacy of artificial intelligence-based screening for diabetic retinopathy in type 2 diabetes mellitus patients
    Pei, Xiaoting
    Yao, Xi
    Yang, Yingrui
    Zhang, Hongmei
    Xia, Mengting
    Huang, Ranran
    Wang, Yuming
    Li, Zhijie
    DIABETES RESEARCH AND CLINICAL PRACTICE, 2022, 184
  • [39] Correction to: Accuracy of artificial intelligence-based segmentation in maxillofacial structures: a systematic review
    Manea Alahmari
    Maram Alahmari
    Abdulmajeed Almuaddi
    Hossam Abdelmagyd
    Kumuda Rao
    Zaid Hamdoon
    Mohammed Alsaegh
    Nallan C. S. K. Chaitanya
    Shishir Shetty
    BMC Oral Health, 25 (1)
  • [40] Analysis of retinal fundus images for grading of diabetic retinopathy severity
    Fadzil, M. H. Ahmad
    Izhar, Lila Iznita
    Nugroho, Hermawan
    Nugroho, Hanung Adi
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2011, 49 (06) : 693 - 700