Artificial intelligence for diagnosing exudative age-related macular degeneration

被引:0
|
作者
Kang, Chaerim [1 ]
Lo, Jui-En [2 ]
Zhang, Helen [3 ]
Ng, Sueko M. [4 ]
Lin, John C. [5 ]
Scott, Ingrid U. [6 ]
Kalpathy-Cramer, Jayashree [4 ]
Liu, Su-Hsun [4 ,7 ]
Greenberg, Paul B. [1 ,8 ]
机构
[1] Brown Univ, Div Ophthalmol, Providence, RI USA
[2] Case Western Reserve Univ, Dept Internal Med, MetroHealth Med Ctr, Cleveland, OH USA
[3] Brown Univ, Program Liberal Med Educ, Providence, RI USA
[4] Univ Colorado, Dept Ophthalmol, Anschutz Med Campus, Aurora, CO USA
[5] Univ Pennsylvania, Dept Med, Perelman Sch Med, Philadelphia, PA USA
[6] Penn State Coll Med, Dept Ophthalmol & Publ Hlth Sci, Hershey, PA USA
[7] Univ Colorado, Dept Epidemiol, Anschutz Med Campus, Aurora, CO USA
[8] VA Prov Healthcare Syst, Sect Ophthalmol, Providence, RI USA
关键词
OPTICAL COHERENCE TOMOGRAPHY; DISEASE-ACTIVITY PREDICTION; MACHINE LEARNING APPROACH; RETICULAR PSEUDODRUSEN; DIABETIC-RETINOPATHY; DECISION-MAKING; CLASSIFICATION; IMAGES; AMD; OCT;
D O I
10.1002/14651858.CD015522.pub2
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background Age-related macular degeneration (AMD) is a retinal disorder characterized by central retinal (macular) damage. Approximately 10% to 20% of non-exudative AMD cases progress to the exudative form, which may result in rapid deterioration of central vision. Individuals with exudative AMD (eAMD) need prompt consultation with retinal specialists to minimize the risk and extent of vision loss. Traditional methods of diagnosing ophthalmic disease rely on clinical evaluation and multiple imaging techniques, which can be resource-consuming. Tests leveraging artificial intelligence (AI) hold the promise of automatically identifying and categorizing pathological features, enabling the timely diagnosis and treatment of eAMD. Objectives To determine the diagnostic accuracy of artificial intelligence (AI) as a triaging tool for exudative age-related macular degeneration (eAMD). Search methods We searched CENTRAL, MEDLINE, Embase, three clinical trials registries, and Data Archiving and Networked Services (DANS) for gray literature. We did not restrict searches by language or publication date. The date of the last search was April 2024. Selection criteria Included studies compared the test performance of algorithms with that of human readers to detect eAMD on retinal images collected from people with AMD who were evaluated at eye clinics in community or academic medical centers, and who were not receiving treatment for eAMD when the images were taken. We included algorithms that were either internally or externally validated or both. Data collection and analysis Pairs of review authors independently extracted data and assessed study quality using the Quality Assessment of Diagnostic Accuracy Studies-2 (QUADAS-2) tool with revised signaling questions. For studies that reported more than one set of performance results, we extracted only one set of diagnostic accuracy data per study based on the last development stage or the optimal algorithm as indicated by the study authors. For two-class algorithms, we collected data from the 2x2 table whenever feasible. For multi-class algorithms, we first consolidated data from all classes other than eAMD before constructing the corresponding 2x2 tables. Assuming a common positivity threshold applied by the included studies, we chose random-effects, bivariate logistic models to estimate summary sensitivity and specificity as the primary performance metrics. Main results We identified 36 eligible studies that reported 40 sets of algorithm performance data, encompassing over 16,000 participants and 62,000 images. We included 28 studies (78%) that reported 31 algorithms with performance data in the meta-analysis. The remaining nine studies (25%) reported eight algorithms that lacked usable performance data; we reported them in the qualitative synthesis. Study characteristics and risk of bias Most studies were conducted in Asia, followed by Europe, the USA, and collaborative efforts spanning multiple countries. Most studies identified study participants from the hospital setting, while others used retinal images from public repositories; a few studies did not specify image sources. Based on four of the 36 studies reporting demographic information, the age of the study participants ranged from 62 to 82 years. The included algorithms used various retinal image types as model input, such as optical coherence tomography (OCT) images (N = 15), fundus images (N = 6), and multi-modal imaging (N = 7). The predominant core method used was deep neural networks. All studies that reported externally validated algorithms were at high risk of bias mainly due to potential selection bias from either a two-gate design or the inappropriate exclusion of potentially eligible retinal images (or participants). Findings Only three of the 40 included algorithms were externally validated (7.5%, 3/40). The summary sensitivity and specificity were 0.94 (95% confidence interval (CI) 0.90 to 0.97) and 0.99 (95% CI 0.76 to 1.00), respectively, when compared to human graders (3 studies; 27,872 images; low-certainty evidence). The prevalence of images with eAMD ranged from 0.3% to 49%. Twenty-eight algorithms were reportedly either internally validated (20%, 8/40) or tested on a development set (50%, 20/40); the pooled sensitivity and specificity were 0.93 (95% CI 0.89 to 0.96) and 0.96 (95% CI 0.94 to 0.98), respectively, when compared to human graders (28 studies; 33,409 images; low-certainty evidence). We did not identify significant sources of heterogeneity among these 28 algorithms. Although algorithms using OCT images appeared more homogeneous and had the highest summary specificity (0.97, 95% CI 0.93 to 0.98), they were not superior to algorithms using fundus images alone (0.94, 95% CI 0.89 to 0.97) or multimodal imaging (0.96, 95% CI 0.88 to 0.99; P for meta-regression = 0.239). The median prevalence of images with eAMD was 30% (interquartile range [IQR] 22% to 39%). We did not include eight studies that described nine algorithms (one study reported two sets of algorithm results) to distinguish eAMD from normal images, images of other AMD, or other non-AMD retinal lesions in the meta-analysis. Five of these algorithms were generally based on smaller datasets (range 21 to 218 participants per study) yet with a higher prevalence of eAMD images (range 33% to 66%). Relative to human graders, the reported sensitivity in these studies ranged from 0.95 and 0.97, while the specificity ranged from 0.94 to 0.99. Similarly, using small datasets (range 46 to 106), an additional four algorithms for detecting eAMD from other retinal lesions showed high sensitivity (range 0.96 to 1.00) and specificity (range 0.77 to 1.00). Authors' conclusions Low- to very low-certainty evidence suggests that an algorithm-based test may correctly identify most individuals with eAMD without increasing unnecessary referrals (false positives) in either the primary or the specialty care settings. There were significant concerns for applying the review findings due to variations in the eAMD prevalence in the included studies. In addition, among the included algorithm-based tests, diagnostic accuracy estimates were at risk of bias due to study participants not reflecting real-world characteristics, inadequate model validation, and the likelihood of selective results reporting. Limited quality and quantity of externally validated algorithms highlighted the need for high-certainty evidence. This evidence will require a standardized definition for eAMD on different imaging modalities and external validation of the algorithm to assess generalizability.
引用
收藏
页数:222
相关论文
共 50 条
  • [1] Cochrane corner: artificial intelligence for diagnosing exudative age-related macular degeneration
    Ong, Ariel Yuhan
    Hogg, Henry David Jeffry
    Keane, Pearse A.
    EYE, 2025, 39 (04) : 620 - 621
  • [2] Cystoid Macular Degeneration in Exudative Age-Related Macular Degeneration
    Querques, Giuseppe
    Coscas, Florence
    Forte, Raimondo
    Massamba, Nathalie
    Sterkers, Margaret
    Souied, Eric H.
    AMERICAN JOURNAL OF OPHTHALMOLOGY, 2011, 152 (01) : 100 - 107
  • [3] Drusen in Exudative Age-Related Macular Degeneration
    Yu, S. -Y.
    Cho, N. -S.
    Kim, M. -S.
    Kim, E. -S.
    Kwak, H. -W.
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2010, 51 (13)
  • [4] Treatment of Exudative Age-Related Macular Degeneration
    Jo, Nam Chun
    JOURNAL OF THE KOREAN MEDICAL ASSOCIATION, 2007, 50 (08): : 731 - 735
  • [5] Pegaptanib in exudative age-related macular degeneration
    Amoaku, WM
    DRUGS, 2005, 65 (11) : 1579 - 1579
  • [6] PegaptanibIn Exudative Age-Related Macular Degeneration
    M. Asif A. Siddiqui
    Gillian M. Keating
    Drugs, 2005, 65 : 1571 - 1577
  • [7] RESVEGA in exudative age-related macular degeneration
    Kubicz, A.
    ACTA OPHTHALMOLOGICA, 2016, 94
  • [8] Artificial intelligence in assessing progression of age-related macular degeneration
    Frank-Publig, Sophie
    Birner, Klaudia
    Riedl, Sophie
    Reiter, Gregor S.
    Schmidt-Erfurth, Ursula
    EYE, 2025, 39 (02) : 262 - 273
  • [9] Comment on Artificial Intelligence in Neovascular Age-Related Macular Degeneration
    Martins, Thiago Goncalves Dos Santos
    KLINISCHE MONATSBLATTER FUR AUGENHEILKUNDE, 2025,
  • [10] The Use of artificial Intelligence (AI) in age-related Macular Degeneration
    Holz, Frank G.
    Schmitz-Valckenberg, Steffen
    OPHTHALMOLOGIE, 2024, 121 (08): : 607 - 608