Assessment of the Diagnostic Performance of a Commercially Available Artificial Intelligence Algorithm for Risk Stratification of Thyroid Nodules on Ultrasound

被引:0
|
作者
Ashton, Jeffrey [1 ]
Morrison, Samantha [2 ]
Erkanli, Alaattin [2 ]
Wildman-Tobriner, Benjamin [1 ]
机构
[1] Duke Univ, Med Ctr, Dept Radiol, 2301 ERWIN RD,Box 3808, Durham, NC 27710 USA
[2] Duke Univ, Sch Med, Dept Biostat & Bioinformat, Durham, NC USA
基金
美国国家卫生研究院;
关键词
AI; neoplasm; ultrasound; algorithm; koios; AMERICAN-COLLEGE; CANCER;
D O I
10.1089/thy.2024.0410
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background: Thyroid nodules are challenging to accurately characterize on ultrasound (US), though the emergence of risk stratification systems and more recently artificial intelligence (AI) algorithms has improved nodule classification. The purpose of this study was to evaluate the performance of a recent Food and Drug Administration (FDA)-cleared AI tool for detection of malignancy in thyroid nodules on US. Methods: One year of consecutive thyroid US with >= 1 nodule from Duke University Hospital and its affiliate community hospital (649 nodules from 347 patients) were retrospectively evaluated. Included nodules had ground truth diagnoses by surgical pathology, fine needle aspiration (FNA), or three-year follow-up US showing stability. An FDA-cleared AI tool (Koios DS Thyroid) analyzed each nodule to generate (i) American College of Radiology Thyroid Imaging Reporting and Data System (ACR TI-RADS) descriptors, scores, and follow-up recommendations and (ii) an AI-adapter score to further adjust risk assessments and recommendations. Four groups were then compared: (i) Koios with AI-adapter, (ii) Koios without AI-adapter, (iii) clinical radiology report, and (iv) radiology report combined with AI-adapter. Performance of the final recommendations (FNA or no FNA) was determined based on ground truth, and comparison between the four groups was made using sensitivity, specificity, and receiver-operating-curve analysis. Results: Of 649 nodules, 32 were malignant and 617 were benign. Performance of Koios with AI-adapter enabled was similar to radiologists (area under the curve [AUC] 0.70 for both, [CI 0.60-0.81] and [0.60-0.79], respectively). Koios with AI-adapter had improved specificity compared to radiologists (0.63 [CI: 0.59-0.67] versus 0.43 [CI: 0.38-0.48]) but decreased sensitivity (0.69 [CI: 0.50-0.83) versus 0.81 [CI: 0.61, 0.92]). Highest performance was seen when the radiology interpretation was combined with the AI-adapter (AUC 0.76, [CI: 0.67-0.85]). Combined with the AI-adapter, radiologist specificity improved from 0.43 ([CI: 0.38-0.48]) to 0.53 ([CI: 0.49-0.58]) (McNemar's test p < 0.001), resulting in 17% fewer FNA recommendations, with unchanged sensitivity (0.81, p = 1). Conclusion: Koios DS demonstrated standalone performance similar to radiologists, though with lower sensitivity and higher specificity. Performance was best when radiologist interpretations were combined with the AI-adapter component, with improved specificity and reduced unnecessary FNA recommendations.
引用
收藏
页码:1379 / 1388
页数:10
相关论文
共 50 条
  • [21] Diagnostic Performance of Ultrasound-Based Risk Stratification Systems for Thyroid Nodules: A Systematic Review and Meta-Analysis
    Joo, Leehi
    Lee, Min Kyoung
    Lee, Ji Ye
    Ha, Eun Ju
    Na, Dong Gyu
    ENDOCRINOLOGY AND METABOLISM, 2023, 38 (01) : 117 - 128
  • [22] Comparison of diagnostic performance of two ultrasound risk stratification systems for thyroid nodules: a systematic review and meta-analysis
    Yun Jin Kang
    Hee Sun Ahn
    Gulnaz Stybayeva
    Ju Eun Lee
    Se Hwan Hwang
    La radiologia medica, 2023, 128 : 1407 - 1414
  • [23] Thyroid nodules: risk stratification for malignancy with ultrasound and guided biopsy
    Anil, Gopinathan
    Hegde, Amogh
    Chong, F. H. Vincent
    CANCER IMAGING, 2011, 11 (01): : 209 - 223
  • [24] Performance of a commercially available artificial intelligence software for the detection of confirmed pulmonary nodules and masses in canine thoracic radiography
    Pomerantz, Leah Kathleen
    Solano, Mauricio
    Kalosa-Kenyon, Eric
    VETERINARY RADIOLOGY & ULTRASOUND, 2023, 64 (05) : 881 - 889
  • [25] Comparison of ultrasound risk stratification systems for pediatric thyroid nodules
    Yu, Jing
    Cui, Yiyang
    Fu, Chao
    Ma, Xiao
    Si, Caifeng
    Huang, Yuanjing
    Cui, Kefei
    Zhang, Yan
    FRONTIERS IN ENDOCRINOLOGY, 2024, 15
  • [26] Risk Stratification of Thyroid Nodules: From Ultrasound Features to TIRADS
    Rago, Teresa
    Vitti, Paolo
    CANCERS, 2022, 14 (03)
  • [27] Diagnostic performance of ultrasound risk stratification systems on thyroid nodules cytologically classified as indeterminate: a systematic review and meta-analysis
    Xing, Zhichao
    Qiu, Yuxuan
    Zhu, Jingqiang
    Su, Anping
    Wu, Wenshuang
    ULTRASONOGRAPHY, 2023, 42 (04) : 518 - 531
  • [28] Ultrasound systems for risk stratification of thyroid nodules prompt inappropriate biopsy in autonomously functioning thyroid nodules
    Castellana, Marco
    Virili, Camilla
    Paone, Gaetano
    Scappaticcio, Lorenzo
    Piccardo, Arnoldo
    Giovanella, Luca
    Trimboli, Pierpaolo
    CLINICAL ENDOCRINOLOGY, 2020, 93 (01) : 67 - 75
  • [29] AIBx, Artificial Intelligence Model to Risk Stratify Thyroid Nodules
    Thomas, Johnson
    Haertling, Tracy
    THYROID, 2020, 30 (06) : 878 - 884
  • [30] Advanced Ultrasound Application - Impact on Presurgical Risk Stratification of the Thyroid Nodules
    Stoian, Dana
    Ivan, Viviana
    Sporea, Ioan
    Florian, Varcus
    Mozos, Ioana
    Navolan, Dan
    Nemescu, Dragos
    THERAPEUTICS AND CLINICAL RISK MANAGEMENT, 2020, 16 : 21 - 30