Assessment of the Diagnostic Performance of a Commercially Available Artificial Intelligence Algorithm for Risk Stratification of Thyroid Nodules on Ultrasound

被引:0
|
作者
Ashton, Jeffrey [1 ]
Morrison, Samantha [2 ]
Erkanli, Alaattin [2 ]
Wildman-Tobriner, Benjamin [1 ]
机构
[1] Duke Univ, Med Ctr, Dept Radiol, 2301 ERWIN RD,Box 3808, Durham, NC 27710 USA
[2] Duke Univ, Sch Med, Dept Biostat & Bioinformat, Durham, NC USA
基金
美国国家卫生研究院;
关键词
AI; neoplasm; ultrasound; algorithm; koios; AMERICAN-COLLEGE; CANCER;
D O I
10.1089/thy.2024.0410
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background: Thyroid nodules are challenging to accurately characterize on ultrasound (US), though the emergence of risk stratification systems and more recently artificial intelligence (AI) algorithms has improved nodule classification. The purpose of this study was to evaluate the performance of a recent Food and Drug Administration (FDA)-cleared AI tool for detection of malignancy in thyroid nodules on US. Methods: One year of consecutive thyroid US with >= 1 nodule from Duke University Hospital and its affiliate community hospital (649 nodules from 347 patients) were retrospectively evaluated. Included nodules had ground truth diagnoses by surgical pathology, fine needle aspiration (FNA), or three-year follow-up US showing stability. An FDA-cleared AI tool (Koios DS Thyroid) analyzed each nodule to generate (i) American College of Radiology Thyroid Imaging Reporting and Data System (ACR TI-RADS) descriptors, scores, and follow-up recommendations and (ii) an AI-adapter score to further adjust risk assessments and recommendations. Four groups were then compared: (i) Koios with AI-adapter, (ii) Koios without AI-adapter, (iii) clinical radiology report, and (iv) radiology report combined with AI-adapter. Performance of the final recommendations (FNA or no FNA) was determined based on ground truth, and comparison between the four groups was made using sensitivity, specificity, and receiver-operating-curve analysis. Results: Of 649 nodules, 32 were malignant and 617 were benign. Performance of Koios with AI-adapter enabled was similar to radiologists (area under the curve [AUC] 0.70 for both, [CI 0.60-0.81] and [0.60-0.79], respectively). Koios with AI-adapter had improved specificity compared to radiologists (0.63 [CI: 0.59-0.67] versus 0.43 [CI: 0.38-0.48]) but decreased sensitivity (0.69 [CI: 0.50-0.83) versus 0.81 [CI: 0.61, 0.92]). Highest performance was seen when the radiology interpretation was combined with the AI-adapter (AUC 0.76, [CI: 0.67-0.85]). Combined with the AI-adapter, radiologist specificity improved from 0.43 ([CI: 0.38-0.48]) to 0.53 ([CI: 0.49-0.58]) (McNemar's test p < 0.001), resulting in 17% fewer FNA recommendations, with unchanged sensitivity (0.81, p = 1). Conclusion: Koios DS demonstrated standalone performance similar to radiologists, though with lower sensitivity and higher specificity. Performance was best when radiologist interpretations were combined with the AI-adapter component, with improved specificity and reduced unnecessary FNA recommendations.
引用
收藏
页码:1379 / 1388
页数:10
相关论文
共 50 条
  • [1] Ultrasonographic risk stratification of indeterminate thyroid nodules; a comparison of an artificial intelligence algorithm with radiologist performance
    Tahmasebi, Aylin
    Wang, Shuo
    Daniels, Kelly
    Cottrill, Elizabeth
    Liu, Ji-Bin
    Xu, Jiajun
    Lyshchik, Andrej
    Eisenbrey, John R.
    PROCEEDINGS OF THE 2020 IEEE INTERNATIONAL ULTRASONICS SYMPOSIUM (IUS), 2020,
  • [2] Simplifying risk stratification for thyroid nodules on ultrasound: validation and performance of an artificial intelligence thyroid imaging reporting and data system
    Wildman-Tobriner, Benjamin
    Yang, Jichen
    Allen, Brian C.
    Ho, Lisa M.
    Miller, Chad M.
    Mazurowski, Maciej A.
    CURRENT PROBLEMS IN DIAGNOSTIC RADIOLOGY, 2024, 53 (06) : 695 - 699
  • [3] Effect of the categorization method on the diagnostic performance of ultrasound risk stratification systems for thyroid nodules
    Fu, Chao
    Cui, Yiyang
    Li, Jing
    Yu, Jing
    Wang, Yan
    Si, Caifeng
    Cui, Kefei
    FRONTIERS IN ONCOLOGY, 2023, 13
  • [4] Ultrasound diagnostic criteria for thyroid nodules around the world and artificial intelligence
    Kitagawa, Wataru
    JOURNAL OF MEDICAL ULTRASONICS, 2023, 50 (04) : 463 - 464
  • [5] Ultrasound diagnostic criteria for thyroid nodules around the world and artificial intelligence
    Wataru Kitagawa
    Journal of Medical Ultrasonics, 2023, 50 : 463 - 464
  • [6] Diagnostic performance of artificial intelligence in interpreting thyroid nodules on ultrasound images: a multicenter retrospective study
    Namsena, Pawitchaya
    Songsaeng, Dittapong
    Keatmanee, Chadaporn
    Klabwong, Songphon
    Kunapinun, Alisa
    Soodchuen, Sunsiree
    Tarathipayakul, Thipthara
    Tanasoontrarat, Wasu
    Ekpanyapong, Mongkol
    Dailey, Matthew N.
    QUANTITATIVE IMAGING IN MEDICINE AND SURGERY, 2024, 14 (05) : 3676 - 3694
  • [7] Diagnostic efficiency of existing guidelines and the AI-SONIC™ artificial intelligence for ultrasound-based risk assessment of thyroid nodules
    Yang, Linxin
    Lin, Ning
    Wang, Mingyan
    Chen, Gaofang
    FRONTIERS IN ENDOCRINOLOGY, 2023, 14
  • [8] External validation of AIBx, an artificial intelligence model for risk stratification, in thyroid nodules
    Swan, Kristine Z.
    Thomas, Johnson
    Nielsen, Viveque E.
    Jespersen, Marie Louise
    Bonnema, Steen J.
    EUROPEAN THYROID JOURNAL, 2022, 11 (02)
  • [9] Impact of the ultrasonography assessment method on the malignancy risk and diagnostic performance of five risk stratification systems in thyroid nodules
    Yang, Go Eun
    Na, Dong Gyu
    ENDOCRINE, 2022, 75 (01) : 137 - 148
  • [10] Impact of the ultrasonography assessment method on the malignancy risk and diagnostic performance of five risk stratification systems in thyroid nodules
    Go Eun Yang
    Dong Gyu Na
    Endocrine, 2022, 75 : 137 - 148