Assessment of the Diagnostic Performance of a Commercially Available Artificial Intelligence Algorithm for Risk Stratification of Thyroid Nodules on Ultrasound

被引:0
|
作者
Ashton, Jeffrey [1 ]
Morrison, Samantha [2 ]
Erkanli, Alaattin [2 ]
Wildman-Tobriner, Benjamin [1 ]
机构
[1] Duke Univ, Med Ctr, Dept Radiol, 2301 ERWIN RD,Box 3808, Durham, NC 27710 USA
[2] Duke Univ, Sch Med, Dept Biostat & Bioinformat, Durham, NC USA
基金
美国国家卫生研究院;
关键词
AI; neoplasm; ultrasound; algorithm; koios; AMERICAN-COLLEGE; CANCER;
D O I
10.1089/thy.2024.0410
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background: Thyroid nodules are challenging to accurately characterize on ultrasound (US), though the emergence of risk stratification systems and more recently artificial intelligence (AI) algorithms has improved nodule classification. The purpose of this study was to evaluate the performance of a recent Food and Drug Administration (FDA)-cleared AI tool for detection of malignancy in thyroid nodules on US. Methods: One year of consecutive thyroid US with >= 1 nodule from Duke University Hospital and its affiliate community hospital (649 nodules from 347 patients) were retrospectively evaluated. Included nodules had ground truth diagnoses by surgical pathology, fine needle aspiration (FNA), or three-year follow-up US showing stability. An FDA-cleared AI tool (Koios DS Thyroid) analyzed each nodule to generate (i) American College of Radiology Thyroid Imaging Reporting and Data System (ACR TI-RADS) descriptors, scores, and follow-up recommendations and (ii) an AI-adapter score to further adjust risk assessments and recommendations. Four groups were then compared: (i) Koios with AI-adapter, (ii) Koios without AI-adapter, (iii) clinical radiology report, and (iv) radiology report combined with AI-adapter. Performance of the final recommendations (FNA or no FNA) was determined based on ground truth, and comparison between the four groups was made using sensitivity, specificity, and receiver-operating-curve analysis. Results: Of 649 nodules, 32 were malignant and 617 were benign. Performance of Koios with AI-adapter enabled was similar to radiologists (area under the curve [AUC] 0.70 for both, [CI 0.60-0.81] and [0.60-0.79], respectively). Koios with AI-adapter had improved specificity compared to radiologists (0.63 [CI: 0.59-0.67] versus 0.43 [CI: 0.38-0.48]) but decreased sensitivity (0.69 [CI: 0.50-0.83) versus 0.81 [CI: 0.61, 0.92]). Highest performance was seen when the radiology interpretation was combined with the AI-adapter (AUC 0.76, [CI: 0.67-0.85]). Combined with the AI-adapter, radiologist specificity improved from 0.43 ([CI: 0.38-0.48]) to 0.53 ([CI: 0.49-0.58]) (McNemar's test p < 0.001), resulting in 17% fewer FNA recommendations, with unchanged sensitivity (0.81, p = 1). Conclusion: Koios DS demonstrated standalone performance similar to radiologists, though with lower sensitivity and higher specificity. Performance was best when radiologist interpretations were combined with the AI-adapter component, with improved specificity and reduced unnecessary FNA recommendations.
引用
收藏
页码:1379 / 1388
页数:10
相关论文
共 50 条
  • [31] Diagnostic performance of adult-based ultrasound risk stratification systems in pediatric thyroid nodules: a systematic review and meta-analysis
    Xing, Zhichao
    Qiu, Yuxuan
    Zhu, Jingqiang
    Su, Anping
    Wu, Wenshuang
    FRONTIERS IN ENDOCRINOLOGY, 2023, 14
  • [32] Combining radiomics with ultrasound-based risk stratification systems for thyroid nodules: an approach for improving performance
    Park, Vivian Y.
    Lee, Eunjung
    Lee, Hye Sun
    Kim, Hye Jung
    Yoon, Jiyoung
    Son, Jinwoo
    Song, Kijun
    Moon, Hee Jung
    Yoon, Jung Hyun
    Kim, Ga Ram
    Kwak, Jin Young
    EUROPEAN RADIOLOGY, 2021, 31 (04) : 2405 - 2413
  • [33] Combining radiomics with ultrasound-based risk stratification systems for thyroid nodules: an approach for improving performance
    Vivian Y. Park
    Eunjung Lee
    Hye Sun Lee
    Hye Jung Kim
    Jiyoung Yoon
    Jinwoo Son
    Kijun Song
    Hee Jung Moon
    Jung Hyun Yoon
    Ga Ram Kim
    Jin Young Kwak
    European Radiology, 2021, 31 : 2405 - 2413
  • [34] Malignancy risk of thyroid nodules: quality assessment of the thyroid ultrasound report
    Raposo, Luis
    Freitas, Claudia
    Martins, Raquel
    Saraiva, Catarina
    Manita, Isabel
    Oliveira, Maria Joao
    Marques, Ana Paula
    Marques, Bernardo
    Rocha, Gustavo
    Martins, Teresa
    Azevedo, Teresa
    Rodrigues, Fernando
    BMC MEDICAL IMAGING, 2022, 22 (01)
  • [35] Malignancy risk of thyroid nodules: quality assessment of the thyroid ultrasound report
    Luís Raposo
    Cláudia Freitas
    Raquel Martins
    Catarina Saraiva
    Isabel Manita
    Maria João Oliveira
    Ana Paula Marques
    Bernardo Marques
    Gustavo Rocha
    Teresa Martins
    Teresa Azevedo
    Fernando Rodrigues
    BMC Medical Imaging, 22
  • [36] Diagnostic Performance of Five Adult-based US Risk Stratification Systems in Pediatric Thyroid Nodules
    Kim, Pyeong Hwa
    Yoon, Hee Mang
    Baek, Jung Hwan
    Chung, Sae Rom
    Choi, Young Jun
    Lee, Jeong Hyun
    Lee, Jin Seong
    Jung, Ah Young
    Cho, Young Ah
    Bak, Boram
    Na, Dong Gyu
    RADIOLOGY, 2022, 305 (01) : 190 - 198
  • [37] Thyroid nodules: diagnostic evaluation based on thyroid cancer risk assessment
    Ospina, Naykky Singh
    Iniguez-Ariza, Nicole M.
    Castro, M. Regina
    BMJ-BRITISH MEDICAL JOURNAL, 2020, 368
  • [38] A Comparison of the Performances of Artificial Intelligence System and Radiologists in the Ultrasound Diagnosis of Thyroid Nodules
    He, Lian-Tu
    Chen, Feng-Juan
    Zhou, Da-Zhi
    Zhang, Yu-Xin
    Li, Ying-Shan
    Tang, Min-Xuan
    Tang, Jia-Xin
    Liu, Shuo
    Chen, Zhi-Jie
    Tang, Qing
    CURRENT MEDICAL IMAGING, 2022, 18 (13) : 1369 - 1377
  • [39] Diagnostic Performance of An Artificial Intelligence System in Breast Ultrasound
    O'Connell, Avice M.
    Bartolotta, Tommaso V.
    Orlando, Alessia
    Jung, Sin-Ho
    Baek, Jihye
    Parker, Kevin J.
    JOURNAL OF ULTRASOUND IN MEDICINE, 2022, 41 (01) : 97 - 105
  • [40] A scoping review of commercially available artificial intelligence-based technologies for diagnosis or risk assessment of skin cancer
    Wen, David
    Coombe, April
    Charalambides, Maria
    Jones, Owain
    Wietek, Nina
    Dinnes, Jac
    Matin, Rubeta
    BRITISH JOURNAL OF DERMATOLOGY, 2022, 187 : 159 - 159