Gross failure rates and failure modes for a commercial AI-based auto-segmentation algorithm in head and neck cancer patients

Cited by: 5
Authors
Temple, Simon W. P. [1]
Rowbottom, Carl G. [1, 2]
Affiliations
[1] Clatterbridge Cancer Centre NHS Foundation Trust, Medical Physics Department, Liverpool, England
[2] University of Liverpool, Department of Physics, Liverpool, England
Source
Keywords
auto-segmentation; deep learning; failure modes; interobserver variability; delineation; organs; risk; implementation; oncology; quality
DOI
10.1002/acm2.14273
Chinese Library Classification (CLC) number
R8 [Special Medicine]; R445 [Diagnostic Imaging]
Discipline classification code
1002; 100207; 1009
Abstract
Purpose: Artificial intelligence (AI)-based commercial software can automatically delineate organs at risk (OARs), with potential for efficiency savings in the radiotherapy treatment planning pathway and for reducing inter- and intra-observer variability. There has been little research into the gross failure rates and failure modes of such systems.
Method: Fifty head and neck (H&N) patient data sets with "gold standard" contours were compared to AI-generated contours to produce expected mean and standard deviation values of the Dice Similarity Coefficient (DSC) for four common H&N OARs (brainstem, mandible, left parotid and right parotid). An AI-based commercial system was then applied to 500 H&N patients. AI-generated contours were compared to manual contours outlined by an expert human, and a gross failure was defined as a DSC more than three standard deviations below the expected mean. Failures were inspected to assess the reason for failure of the AI-based system, and failures relating to suboptimal manual contouring were censored. True failures were classified into four sub-types (setup position, anatomy, image artefacts and unknown).
Results: There were 24 true failures of the AI-based commercial software, a gross failure rate of 1.2%. Fifteen failures were due to patient anatomy, four were due to dental image artefacts, three were due to patient position and two were of unknown cause. True failure rates by OAR were 0.4% (brainstem), 2.2% (mandible), 1.4% (left parotid) and 0.8% (right parotid).
Conclusion: True failures of the AI-based system were predominantly associated with a non-standard element within the CT scan. It is likely that these non-standard elements were the reason for the gross failures, which suggests that the patient datasets used to train the AI model did not contain sufficient heterogeneity. Regardless of the reasons for failure, the true failure rate of the AI-based system in the H&N region for the OARs investigated was low (approximately 1%).
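The gross-failure criterion described in the abstract (a DSC more than three standard deviations below the expected mean) can be illustrated with a minimal Python sketch. This is not the authors' code. The function names, the representation of contours as flat 0/1 voxel sequences, the use of the sample standard deviation, and all numeric values are assumptions made for illustration only.

```python
from statistics import mean, stdev


def dice(mask_a, mask_b):
    """Dice Similarity Coefficient for two binary masks (equal-length 0/1 sequences)."""
    if len(mask_a) != len(mask_b):
        raise ValueError("masks must have the same number of voxels")
    size_a = sum(1 for v in mask_a if v)
    size_b = sum(1 for v in mask_b if v)
    if size_a + size_b == 0:
        # Both masks empty: treated as perfect agreement (an assumed convention).
        return 1.0
    overlap = sum(1 for a, b in zip(mask_a, mask_b) if a and b)
    return 2.0 * overlap / (size_a + size_b)


def gross_failure_threshold(reference_dscs, n_sd=3.0):
    """Expected mean DSC minus n_sd standard deviations (sample SD assumed)."""
    return mean(reference_dscs) - n_sd * stdev(reference_dscs)


def flag_gross_failures(dscs_by_patient, threshold):
    """Return the sorted IDs of patients whose DSC falls below the threshold."""
    return sorted(pid for pid, d in dscs_by_patient.items() if d < threshold)


# Hypothetical DSC values for one OAR from a reference cohort (made-up numbers).
reference_dscs = [0.90, 0.88, 0.92, 0.86, 0.94]
threshold = gross_failure_threshold(reference_dscs)

# Hypothetical DSC values for the same OAR in the evaluation cohort.
evaluation_dscs = {"P001": 0.91, "P002": 0.62, "P003": 0.81}
candidates = flag_gross_failures(evaluation_dscs, threshold)
```

In this sketch the flagged cases are only candidates. The abstract describes a further manual inspection step in which failures caused by suboptimal manual contouring are censored before the true failure rate is counted. The reported overall rate is consistent with 24 true failures across 500 patients and four OARs (24 / 2000 = 1.2%).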
Pages: 10