Gross failure rates and failure modes for a commercial AI-based auto-segmentation algorithm in head and neck cancer patients

被引:5
|
作者
Temple, Simon W. P. [1 ]
Rowbottom, Carl G. [1 ,2 ]
机构
[1] Clatterbridge Canc Ctr NHS Fdn Trust, Med Phys Dept, Liverpool, England
[2] Univ Liverpool, Dept Phys, Liverpool, England
来源
关键词
auto-segmentation; deep learning; failure modes; INTEROBSERVER VARIABILITY; DELINEATION; ORGANS; RISK; IMPLEMENTATION; ONCOLOGY; QUALITY;
D O I
10.1002/acm2.14273
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
PurposeArtificial intelligence (AI) based commercial software can be used to automatically delineate organs at risk (OAR), with potential for efficiency savings in the radiotherapy treatment planning pathway, and reduction of inter- and intra-observer variability. There has been little research investigating gross failure rates and failure modes of such systems.Method50 head and neck (H&N) patient data sets with "gold standard" contours were compared to AI-generated contours to produce expected mean and standard deviation values for the Dice Similarity Coefficient (DSC), for four common H&N OARs (brainstem, mandible, left and right parotid). An AI-based commercial system was applied to 500 H&N patients. AI-generated contours were compared to manual contours, outlined by an expert human, and a gross failure was set at three standard deviations below the expected mean DSC. Failures were inspected to assess reason for failure of the AI-based system with failures relating to suboptimal manual contouring censored. True failures were classified into 4 sub-types (setup position, anatomy, image artefacts and unknown).ResultsThere were 24 true failures of the AI-based commercial software, a gross failure rate of 1.2%. Fifteen failures were due to patient anatomy, four were due to dental image artefacts, three were due to patient position and two were unknown. True failure rates by OAR were 0.4% (brainstem), 2.2% (mandible), 1.4% (left parotid) and 0.8% (right parotid).ConclusionTrue failures of the AI-based system were predominantly associated with a non-standard element within the CT scan. It is likely that these non-standard elements were the reason for the gross failure, and suggests that patient datasets used to train the AI model did not contain sufficient heterogeneity of data. Regardless of the reasons for failure, the true failure rate for the AI-based system in the H&N region for the OARs investigated was low (similar to 1%).
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Cross-Comparison of Multi-Platform AI Auto-Segmentation Tools Using Independent Multi-Institutional Datasets for Head and Neck Cancer
    Rong, Y.
    Chen, Q.
    Yuan, L.
    Qi, X.
    Latifi, K.
    Yang, X.
    Cai, B.
    Al-Hallaq, H.
    Wu, Q.
    Xiao, Y.
    Benedict, S.
    MEDICAL PHYSICS, 2022, 49 (06) : E781 - E781
  • [42] Deep learning auto-segmentation and automated treatment planning for trismus risk reduction in head and neck cancer radiotherapy
    Thor, Maria
    Iyer, Aditi
    Jiang, Jue
    Apte, Aditya
    Veeraraghavan, Harini
    Allgood, Natasha B.
    Kouri, Jennifer A.
    Zhou, Ying
    LoCastro, Eve
    Elguindi, Sharif
    Hong, Linda
    Hunt, Margie
    Cervino, Laura
    Aristophanous, Michalis
    Zarepisheh, Masoud
    Deasy, Joseph O.
    PHYSICS & IMAGING IN RADIATION ONCOLOGY, 2021, 19 : 96 - 101
  • [43] Atlas-based auto-segmentation of heart structures in breast cancer patients
    Kaderka, R.
    Mundt, R.
    Bryant, A.
    Gillespie, E.
    Eastman, B.
    Atwood, T.
    Murphy, J.
    RADIOTHERAPY AND ONCOLOGY, 2017, 123 : S495 - S495
  • [44] Evaluation of atlas-based auto-segmentation software in prostate cancer patients
    Greenham, Stuart
    Dean, Jenna
    Fu, Cheuk Kuen Kenneth
    Goman, Joanne
    Mulligan, Jeremy
    Tune, Deanna
    Sampson, David
    Westhuyzen, Justin
    Mckay, Michael
    JOURNAL OF MEDICAL RADIATION SCIENCES, 2014, 61 (03): : 151 - 158
  • [45] Evaluation of atlas based auto-segmentation for head and neck target volume delineation in adaptive/replan IMRT.
    Speight, R.
    Karakaya, E.
    Prestwich, R.
    Sen, M.
    Lindsay, R.
    Harding, R.
    Sykes, J.
    XVII INTERNATIONAL CONFERENCE ON THE USE OF COMPUTERS IN RADIATION THERAPY (ICCR 2013), 2014, 489
  • [46] Cascaded deep learning-based auto-segmentation for head and neck cancer patients: Organs at risk on T2-weighted magnetic resonance imaging
    Korte, James C.
    Hardcastle, Nicholas
    Ng, Sweet Ping
    Clark, Brett
    Kron, Tomas
    Jackson, Price
    MEDICAL PHYSICS, 2021, 48 (12) : 7757 - 7772
  • [47] Validation of a multi-atlas based auto-segmentation of the heart in breast cancer patients
    Van Dijk-Peters, F. B. J.
    Sijtsema, N. M.
    Kierkels, R. G. J.
    Vliegenthart, R.
    Langendijk, J. A.
    Maduro, J. H.
    Crijns, A. P. G.
    RADIOTHERAPY AND ONCOLOGY, 2015, 115 : S132 - S133
  • [48] Evaluation of three AI-based CT auto-contouring systems for head&neck, thorax and pelvis
    Casati, M.
    Loi, M.
    Arilli, C.
    Marrazzo, L.
    Talamonti, C.
    Zani, M.
    Compagnucci, A.
    Simontacchi, G.
    Di Cataldo, V.
    Desideri, I.
    Bonomo, P.
    Franza, N.
    Raspanti, D.
    Pellegrini, R.
    Livi, L.
    Pallotta, S.
    RADIOTHERAPY AND ONCOLOGY, 2022, 170 : S276 - S278
  • [49] Auto-segmentation of organs at risk for head and neck radiotherapy planning: From atlas-based to deep learning methods
    Vrtovec, Tomaz
    Mocnik, Domen
    Strojan, Primoz
    Pernus, Franjo
    Ibragimov, Bulat
    MEDICAL PHYSICS, 2020, 47 (09) : E929 - E950
  • [50] A Preliminary Experience of Implementing Deep-Learning Based Auto-Segmentation in Head and Neck Cancer: A Study on Real-World Clinical Cases
    Zhong, Yang
    Yang, Yanju
    Fang, Yingtao
    Wang, Jiazhou
    Hu, Weigang
    FRONTIERS IN ONCOLOGY, 2021, 11