Enhancing the reliability of deep learning-based head and neck tumour segmentation using uncertainty estimation with multi-modal images

被引:1
|
作者
Ren, Jintao [1 ,2 ,3 ]
Teuwen, Jonas [4 ]
Nijkamp, Jasper [1 ,3 ]
Rasmussen, Mathis [1 ,2 ,3 ]
Gouw, Zeno [4 ]
Eriksen, Jesper Grau [2 ,3 ]
Sonke, Jan-Jakob [4 ]
Korreman, Stine [1 ,2 ,3 ]
机构
[1] Aarhus Univ Hosp, Danish Ctr Particle Therapy, Palle Juul Jensens Blvd 25, DK-8200 Aarhus N, Denmark
[2] Aarhus Univ Hosp, Dept Oncol, Palle Juul Jensens Blvd 25, DK-8200 Aarhus N, Denmark
[3] Aarhus Univ, Dept Clin Med, Palle Juul Jensens Blvd 25, DK-8200 Aarhus N, Denmark
[4] Netherlands Canc Inst, Dept Radiat Oncol, Plesmanlaan 121, NL-1066 CX Amsterdam, Netherlands
来源
PHYSICS IN MEDICINE AND BIOLOGY | 2024年 / 69卷 / 16期
关键词
uncertainty estimation; deep learning; radiotherapy; gross tumour volume; head and neck cancer; tumour segmentation; uncertainty quantification; QUANTIFICATION; OROPHARYNGEAL; DELINEATION; DAHANCA;
D O I
10.1088/1361-6560/ad682d
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Objective. Deep learning shows promise in autosegmentation of head and neck cancer (HNC) primary tumours (GTV-T) and nodal metastases (GTV-N). However, errors such as including non-tumour regions or missing nodal metastases still occur. Conventional methods often make overconfident predictions, compromising reliability. Incorporating uncertainty estimation, which provides calibrated confidence intervals can address this issue. Our aim was to investigate the efficacy of various uncertainty estimation methods in improving segmentation reliability. We evaluated their confidence levels in voxel predictions and ability to reveal potential segmentation errors. Approach. We retrospectively collected data from 567 HNC patients with diverse cancer sites and multi-modality images (CT, PET, T1-, and T2-weighted MRI) along with their clinical GTV-T/N delineations. Using the nnUNet 3D segmentation pipeline, we compared seven uncertainty estimation methods, evaluating them based on segmentation accuracy (Dice similarity coefficient, DSC), confidence calibration (Expected Calibration Error, ECE), and their ability to reveal segmentation errors (Uncertainty-Error overlap using DSC, UE-DSC). Main results. Evaluated on the hold-out test dataset (n = 97), the median DSC scores for GTV-T and GTV-N segmentation across all uncertainty estimation methods had a narrow range, from 0.73 to 0.76 and 0.78 to 0.80, respectively. In contrast, the median ECE exhibited a wider range, from 0.30 to 0.12 for GTV-T and 0.25 to 0.09 for GTV-N. Similarly, the median UE-DSC also ranged broadly, from 0.21 to 0.38 for GTV-T and 0.22 to 0.36 for GTV-N. A probabilistic network-PhiSeg method consistently demonstrated the best performance in terms of ECE and UE-DSC. Significance. Our study highlights the importance of uncertainty estimation in enhancing the reliability of deep learning for autosegmentation of HNC GTV. The results show that while segmentation accuracy can be similar across methods, their reliability, measured by calibration error and uncertainty-error overlap, varies significantly. Used with visualisation maps, these methods may effectively pinpoint uncertainties and potential errors at the voxel level.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Deep Learning-based Brain Tumour Segmentation
    Ventakasubbu, Pattabiraman
    Ramasubramanian, Parvathi
    IETE JOURNAL OF RESEARCH, 2023, 69 (06) : 3156 - 3164
  • [22] RETRACTION: An Efficient Deep Learning-based Video Captioning Framework Using Multi-modal Features
    Varma, S.
    James, D. P.
    EXPERT SYSTEMS, 2025, 42 (02)
  • [23] Multi-task Learning of Semantic Segmentation and Height Estimation for Multi-modal Remote Sensing Images
    Mengyu WANG
    Zhiyuan YAN
    Yingchao FENG
    Wenhui DIAO
    Xian SUN
    Journal of Geodesy and Geoinformation Science, 2023, 6 (04) : 27 - 39
  • [24] Multi-modal tumor segmentation methods based on deep learning: a narrative review
    Xue, Hengzhi
    Yao, Yudong
    Teng, Yueyang
    QUANTITATIVE IMAGING IN MEDICINE AND SURGERY, 2024, 14 (01) : 1122 - 1140
  • [25] Identification of Luminal A breast cancer by using deep learning analysis based on multi-modal images
    Liu, Menghan
    Zhang, Shuai
    Du, Yanan
    Zhang, Xiaodong
    Wang, Dawei
    Ren, Wanqing
    Sun, Jingxiang
    Yang, Shiwei
    Zhang, Guang
    FRONTIERS IN ONCOLOGY, 2023, 13
  • [26] Evidential deep learning-based multi-modal environment perception for intelligent vehicles
    Geletu, Mihreteab Negash
    Giurgi, Danut-Vasile
    Josso-Laurain, Thomas
    Devanne, Maxime
    Wogari, Mengesha Mamo
    Lauffenburger, Jean-Philippe
    2023 IEEE INTELLIGENT VEHICLES SYMPOSIUM, IV, 2023,
  • [27] Multi-Modal Ensemble Deep Learning in Head and Neck Cancer HPV Sub-Typing
    Saikia, Manob Jyoti
    Kuanar, Shiba
    Mahapatra, Dwarikanath
    Faghani, Shahriar
    BIOENGINEERING-BASEL, 2024, 11 (01):
  • [28] RAIF: A deep learning-based architecture for multi-modal aesthetic biometric system
    Iffath, Fariha
    Gavrilova, Marina
    COMPUTER ANIMATION AND VIRTUAL WORLDS, 2023, 34 (3-4)
  • [29] Uncertainty in deep learning-based automatic segmentations of head and neck cancer tumors
    Bao Ngoc Huynh
    Groendahl, Aurora Rosvoll
    Tomic, Oliver
    Knudtsen, Ingerid Skjei
    Hoebers, Frank
    van Elmpt, Wouter
    Dale, Einar
    Malinen, Eirik
    Futsaether, Cecilia Marie
    RADIOTHERAPY AND ONCOLOGY, 2024, 194 : S3034 - S3037
  • [30] UNIVERSAL MULTI-MODAL DEEP NETWORK FOR CLASSIFICATION AND SEGMENTATION OF MEDICAL IMAGES
    Harouni, Ahmed
    Karargyris, Alexandros
    Negahdar, Mohammadreza
    Beymer, David
    Syeda-Mahmood, Tanveer
    2018 IEEE 15TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI 2018), 2018, : 872 - 876