Benchmarking Deep Learning Models for Tooth Structure Segmentation

被引:14
|
作者
Schneider, L. [1 ,2 ]
Arsiwala-Scheppach, L. [1 ,2 ]
Krois, J. [1 ,2 ]
Meyer-Lueckel, H. [3 ]
Bressem, K. K. [4 ,5 ]
Niehues, S. M. [4 ]
Schwendicke, F. [1 ,2 ]
机构
[1] Charite, Dept Oral Diagnost Digital Hlth & Hlth Serv Res, Assmannshauser Str 4 6, D-14197 Berlin, Germany
[2] ITU WHO Focus Grp Hlth, Top Grp Dent Diagnost & Digital Dent, Geneva, Switzerland
[3] Univ Bern, Zahnmed Kliniken Univ Bern, Dept Restorat Prevent & Pediat Dent, Bern, Switzerland
[4] Charite Univ Med Berlin, Radiol Klin, Berlin, Germany
[5] Charite Univ Med Berlin, Berlin Inst Hlth, Berlin, Germany
关键词
computer vision; artificial intelligence; segmentation; tooth structures; transfer learning; neural networks;
D O I
10.1177/00220345221100169
中图分类号
R78 [口腔科学];
学科分类号
1003 ;
摘要
A wide range of deep learning (DL) architectures with varying depths are available, with developers usually choosing one or a few of them for their specific task in a nonsystematic way. Benchmarking (i.e., the systematic comparison of state-of-the art architectures on a specific task) may provide guidance in the model development process and may allow developers to make better decisions. However, comprehensive benchmarking has not been performed in dentistry yet. We aimed to benchmark a range of architecture designs for 1 specific, exemplary case: tooth structure segmentation on dental bitewing radiographs. We built 72 models for tooth structure (enamel, dentin, pulp, fillings, crowns) segmentation by combining 6 different DL network architectures (U-Net, U-Net++, Feature Pyramid Networks, LinkNet, Pyramid Scene Parsing Network, Mask Attention Network) with 12 encoders from 3 different encoder families (ResNet, VGG, DenseNet) of varying depth (e.g., VGG13, VGG16, VGG19). On each model design, 3 initialization strategies (ImageNet, CheXpert, random initialization) were applied, resulting overall into 216 trained models, which were trained up to 200 epochs with the Adam optimizer (learning rate = 0.0001) and a batch size of 32. Our data set consisted of 1,625 human-annotated dental bitewing radiographs. We used a 5-fold cross-validation scheme and quantified model performances primarily by the F1-score. Initialization with ImageNet or CheXpert weights significantly outperformed random initialization (P < 0.05). Deeper and more complex models did not necessarily perform better than less complex alternatives. VGG-based models were more robust across model configurations, while more complex models (e.g., from the ResNet family) achieved peak performances. In conclusion, initializing models with pretrained weights may be recommended when training models for dental radiographic analysis. Less complex model architectures may be competitive alternatives if computational resources and training time are restricting factors. Models developed and found superior on nondental data sets may not show this behavior for dental domain-specific tasks.
引用
收藏
页码:1343 / 1349
页数:7
相关论文
共 50 条
  • [21] Medical Ultrasound Image Segmentation With Deep Learning Models
    Wang, Chuantao
    Zhang, Jinhua
    Liu, Siyu
    IEEE ACCESS, 2023, 11 : 10158 - 10168
  • [22] Benchmarking the Effect of Poisoning Defenses on the Security and Bias of Deep Learning Models
    Baracaldo, Nathalie
    Ahmed, Farhan
    Eykholt, Kevin
    Zhou, Yi
    Priya, Shriti
    Lee, Taesung
    Kadhe, Swanand
    Tan, Mike
    Polavaram, Sridevi
    Suggs, Sterling
    Gao, Yuyang
    Slater, David
    2023 IEEE SECURITY AND PRIVACY WORKSHOPS, SPW, 2023, : 45 - 56
  • [23] Benchmarking and scaling of deep learning models for land cover image classification
    Papoutsis, Ioannis
    Bountos, Nikolaos Ioannis
    Zavras, Angelos
    Michail, Dimitrios
    Tryfonopoulos, Christos
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2023, 195 : 250 - 268
  • [24] Benchmarking Deep Learning Models on Myriad and Snapdragon Processors for Space Applications
    Dunkel, Emily R.
    Swope, Jason
    Candela, Alberto
    West, Lauren
    Chien, Steve A.
    Towfic, Zaid
    Buckley, Leonie
    Romero-Canas, Juan
    Espinosa-Aranda, Jose Luis
    Hervas-Martin, Elena
    Fernandez, Mark R.
    JOURNAL OF AEROSPACE INFORMATION SYSTEMS, 2023, 20 (10): : 660 - 674
  • [25] Benchmarking Deep Learning Models for Object Detection on Edge Computing Devices
    Alqahtani, Daghash K.
    Cheema, Muhammad Aamir
    Toosi, Adel N.
    SERVICE-ORIENTED COMPUTING, ICSOC 2024, PT I, 2025, 15404 : 142 - 150
  • [26] Benchmarking explanation methods for mental state decoding with deep learning models
    Thomas, Armin W.
    Re, Christopher
    Poldrack, Russell A.
    NEUROIMAGE, 2023, 273
  • [27] Transformer-Based Deep Learning Network for Tooth Segmentation on Panoramic Radiographs
    SHENG Chen
    WANG Lin
    HUANG Zhenhuan
    WANG Tian
    GUO Yalin
    HOU Wenjie
    XU Laiqing
    WANG Jiazhu
    YAN Xue
    JournalofSystemsScience&Complexity, 2023, 36 (01) : 257 - 272
  • [28] Deep learning-based tooth segmentation methods in medical imaging: A review
    Chen, Xiaokang
    Ma, Nan
    Xu, Tongkai
    Xu, Cheng
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART H-JOURNAL OF ENGINEERING IN MEDICINE, 2024, 238 (02) : 115 - 131
  • [29] Transformer-Based Deep Learning Network for Tooth Segmentation on Panoramic Radiographs
    Sheng Chen
    Wang Lin
    Huang Zhenhuan
    Wang Tian
    Guo Yalin
    Hou Wenjie
    Xu Laiqing
    Wang Jiazhu
    Yan Xue
    JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY, 2023, 36 (01) : 257 - 272
  • [30] Transformer-Based Deep Learning Network for Tooth Segmentation on Panoramic Radiographs
    Chen Sheng
    Lin Wang
    Zhenhuan Huang
    Tian Wang
    Yalin Guo
    Wenjie Hou
    Laiqing Xu
    Jiazhu Wang
    Xue Yan
    Journal of Systems Science and Complexity, 2023, 36 : 257 - 272