Weighted Sparse Convolution and Transformer Feature Aggregation Networks for 3D Dental Segmentation

被引:0
|
作者
Ahn, Jung Su [1 ]
Cho, Young-Rae [2 ,3 ]
机构
[1] Yonsei Univ Mirae Campus, Grad Sch Comp Sci, Wonju 26493, Gangwon Do, South Korea
[2] Yonsei Univ Mirae Campus, Dept Software, Wonju 26493, Gangwon Do, South Korea
[3] Yonsei Univ Mirae Campus, Dept Digital Healthcare, Wonju 26493, Gangwon Do, South Korea
来源
IEEE ACCESS | 2024年 / 12卷
基金
新加坡国家研究基金会;
关键词
3D dental images; intraoral scanner; segmentation; transformer; sparse convolution;
D O I
10.1109/ACCESS.2024.3462521
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The conventional alginate technique, widely employed in dentistry to capture tooth morphology, has faced challenges, particularly due to potential discomfort and the risk of allergy reactions among specific patient groups. Consequently, 3D intraoral scanners (IOS), enabling contactless acquisition of dental shapes, have gained widespread adoption. However, for tooth segmentation in 3D dental images obtained through IOS, the majority of methods heavily rely on labor-intensive annotation of high-quality datasets. In this study, we introduce the Weighted Sparse Convolution and Transformer Feature Aggregation Network (WCTN) as a model designed for the segmentation of teeth within 3D dental datasets. The voxel-grid partitioning mechanism in this model efficiently clusters point clouds to extract features with minimal resource usage. Employing weighted sparse convolution operations, WCTN extracts local features from grouped points, followed by sequential capturing of global features through Transformer modules. An adaptive feature fusion strategy was devised, seamlessly combining local and global features to yield robust representations, particularly optimized for 3D dental datasets with uniform density. We evaluated three different versions of WCTN, distinguished by the number of features. Among them, the largest-scale model exhibited superior performance, compared to graph-based models and Transformer-based models with the overall accuracy improvements of 1.89% and 7.18%, respectively. This result highlights the outstanding performance achieved by aggregating diverse features. In conclusion, the proposed model possesses the potential to expedite and automate tooth segmentation tasks, promising to enhance current clinical practices.
引用
收藏
页码:135172 / 135184
页数:13
相关论文
共 50 条
  • [21] Supervoxel Convolution for Online 3D Semantic Segmentation
    Huang, Shi-Sheng
    Ma, Ze-Yu
    Mu, Tai-Jiang
    Fu, Hongbo
    Hu, Shi-Min
    ACM TRANSACTIONS ON GRAPHICS, 2021, 40 (03):
  • [22] Video Object Segmentation with 3D Convolution Network
    Tang, Huiyun
    Tao, Pin
    Ma, Rui
    Shi, Yuanchun
    ICCCV 2019: PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON CONTROL AND COMPUTER VISION, 2019, : 28 - 32
  • [23] Dilated Transformer with Feature Aggregation Module for Action Segmentation
    Du, Zexing
    Wang, Qing
    NEURAL PROCESSING LETTERS, 2023, 55 (05) : 6181 - 6197
  • [24] Dilated Transformer with Feature Aggregation Module for Action Segmentation
    Zexing Du
    Qing Wang
    Neural Processing Letters, 2023, 55 : 6181 - 6197
  • [25] 3D Medical Axial Transformer: A Lightweight Transformer Model for 3D Brain Tumor Segmentation
    Liu, Cheng
    Kiryu, Hisanori
    MEDICAL IMAGING WITH DEEP LEARNING, VOL 227, 2023, 227 : 799 - 813
  • [26] 3D point cloud classification and segmentation based on dual attention and weighted dynamic graph convolution
    Xiao, Jian
    Wang, Xiaohong
    Li, Wei
    Yang, Yifei
    Luo, Ji
    Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2024, 32 (18): : 2823 - 2835
  • [27] Hierarchical Aggregation for 3D Instance Segmentation
    Chen, Shaoyu
    Fang, Jiemin
    Zhang, Qian
    Liu, Wenyu
    Wang, Xinggang
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 15447 - 15456
  • [28] AUTOMATIC SEGMENTATION FOR 3D DENTAL RECONSTRUCTION
    Pavaloiu, Ionel-Bujorel
    Goga, Nicolae
    Marin, Iuliana
    Vasilateanu, Andrei
    2015 6TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2015, : 216 - 221
  • [29] Virtual Sparse Convolution for Multimodal 3D Object Detection
    Wu, Hai
    Wen, Chenglu
    Shi, Shaoshuai
    Li, Xin
    Wang, Cheng
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 21653 - 21662
  • [30] PCSCNet: Fast 3D semantic segmentation of LiDAR point cloud for autonomous car using point convolution and sparse convolution network
    Park, Jaehyun
    Kim, Chansoo
    Kim, Soyeong
    Jo, Kichun
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 212