Weighted Sparse Convolution and Transformer Feature Aggregation Networks for 3D Dental Segmentation

被引:0
|
作者
Ahn, Jung Su [1 ]
Cho, Young-Rae [2 ,3 ]
机构
[1] Yonsei Univ Mirae Campus, Grad Sch Comp Sci, Wonju 26493, Gangwon Do, South Korea
[2] Yonsei Univ Mirae Campus, Dept Software, Wonju 26493, Gangwon Do, South Korea
[3] Yonsei Univ Mirae Campus, Dept Digital Healthcare, Wonju 26493, Gangwon Do, South Korea
来源
IEEE ACCESS | 2024年 / 12卷
基金
新加坡国家研究基金会;
关键词
3D dental images; intraoral scanner; segmentation; transformer; sparse convolution;
D O I
10.1109/ACCESS.2024.3462521
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The conventional alginate technique, widely employed in dentistry to capture tooth morphology, has faced challenges, particularly due to potential discomfort and the risk of allergy reactions among specific patient groups. Consequently, 3D intraoral scanners (IOS), enabling contactless acquisition of dental shapes, have gained widespread adoption. However, for tooth segmentation in 3D dental images obtained through IOS, the majority of methods heavily rely on labor-intensive annotation of high-quality datasets. In this study, we introduce the Weighted Sparse Convolution and Transformer Feature Aggregation Network (WCTN) as a model designed for the segmentation of teeth within 3D dental datasets. The voxel-grid partitioning mechanism in this model efficiently clusters point clouds to extract features with minimal resource usage. Employing weighted sparse convolution operations, WCTN extracts local features from grouped points, followed by sequential capturing of global features through Transformer modules. An adaptive feature fusion strategy was devised, seamlessly combining local and global features to yield robust representations, particularly optimized for 3D dental datasets with uniform density. We evaluated three different versions of WCTN, distinguished by the number of features. Among them, the largest-scale model exhibited superior performance, compared to graph-based models and Transformer-based models with the overall accuracy improvements of 1.89% and 7.18%, respectively. This result highlights the outstanding performance achieved by aggregating diverse features. In conclusion, the proposed model possesses the potential to expedite and automate tooth segmentation tasks, promising to enhance current clinical practices.
引用
收藏
页码:135172 / 135184
页数:13
相关论文
共 50 条
  • [31] FFA-Net: fast feature aggregation network for 3D point cloud segmentation
    Ruting Cheng
    Hui Zeng
    Baoqing Zhang
    Xuan Wang
    Tianmeng Zhao
    Machine Vision and Applications, 2023, 34
  • [32] FFA-Net: fast feature aggregation network for 3D point cloud segmentation
    Cheng, Ruting
    Zeng, Hui
    Zhang, Baoqing
    Wang, Xuan
    Zhao, Tianmeng
    MACHINE VISION AND APPLICATIONS, 2023, 34 (05)
  • [33] Stratified Transformer for 3D Point Cloud Segmentation
    Lai, Xin
    Liu, Jianhui
    Jiang, Li
    Wang, Liwei
    Zhao, Hengshuang
    Liu, Shu
    Qi, Xiaojuan
    Jia, Jiaya
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 8490 - 8499
  • [34] Semantic segmentation of 3D point cloud based on boundary point estimation and sparse convolution neural network
    Yang J.
    Zhang C.
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2024, 58 (06): : 1121 - 1132
  • [35] Superpoint Transformer for 3D Scene Instance Segmentation
    Sun, Jiahao
    Qing, Chunmei
    Tan, Junpeng
    Xu, Xiangmin
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 2, 2023, : 2393 - 2401
  • [36] Efficient 3D Semantic Segmentation with Superpoint Transformer
    Robert, Damien
    Raguet, Hugo
    Landrieu, Loic
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 17149 - 17158
  • [37] Query Refinement Transformer for 3D Instance Segmentation
    Lu, Jiahao
    Deng, Jiacheng
    Wang, Chuxin
    He, Jianfeng
    Zhang, Tianzhu
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 18470 - 18480
  • [38] A 3D Palmprint Recognition Method based on Local Sparse Representation and Weighted Shape Index Feature
    Yang, Dongliang
    Song, Changjiang
    Gao, Fengjiao
    Wu, Gang
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 4537 - 4540
  • [39] Dynamic Convolution for 3D Point Cloud Instance Segmentation
    He, Tong
    Shen, Chunhua
    van den Hengel, Anton
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (05) : 5697 - 5711
  • [40] Light3DHS: A lightweight 3D hippocampus segmentation method using multiscale convolution attention and vision transformer
    Xiao, Zhiyong
    Zhang, Yuhong
    Deng, Zhaohong
    Liu, Fei
    NEUROIMAGE, 2024, 292