Weighted Sparse Convolution and Transformer Feature Aggregation Networks for 3D Dental Segmentation

被引：0

作者：

Ahn, Jung Su ^{[1
]}

Cho, Young-Rae ^{[2
,3
]}

机构：

[1] Yonsei Univ Mirae Campus, Grad Sch Comp Sci, Wonju 26493, Gangwon Do, South Korea

[2] Yonsei Univ Mirae Campus, Dept Software, Wonju 26493, Gangwon Do, South Korea

[3] Yonsei Univ Mirae Campus, Dept Digital Healthcare, Wonju 26493, Gangwon Do, South Korea

来源：

IEEE ACCESS | 2024年 / 12卷

基金：

新加坡国家研究基金会;

关键词：

3D dental images; intraoral scanner; segmentation; transformer; sparse convolution;

D O I：

10.1109/ACCESS.2024.3462521

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The conventional alginate technique, widely employed in dentistry to capture tooth morphology, has faced challenges, particularly due to potential discomfort and the risk of allergy reactions among specific patient groups. Consequently, 3D intraoral scanners (IOS), enabling contactless acquisition of dental shapes, have gained widespread adoption. However, for tooth segmentation in 3D dental images obtained through IOS, the majority of methods heavily rely on labor-intensive annotation of high-quality datasets. In this study, we introduce the Weighted Sparse Convolution and Transformer Feature Aggregation Network (WCTN) as a model designed for the segmentation of teeth within 3D dental datasets. The voxel-grid partitioning mechanism in this model efficiently clusters point clouds to extract features with minimal resource usage. Employing weighted sparse convolution operations, WCTN extracts local features from grouped points, followed by sequential capturing of global features through Transformer modules. An adaptive feature fusion strategy was devised, seamlessly combining local and global features to yield robust representations, particularly optimized for 3D dental datasets with uniform density. We evaluated three different versions of WCTN, distinguished by the number of features. Among them, the largest-scale model exhibited superior performance, compared to graph-based models and Transformer-based models with the overall accuracy improvements of 1.89% and 7.18%, respectively. This result highlights the outstanding performance achieved by aggregating diverse features. In conclusion, the proposed model possesses the potential to expedite and automate tooth segmentation tasks, promising to enhance current clinical practices.

引用

页码：135172 / 135184

页数：13

共 50 条

[21] Supervoxel Convolution for Online 3D Semantic Segmentation
Huang, Shi-Sheng
Ma, Ze-Yu
Mu, Tai-Jiang
Fu, Hongbo
Hu, Shi-Min
ACM TRANSACTIONS ON GRAPHICS, 2021, 40 (03):
[22] Video Object Segmentation with 3D Convolution Network
Tang, Huiyun
Tao, Pin
Ma, Rui
Shi, Yuanchun
ICCCV 2019: PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON CONTROL AND COMPUTER VISION, 2019, : 28 - 32
[23] Dilated Transformer with Feature Aggregation Module for Action Segmentation
Du, Zexing
Wang, Qing
NEURAL PROCESSING LETTERS, 2023, 55 (05) : 6181 - 6197
[24] Dilated Transformer with Feature Aggregation Module for Action Segmentation
Zexing Du
Qing Wang
Neural Processing Letters, 2023, 55 : 6181 - 6197
[25] 3D Medical Axial Transformer: A Lightweight Transformer Model for 3D Brain Tumor Segmentation
Liu, Cheng
Kiryu, Hisanori
MEDICAL IMAGING WITH DEEP LEARNING, VOL 227, 2023, 227 : 799 - 813
[26] 3D point cloud classification and segmentation based on dual attention and weighted dynamic graph convolution
Xiao, Jian
Wang, Xiaohong
Li, Wei
Yang, Yifei
Luo, Ji
Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2024, 32 (18): : 2823 - 2835
[27] Hierarchical Aggregation for 3D Instance Segmentation
Chen, Shaoyu
Fang, Jiemin
Zhang, Qian
Liu, Wenyu
Wang, Xinggang
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 15447 - 15456
[28] AUTOMATIC SEGMENTATION FOR 3D DENTAL RECONSTRUCTION
Pavaloiu, Ionel-Bujorel
Goga, Nicolae
Marin, Iuliana
Vasilateanu, Andrei
2015 6TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2015, : 216 - 221
[29] Virtual Sparse Convolution for Multimodal 3D Object Detection
Wu, Hai
Wen, Chenglu
Shi, Shaoshuai
Li, Xin
Wang, Cheng
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 21653 - 21662
[30] PCSCNet: Fast 3D semantic segmentation of LiDAR point cloud for autonomous car using point convolution and sparse convolution network
Park, Jaehyun
Kim, Chansoo
Kim, Soyeong
Jo, Kichun
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 212

← 1 2 3 4 5 →