Few-shot defect segmentation based on cross-modal attention aggregation and adaptive prototype generation network

被引：0

作者：

Liu, Shi-Tong ^{[1
]}

Zhang, Yun-Zhou ^{[1
]}

Shan, De-Xing ^{[1
]}

Jin, Yang ^{[1
]}

Ning, Jian ^{[1
]}

机构：

[1] College of Information Science and Engineering, Northeastern University, Shenyang,110819, China

来源：

Kongzhi yu Juece/Control and Decision | 2024年 / 39卷 / 11期

关键词：

Gluing - Modal analysis - Point defects - Query processing - Semantic Segmentation;

D O I：

10.13195/j.kzyjc.2023.1006

中图分类号：

学科分类号：

摘要：

Defect segmentation technology based on deep learning is crucial to ensure production efficiency and improve product quality. However, there are many areas in which large-scale defect samples cannot be collected in applications, resulting in a sharp decline in the performance of traditional detection methods. In addition, defect regions suffer from small size, weak texture information, and inconspicuous contrast with non-defect regions, which hinder the application of visual detection techniques. This paper proposes a multi-modal few-shot defect segmentation method based on vision and point cloud. Cross-modal attention is used to aggregate RGB semantic information and point cloud structure information to achieve efficient fusion of the two modalities. Then, basic foreground prototypes, adaptive background prototypes and forgetting compensation prototypes are generated by combining multi-modal features and masks to improve representation ability, dynamically match prototypes and query features according to the similarity, and complete effective segmentation of unseen object defects after feature enrichment. Experiments on two few-shot defect segmentation datasets, Defect-3i and Mvtec 3D-2i, show that the mean Intersection-over-Union (mIoU) in 1-shot and 5-shot settings exceeds other advanced algorithms by 0.11 % and 0.20 %, 5.23 % and 5.10 %, respectively, verifying the rationality of the proposed few-shot architecture and the advancement of the multi-modal network. © 2024 Northeast University. All rights reserved.

引用

页码：3655 / 3663

共 50 条

[21] Psanet: prototype-guided salient attention for few-shot segmentation
Li, Hao
Huang, Guoheng
Yuan, Xiaochen
Zheng, Zewen
Chen, Xuhang
Zhong, Guo
Pun, Chi-Man
VISUAL COMPUTER, 2025, 41 (04): : 2987 - 3001
[22] Cross-modal augmentation for few-shot multimodal fake news detection
Jiang, Ye
Wang, Taihang
Xu, Xiaoman
Wang, Yimin
Song, Xingyi
Maynard, Diana
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 142
[23] Attention and Adaptive Bilinear Matching Network for Cross-Domain Few-Shot Defect Classification of Industrial Parts
Sa, Liangbing
Yu, Chongchong
Chen, Ziyan
Zhao, Xia
Yang, Yafeng
2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
[24] Cross-modal de-deviation for enhancing few-shot classification
Pan, Mei -Hong
Shen, Hong -Bin
PATTERN RECOGNITION, 2024, 152
[25] CobNet: Cross Attention on Object and Background for Few-Shot Segmentation
Guan, Haoyan
Michael, Spratling
2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 39 - 45
[26] Cross-modal guides spatio-temporal enrichment network for few-shot action recognition
Chen, Zhiwen
Yang, Yi
Li, Li
Li, Min
APPLIED INTELLIGENCE, 2024, 54 (22) : 11196 - 11211
[27] Dense Cross-Query-and-Support Attention Weighted Mask Aggregation for Few-Shot Segmentation
Shi, Xinyu
Wei, Dong
Zhang, Yu
Lu, Donghuan
Ning, Munan
Chen, Jiashun
Ma, Kai
Zheng, Yefeng
COMPUTER VISION, ECCV 2022, PT XX, 2022, 13680 : 151 - 168
[28] DCMA-Net: dual cross-modal attention for fine-grained few-shot recognition
Yan Zhou
Xiao Ren
Jianxun Li
Yin Yang
Haibin Zhou
Multimedia Tools and Applications, 2024, 83 : 14521 - 14537
[29] DCMA-Net: dual cross-modal attention for fine-grained few-shot recognition
Zhou, Yan
Ren, Xiao
Li, Jianxun
Yang, Yin
Zhou, Haibin
MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (05) : 14521 - 14537
[30] MFANet: Multifeature Aggregation Network for Cross-Granularity Few-Shot Seamless Steel Tubes Surface Defect Segmentation
Song, Kechen
Feng, Hu
Cao, Tonglei
Cui, Wenqi
Yan, Yunhui
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (07) : 9725 - 9735

← 1 2 3 4 5 →