Few-shot defect segmentation based on cross-modal attention aggregation and adaptive prototype generation network

被引:0
|
作者
Liu, Shi-Tong [1 ]
Zhang, Yun-Zhou [1 ]
Shan, De-Xing [1 ]
Jin, Yang [1 ]
Ning, Jian [1 ]
机构
[1] College of Information Science and Engineering, Northeastern University, Shenyang,110819, China
来源
Kongzhi yu Juece/Control and Decision | 2024年 / 39卷 / 11期
关键词
Gluing - Modal analysis - Point defects - Query processing - Semantic Segmentation;
D O I
10.13195/j.kzyjc.2023.1006
中图分类号
学科分类号
摘要
Defect segmentation technology based on deep learning is crucial to ensure production efficiency and improve product quality. However, there are many areas in which large-scale defect samples cannot be collected in applications, resulting in a sharp decline in the performance of traditional detection methods. In addition, defect regions suffer from small size, weak texture information, and inconspicuous contrast with non-defect regions, which hinder the application of visual detection techniques. This paper proposes a multi-modal few-shot defect segmentation method based on vision and point cloud. Cross-modal attention is used to aggregate RGB semantic information and point cloud structure information to achieve efficient fusion of the two modalities. Then, basic foreground prototypes, adaptive background prototypes and forgetting compensation prototypes are generated by combining multi-modal features and masks to improve representation ability, dynamically match prototypes and query features according to the similarity, and complete effective segmentation of unseen object defects after feature enrichment. Experiments on two few-shot defect segmentation datasets, Defect-3i and Mvtec 3D-2i, show that the mean Intersection-over-Union (mIoU) in 1-shot and 5-shot settings exceeds other advanced algorithms by 0.11 % and 0.20 %, 5.23 % and 5.10 %, respectively, verifying the rationality of the proposed few-shot architecture and the advancement of the multi-modal network. © 2024 Northeast University. All rights reserved.
引用
收藏
页码:3655 / 3663
相关论文
共 50 条
  • [21] Psanet: prototype-guided salient attention for few-shot segmentation
    Li, Hao
    Huang, Guoheng
    Yuan, Xiaochen
    Zheng, Zewen
    Chen, Xuhang
    Zhong, Guo
    Pun, Chi-Man
    VISUAL COMPUTER, 2025, 41 (04): : 2987 - 3001
  • [22] Cross-modal augmentation for few-shot multimodal fake news detection
    Jiang, Ye
    Wang, Taihang
    Xu, Xiaoman
    Wang, Yimin
    Song, Xingyi
    Maynard, Diana
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 142
  • [23] Attention and Adaptive Bilinear Matching Network for Cross-Domain Few-Shot Defect Classification of Industrial Parts
    Sa, Liangbing
    Yu, Chongchong
    Chen, Ziyan
    Zhao, Xia
    Yang, Yafeng
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [24] Cross-modal de-deviation for enhancing few-shot classification
    Pan, Mei -Hong
    Shen, Hong -Bin
    PATTERN RECOGNITION, 2024, 152
  • [25] CobNet: Cross Attention on Object and Background for Few-Shot Segmentation
    Guan, Haoyan
    Michael, Spratling
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 39 - 45
  • [26] Cross-modal guides spatio-temporal enrichment network for few-shot action recognition
    Chen, Zhiwen
    Yang, Yi
    Li, Li
    Li, Min
    APPLIED INTELLIGENCE, 2024, 54 (22) : 11196 - 11211
  • [27] Dense Cross-Query-and-Support Attention Weighted Mask Aggregation for Few-Shot Segmentation
    Shi, Xinyu
    Wei, Dong
    Zhang, Yu
    Lu, Donghuan
    Ning, Munan
    Chen, Jiashun
    Ma, Kai
    Zheng, Yefeng
    COMPUTER VISION, ECCV 2022, PT XX, 2022, 13680 : 151 - 168
  • [28] DCMA-Net: dual cross-modal attention for fine-grained few-shot recognition
    Yan Zhou
    Xiao Ren
    Jianxun Li
    Yin Yang
    Haibin Zhou
    Multimedia Tools and Applications, 2024, 83 : 14521 - 14537
  • [29] DCMA-Net: dual cross-modal attention for fine-grained few-shot recognition
    Zhou, Yan
    Ren, Xiao
    Li, Jianxun
    Yang, Yin
    Zhou, Haibin
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (05) : 14521 - 14537
  • [30] MFANet: Multifeature Aggregation Network for Cross-Granularity Few-Shot Seamless Steel Tubes Surface Defect Segmentation
    Song, Kechen
    Feng, Hu
    Cao, Tonglei
    Cui, Wenqi
    Yan, Yunhui
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (07) : 9725 - 9735