Few-shot defect segmentation based on cross-modal attention aggregation and adaptive prototype generation network

被引:0
|
作者
Liu, Shi-Tong [1 ]
Zhang, Yun-Zhou [1 ]
Shan, De-Xing [1 ]
Jin, Yang [1 ]
Ning, Jian [1 ]
机构
[1] College of Information Science and Engineering, Northeastern University, Shenyang,110819, China
来源
Kongzhi yu Juece/Control and Decision | 2024年 / 39卷 / 11期
关键词
Gluing - Modal analysis - Point defects - Query processing - Semantic Segmentation;
D O I
10.13195/j.kzyjc.2023.1006
中图分类号
学科分类号
摘要
Defect segmentation technology based on deep learning is crucial to ensure production efficiency and improve product quality. However, there are many areas in which large-scale defect samples cannot be collected in applications, resulting in a sharp decline in the performance of traditional detection methods. In addition, defect regions suffer from small size, weak texture information, and inconspicuous contrast with non-defect regions, which hinder the application of visual detection techniques. This paper proposes a multi-modal few-shot defect segmentation method based on vision and point cloud. Cross-modal attention is used to aggregate RGB semantic information and point cloud structure information to achieve efficient fusion of the two modalities. Then, basic foreground prototypes, adaptive background prototypes and forgetting compensation prototypes are generated by combining multi-modal features and masks to improve representation ability, dynamically match prototypes and query features according to the similarity, and complete effective segmentation of unseen object defects after feature enrichment. Experiments on two few-shot defect segmentation datasets, Defect-3i and Mvtec 3D-2i, show that the mean Intersection-over-Union (mIoU) in 1-shot and 5-shot settings exceeds other advanced algorithms by 0.11 % and 0.20 %, 5.23 % and 5.10 %, respectively, verifying the rationality of the proposed few-shot architecture and the advancement of the multi-modal network. © 2024 Northeast University. All rights reserved.
引用
收藏
页码:3655 / 3663
相关论文
共 50 条
  • [41] Few-Shot Learning with Visual Distribution Calibration and Cross-Modal Distribution Alignment
    Wang, Runqi
    Zheng, Hao
    Duan, Xiaoyue
    Liu, Jianzhuang
    Lu, Yuning
    Wang, Tian
    Xu, Songcen
    Zhang, Baochang
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 23445 - 23454
  • [42] Cross-Modal Feature Distribution Calibration for Few-Shot Visual Question Answering
    Zhang, Jing
    Liu, Xiaoqiang
    Chen, Mingzhe
    Wang, Zhe
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 7, 2024, : 7151 - 7159
  • [43] Few-Shot Image and Sentence Matching via Aligned Cross-Modal Memory
    Huang, Yan
    Wang, Jingdong
    Wang, Liang
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (06) : 2968 - 2983
  • [44] Multimodality Helps Unimodality: Cross-Modal Few-Shot Learning with Multimodal Models
    Lin, Zhiqiu
    Yu, Samuel
    Kuang, Zhiyi
    Pathak, Deepak
    Ramanan, Deva
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 19325 - 19337
  • [45] ACMM: Aligned Cross-Modal Memory for Few-Shot Image and Sentence Matching
    Huang, Yan
    Wang, Liang
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 5773 - 5782
  • [46] Adaptive FSS: A Novel Few-Shot Segmentation Framework via Prototype Enhancement
    Wang, Jing
    Li, Jinagyun
    Chen, Chen
    Zhang, Yisi
    Shen, Haoran
    Zhang, Tianxiang
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 6, 2024, : 5463 - 5471
  • [47] Adaptive class augmented prototype network for few-shot relation extraction
    Li, Rongzhen
    Zhong, Jiang
    Hu, Wenyue
    Dai, Qizhu
    Wang, Chen
    Wang, Wenzhu
    Li, Xue
    NEURAL NETWORKS, 2024, 169 : 134 - 142
  • [48] Adaptive Prototype Interaction Network for Few-Shot Knowledge Graph Completion
    Li, Yuling
    Yu, Kui
    Zhang, Yuhong
    Liang, Jiye
    Wu, Xindong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (11) : 15237 - 15250
  • [49] Adaptive Prototype Interaction Network for Few-Shot Knowledge Graph Completion
    Li, Yuling
    Yu, Kui
    Zhang, Yuhong
    Liang, Jiye
    Wu, Xindong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (11) : 15237 - 15250
  • [50] Few-shot defect classification via feature aggregation based on graph neural network
    Zhang, Pengcheng
    Zheng, Peixiao
    Guo, Xin
    Chen, Enqing
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 101