Few-shot defect segmentation based on cross-modal attention aggregation and adaptive prototype generation network

被引:0
|
作者
Liu, Shi-Tong [1 ]
Zhang, Yun-Zhou [1 ]
Shan, De-Xing [1 ]
Jin, Yang [1 ]
Ning, Jian [1 ]
机构
[1] College of Information Science and Engineering, Northeastern University, Shenyang,110819, China
来源
Kongzhi yu Juece/Control and Decision | 2024年 / 39卷 / 11期
关键词
Gluing - Modal analysis - Point defects - Query processing - Semantic Segmentation;
D O I
10.13195/j.kzyjc.2023.1006
中图分类号
学科分类号
摘要
Defect segmentation technology based on deep learning is crucial to ensure production efficiency and improve product quality. However, there are many areas in which large-scale defect samples cannot be collected in applications, resulting in a sharp decline in the performance of traditional detection methods. In addition, defect regions suffer from small size, weak texture information, and inconspicuous contrast with non-defect regions, which hinder the application of visual detection techniques. This paper proposes a multi-modal few-shot defect segmentation method based on vision and point cloud. Cross-modal attention is used to aggregate RGB semantic information and point cloud structure information to achieve efficient fusion of the two modalities. Then, basic foreground prototypes, adaptive background prototypes and forgetting compensation prototypes are generated by combining multi-modal features and masks to improve representation ability, dynamically match prototypes and query features according to the similarity, and complete effective segmentation of unseen object defects after feature enrichment. Experiments on two few-shot defect segmentation datasets, Defect-3i and Mvtec 3D-2i, show that the mean Intersection-over-Union (mIoU) in 1-shot and 5-shot settings exceeds other advanced algorithms by 0.11 % and 0.20 %, 5.23 % and 5.10 %, respectively, verifying the rationality of the proposed few-shot architecture and the advancement of the multi-modal network. © 2024 Northeast University. All rights reserved.
引用
收藏
页码:3655 / 3663
相关论文
共 50 条
  • [1] Adaptive Cross-Modal Few-shot Learning
    Xing, Chen
    Rostamzadeh, Negar
    Oreshkin, Boris N.
    Pinheiro, Pedro O.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [2] Few-shot activity recognition with cross-modal memory network
    Zhang, Lingling
    Chang, Xiaojun
    Liu, Jun
    Luo, Minnan
    Prakash, Mahesh
    Hauptmann, Alexander G.
    PATTERN RECOGNITION, 2020, 108
  • [3] Cross Position Aggregation Network for Few-Shot Strip Steel Surface Defect Segmentation
    Feng, Hu
    Song, Kechen
    Cui, Wenqi
    Zhang, Yiming
    Yan, Yunhui
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [4] Selective Prototype Network for Few-Shot Metal Surface Defect Segmentation
    Yu, Ruiyun
    Guo, Bingyang
    Yang, Kang
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
  • [5] Holistic Prototype Attention Network for Few-Shot Video Object Segmentation
    Tang, Yin
    Chen, Tao
    Jiang, Xiruo
    Yao, Yazhou
    Xie, Guo-Sen
    Shen, Heng-Tao
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (08) : 6699 - 6709
  • [6] Multiscale Adaptive Prototype Transformer Network for Few-Shot Strip Steel Surface Defect Segmentation
    Huang, Jiacheng
    Wu, Yong
    Zhou, Xiaofei
    Lin, Jia
    Chen, Zhangping
    Zhang, Guodao
    Xia, Lei
    Zhang, Jiyong
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2025, 74
  • [7] Intermediate prototype network for few-shot segmentation
    Luo, Xiaoliu
    Duan, Zhao
    Zhang, Taiping
    SIGNAL PROCESSING, 2023, 203
  • [8] Cross-Modal Contrastive Learning Network for Few-Shot Action Recognition
    Wang, Xiao
    Yan, Yan
    Hu, Hai-Miao
    Li, Bo
    Wang, Hanzi
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 1257 - 1271
  • [9] POEM: A prototype cross and emphasis network for few-shot semantic segmentation
    Cheng, Xu
    Li, Haoyuan
    Deng, Shuya
    Peng, Yonghong
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 234
  • [10] A Transformer-based Adaptive Prototype Matching Network for Few-Shot Semantic Segmentation
    Chen, Sihan
    Chen, Yadang
    Zheng, Yuhui
    Yang, Zhi-Xin
    Wu, Enhua
    PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 659 - 667