Few-shot defect segmentation based on cross-modal attention aggregation and adaptive prototype generation network

被引：0

作者：

Liu, Shi-Tong ^{[1
]}

Zhang, Yun-Zhou ^{[1
]}

Shan, De-Xing ^{[1
]}

Jin, Yang ^{[1
]}

Ning, Jian ^{[1
]}

机构：

[1] College of Information Science and Engineering, Northeastern University, Shenyang,110819, China

来源：

Kongzhi yu Juece/Control and Decision | 2024年 / 39卷 / 11期

关键词：

Gluing - Modal analysis - Point defects - Query processing - Semantic Segmentation;

D O I：

10.13195/j.kzyjc.2023.1006

中图分类号：

学科分类号：

摘要：

Defect segmentation technology based on deep learning is crucial to ensure production efficiency and improve product quality. However, there are many areas in which large-scale defect samples cannot be collected in applications, resulting in a sharp decline in the performance of traditional detection methods. In addition, defect regions suffer from small size, weak texture information, and inconspicuous contrast with non-defect regions, which hinder the application of visual detection techniques. This paper proposes a multi-modal few-shot defect segmentation method based on vision and point cloud. Cross-modal attention is used to aggregate RGB semantic information and point cloud structure information to achieve efficient fusion of the two modalities. Then, basic foreground prototypes, adaptive background prototypes and forgetting compensation prototypes are generated by combining multi-modal features and masks to improve representation ability, dynamically match prototypes and query features according to the similarity, and complete effective segmentation of unseen object defects after feature enrichment. Experiments on two few-shot defect segmentation datasets, Defect-3i and Mvtec 3D-2i, show that the mean Intersection-over-Union (mIoU) in 1-shot and 5-shot settings exceeds other advanced algorithms by 0.11 % and 0.20 %, 5.23 % and 5.10 %, respectively, verifying the rationality of the proposed few-shot architecture and the advancement of the multi-modal network. © 2024 Northeast University. All rights reserved.

引用

页码：3655 / 3663

共 50 条

[1] Adaptive Cross-Modal Few-shot Learning
Xing, Chen
Rostamzadeh, Negar
Oreshkin, Boris N.
Pinheiro, Pedro O.
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
[2] Few-shot activity recognition with cross-modal memory network
Zhang, Lingling
Chang, Xiaojun
Liu, Jun
Luo, Minnan
Prakash, Mahesh
Hauptmann, Alexander G.
PATTERN RECOGNITION, 2020, 108
[3] Cross Position Aggregation Network for Few-Shot Strip Steel Surface Defect Segmentation
Feng, Hu
Song, Kechen
Cui, Wenqi
Zhang, Yiming
Yan, Yunhui
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
[4] Selective Prototype Network for Few-Shot Metal Surface Defect Segmentation
Yu, Ruiyun
Guo, Bingyang
Yang, Kang
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
[5] Holistic Prototype Attention Network for Few-Shot Video Object Segmentation
Tang, Yin
Chen, Tao
Jiang, Xiruo
Yao, Yazhou
Xie, Guo-Sen
Shen, Heng-Tao
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (08) : 6699 - 6709
[6] Multiscale Adaptive Prototype Transformer Network for Few-Shot Strip Steel Surface Defect Segmentation
Huang, Jiacheng
Wu, Yong
Zhou, Xiaofei
Lin, Jia
Chen, Zhangping
Zhang, Guodao
Xia, Lei
Zhang, Jiyong
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2025, 74
[7] Intermediate prototype network for few-shot segmentation
Luo, Xiaoliu
Duan, Zhao
Zhang, Taiping
SIGNAL PROCESSING, 2023, 203
[8] Cross-Modal Contrastive Learning Network for Few-Shot Action Recognition
Wang, Xiao
Yan, Yan
Hu, Hai-Miao
Li, Bo
Wang, Hanzi
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 1257 - 1271
[9] POEM: A prototype cross and emphasis network for few-shot semantic segmentation
Cheng, Xu
Li, Haoyuan
Deng, Shuya
Peng, Yonghong
COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 234
[10] A Transformer-based Adaptive Prototype Matching Network for Few-Shot Semantic Segmentation
Chen, Sihan
Chen, Yadang
Zheng, Yuhui
Yang, Zhi-Xin
Wu, Enhua
PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 659 - 667

← 1 2 3 4 5 →