NN2ViT: Neural Networks and Vision Transformers based approach for Visual Anomaly Detection in Industrial Images

被引:0
|
作者
Wahid, Junaid Abdul [1 ]
Ayoub, Muhammad [2 ]
Xu, Mingliang [1 ]
Jiang, Xiaoheng [1 ]
Shi, Lei [3 ]
Hussain, Shabir [4 ]
机构
[1] Zhengzhou Univ, Sch Comp Sci & Artificial Intelligence, Zhengzhou 450001, Peoples R China
[2] Shanghai Jiao Tong Univ, Sch Comp Sci & Engn, Shanghai, Peoples R China
[3] Zhengzhou Univ, Sch Cyber Space & Secur, Zhengzhou 450002, Peoples R China
[4] Tsinghua Univ, Inst Biopharmaceut & Hlth Engn, Tsinghua Shenzhen Int Grad Sch, Shenzhen, Peoples R China
关键词
Anomaly detection; Anomaly segmentation; Industrial data; Neural network; Vision transformers;
D O I
10.1016/j.neucom.2024.128845
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Ensuring product quality through automated anomaly detection is crucial in manufacturing. Traditional methods often struggle to capture both local and global features effectively, relying heavily on predefined templates that limit their adaptability and accuracy. To address these challenges, this study propose NN2ViT, a novel approach that integrates a Single Shot Detector (SSD) for local feature detection and the Segment Anything Model (SAM) for global feature segmentation. This integration allows fora comprehensive analysis of anomalies in industrial images. Our method improves anomaly segmentation performance by fine-tuning SAM for precise segmentation in industrial product images. Experiments on the MVTec benchmark dataset demonstrate that NN2ViT outperforms traditional models and achieved the highest 95.54% and 96.23% Image AUROC and AP scores, respectively thus enhancing interpretability and adaptability to various anomaly patterns. This research presents a significant advancement in manufacturing quality control, contributing to improved product quality and operational efficiency.
引用
收藏
页数:13
相关论文
共 22 条
  • [21] A novel invasive plant detection approach using time series images from unmanned aerial systems based on convolutional and recurrent neural networks
    Yanhui Guo
    Yun Zhao
    Thomas A. Rothfus
    Adam S. Avalos
    Neural Computing and Applications, 2022, 34 : 20135 - 20147
  • [22] Weld seam object detection system based on the fusion of 2D images and 3D point clouds using interpretable neural networks
    Wang, Shengbo
    Li, Zengxu
    Chen, Guodong
    Yue, Yaobin
    SCIENTIFIC REPORTS, 2024, 14 (01):