Unsupervised Visual Anomaly Detection Using Self-Supervised Pre-Trained Transformer

被引:0
|
作者
Kim, Jun-Hyung [1 ]
Kwon, Goo-Rak [1 ]
机构
[1] Chosun Univ, Dept Informat & Commun Engn, Gwangju 61452, South Korea
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Image reconstruction; Image segmentation; Transformers; Computational modeling; Location awareness; Feature extraction; Anomaly detection; Data augmentation; Self-supervised learning; data-augmentation; self-supervised learning; transformer;
D O I
10.1109/ACCESS.2024.3454753
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the various industrial manufacturing processes, the automatic visual inspection system is an essential part as it reduces the chances of delivering defective products and the cost of training and hiring experts for manual inspection. In this work, we propose a new unsupervised anomaly detection method inspired by the masked language model for the automatic visual inspection system. The proposed method consists of an image tokenizer and two subnetworks, a reconstruction subnetwork, and a segmentation subnetwork. We adopt a pre-trained self-supervised vision Transformer model to use it as an image tokenizer. Our first subnetwork is trained to predict the anomaly-free patch tokens and the second subnetwork is trained to produce anomaly segmentation results from both the reconstructed and input patch tokens. During training, only the two subnetworks are optimized, and parameters of an image tokenizer are frozen. Experimental results show that the proposed method exhibits better performance than conventional methods in detecting defective products by achieving 99.05% I-AUROC on MVTecAD dataset and 94.8% I-AUROC on BTAD.
引用
收藏
页码:127604 / 127613
页数:10
相关论文
共 50 条
  • [31] ON THE USE OF SELF-SUPERVISED PRE-TRAINED ACOUSTIC AND LINGUISTIC FEATURES FOR CONTINUOUS SPEECH EMOTION RECOGNITION
    Macary, Manon
    Tahon, Marie
    Esteve, Yannick
    Rousseau, Anthony
    2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT), 2021, : 373 - 380
  • [32] Employing bimodal representations to predict DNA bendability within a self-supervised pre-trained framework
    Yang, Minghao
    Zhang, Shichen
    Zheng, Zhihang
    Zhang, Pengfei
    Liang, Yan
    Tang, Shaojun
    NUCLEIC ACIDS RESEARCH, 2024, 52 (06)
  • [33] GhostEncoder: Stealthy backdoor attacks with dynamic triggers to pre-trained encoders in self-supervised learning
    Wang, Qiannan
    Yin, Changchun
    Fang, Liming
    Liu, Zhe
    Wang, Run
    Lin, Chenhao
    COMPUTERS & SECURITY, 2024, 142
  • [34] Self-supervised Bidirectional Prompt Tuning for Entity-enhanced Pre-trained Language Model
    Zou, Jiaxin
    Xu, Xianghong
    Hou, Jiawei
    Yang, Qiang
    Zheng, Hai-Tao
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [35] Can Self-Supervised Neural Representations Pre-Trained on Human Speech distinguish Animal Callers?
    Sarkar, Eklavya
    Magimai-Doss, Mathew
    INTERSPEECH 2023, 2023, : 1189 - 1193
  • [36] Token Boosting for Robust Self-Supervised Visual Transformer Pre-training
    Li, Tianjiao
    Foo, Lin Geng
    Hu, Ping
    Shang, Xindi
    Rahmani, Hossein
    Yuan, Zehuan
    Liu, Jun
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 24027 - 24038
  • [37] Hyperspectral anomaly detection with self-supervised anomaly prior
    Liu, Yidan
    Jiang, Kai
    Xie, Weiying
    Zhang, Jiaqing
    Li, Yunsong
    Fang, Leyuan
    NEURAL NETWORKS, 2025, 187
  • [38] WAKE: A Weakly Supervised Business Process Anomaly Detection Framework via a Pre-Trained Autoencoder
    Guan, Wei
    Cao, Jian
    Zhao, Haiyan
    Gu, Yang
    Qian, Shiyou
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (06) : 2745 - 2758
  • [39] Generative Pre-Trained Transformer for Cardiac Abnormality Detection
    Gaudilliere, Pierre Louis
    Sigurthorsdottir, Halla
    Aguet, Clementine
    Van Zaen, Jerome
    Lemay, Mathieu
    Delgado-Gonzalo, Ricard
    2021 COMPUTING IN CARDIOLOGY (CINC), 2021,
  • [40] Integrally Migrating Pre-trained Transformer Encoder-decoders for Visual Object Detection
    Liu, Feng
    Zhang, Xiaosong
    Peng, Zhiliang
    Guo, Zonghao
    Wan, Fang
    Ji, Xiangyang
    Ye, Qixiang
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 6802 - 6811