Unsupervised Visual Anomaly Detection Using Self-Supervised Pre-Trained Transformer

Cited by: 0

Authors:

Kim, Jun-Hyung [1]

Kwon, Goo-Rak [1]

Affiliations:

[1] Chosun Univ, Dept Informat & Commun Engn, Gwangju 61452, South Korea

Source:

IEEE ACCESS | 2024 / Vol. 12

Keywords:

Image reconstruction; Image segmentation; Transformers; Computational modeling; Location awareness; Feature extraction; Anomaly detection; Data augmentation; Self-supervised learning; data-augmentation; self-supervised learning; transformer;

DOI:

10.1109/ACCESS.2024.3454753

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

In industrial manufacturing, automatic visual inspection is essential: it reduces both the chance of delivering defective products and the cost of hiring and training experts for manual inspection. In this work, we propose a new unsupervised anomaly detection method for automatic visual inspection, inspired by the masked language model. The proposed method consists of an image tokenizer and two subnetworks: a reconstruction subnetwork and a segmentation subnetwork. We adopt a pre-trained self-supervised vision Transformer as the image tokenizer. The reconstruction subnetwork is trained to predict anomaly-free patch tokens, and the segmentation subnetwork is trained to produce anomaly segmentation results from both the reconstructed and the input patch tokens. During training, only the two subnetworks are optimized; the parameters of the image tokenizer are frozen. Experimental results show that the proposed method detects defective products better than conventional methods, achieving 99.05% I-AUROC on the MVTec AD dataset and 94.8% I-AUROC on BTAD.
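The I-AUROC figures quoted above are image-level areas under the ROC curve: the probability that a randomly chosen defective image receives a higher anomaly score than a randomly chosen normal one. The sketch below shows one common way to compute that quantity from per-image scores, using the pairwise (Mann-Whitney) formulation. The function name `image_auroc` and the scores and labels are hypothetical and serve only as illustration; the abstract does not state how the paper derives an image-level score from its segmentation output, so that step is left out.

```python
def image_auroc(scores, labels):
    """Image-level AUROC via pairwise comparison (ties count as half a win).

    scores: one anomaly score per image (higher = more anomalous).
    labels: 1 for a defective image, 0 for a normal image.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one normal and one defective image")

    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical anomaly scores for six test images (illustrative values only).
scores = [0.10, 0.35, 0.20, 0.80, 0.30, 0.90]
labels = [0, 0, 0, 1, 1, 1]

auroc = image_auroc(scores, labels)
print(f"I-AUROC: {auroc:.4f}")  # prints I-AUROC: 0.8889
```

Here one defective image (score 0.30) ranks below one normal image (score 0.35), so eight of the nine defective/normal pairs are ordered correctly. The quadratic pairwise loop is chosen for clarity; a rank-based implementation would scale better on large test sets.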


Pages: 127604-127613

Page count: 10
