Masked Swin Transformer Unet for Industrial Anomaly Detection

被引:69
|
作者
Jiang, Jielin [1 ,2 ,3 ,4 ]
Zhu, Jiale [1 ]
Bilal, Muhammad [5 ]
Cui, Yan [6 ]
Kumar, Neeraj [7 ,8 ,9 ]
Dou, Ruihan [10 ]
Su, Feng [11 ]
Xu, Xiaolong [1 ,2 ,3 ,4 ]
机构
[1] Nanjing Univ Informat Sci & Technol, Sch Comp Sci, Nanjing 210044, Peoples R China
[2] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing 210023, Peoples R China
[3] Nanjing Univ Informat Sci & Technol, Engn Res Ctr Digital Forens, Minist Educ, Nanjing 210044, Peoples R China
[4] Nanjing Univ Informat Sci & Technol, Jiangsu Collaborat Innovat Ctr Atmospher Environm, Nanjing 210044, Peoples R China
[5] Hankuk Univ Foreign Studies, Dept Comp & Elect Syst Engn, Yongin 17035, South Korea
[6] Nanjing Normal Univ Special Educ, Coll Math & Informat Sci, Nanjing 210038, Peoples R China
[7] Univ Petr & Energy Studies, Sch Comp Sci, Dehra Dun 248007, India
[8] Asia Univ, Dept Comp Sci & Informat Engn, Taichung 41354, Taiwan
[9] King Abdulaziz Univ, Jeddah 22254, Saudi Arabia
[10] Univ Waterloo, Fac Math, Waterloo, ON N2L 3G1, Canada
[11] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing 210023, Peoples R China
基金
中国博士后科学基金; 美国国家科学基金会;
关键词
Anomaly detection; inpainting; Swin Transformer; Unet;
D O I
10.1109/TII.2022.3199228
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The intelligent detection process for industrial anomalies employs artificial intelligence methods to classify images that deviate from a normal appearance. Traditional convolutional neural network (CNN)-based anomaly detection algorithms mainly use the network to restructure abnormal areas and detect anomalies by calculating the errors between the original image and reconstructed image. However, the traditional CNNs struggle to extract global context information, resulting in poor anomaly detection performance. Thus, a masked Swin Transformer Unet (MSTUnet) for anomaly detection is proposed. To solve the problem of insufficient abnormal samples in the training phase, an anomaly simulation and mask strategy is first applied on anomaly-free samples to generate a simulated anomaly and, then, the Swin Transformer's powerful global learning ability is used to inpaint the masked area. Finally, a convolution-based Unet network is used for end-to-end anomaly detection. Experimental results on industrial dataset MVTec AD show that MSTUnet achieves superior anomaly detection and localization performance.
引用
收藏
页码:2200 / 2209
页数:10
相关论文
共 50 条
  • [31] UNet-like network fused swin transformer and CNN for semantic image synthesis
    Ke, Aihua
    Luo, Jian
    Cai, Bo
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [32] MCST-UNET: HIGH-RESOLUTION SEMANTIC SEGMENTATION WITH SWIN TRANSFORMER MODULES
    Guo, Hongwei
    Ma, Chuang
    Xu, Guangxia
    Liu, Pengcheng
    JOURNAL OF NONLINEAR AND CONVEX ANALYSIS, 2024, 25 (06) : 1313 - 1323
  • [33] SwinE-UNet3+: swin transformer encoder network for medical image segmentation
    Zou, Ping
    Wu, Jian-Sheng
    PROGRESS IN ARTIFICIAL INTELLIGENCE, 2023, 12 (01) : 99 - 105
  • [34] An industrial product surface anomaly detection method based on masked image modeling
    Tang, Shancheng
    Li, Heng
    Dai, Fenghua
    Yang, Jiqing
    Jin, Zicheng
    Lu, Jianhui
    Zhang, Ying
    NONDESTRUCTIVE TESTING AND EVALUATION, 2024,
  • [35] Small Object Detection for Birds with Swin Transformer
    Huo, Da
    Kastner, Marc A.
    Liu, Tingwei
    Kawanishi, Yasutomo
    Hirayama, Takatsugu
    Komamizu, Takahiro
    Ide, Ichiro
    2023 18TH INTERNATIONAL CONFERENCE ON MACHINE VISION AND APPLICATIONS, MVA, 2023,
  • [36] Down Syndrome detection with Swin Transformer architecture
    Wang, Chengyu
    Yu, Limin
    Su, Jionglong
    Mahy, Trevor
    Selis, Valerio
    Yang, Chunxiao
    Ma, Fei
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 86
  • [37] Cervical OCT image classification using contrastive masked autoencoders with Swin Transformer
    Wang, Qingbin
    Xiong, Yuxuan
    Zhu, Hanfeng
    Mu, Xuefeng
    Zhang, Yan
    Ma, Yutao
    COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2024, 118
  • [38] Swin Transformer for Pedestrian and Occluded Pedestrian Detection
    Liang, Jung-An
    Ding, Jian-Jiun
    2024 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS 2024, 2024,
  • [39] Ts-Unet: A Temporal Smoothed Unet for Video Anomaly Detection
    Wang, Zhiguo
    Yang, Zhongliang
    Zhang, Yujin
    Su, Nan
    Wang, Guijin
    IMAGE AND GRAPHICS (ICIG 2021), PT III, 2021, 12890 : 789 - 798
  • [40] Prior Normality Prompt Transformer for Multiclass Industrial Image Anomaly Detection
    Yao, Haiming
    Cao, Yunkang
    Luo, Wei
    Zhang, Weihang
    Yu, Wenyong
    Shen, Weiming
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (10) : 11866 - 11876