Masked Swin Transformer Unet for Industrial Anomaly Detection

被引:69
|
作者
Jiang, Jielin [1 ,2 ,3 ,4 ]
Zhu, Jiale [1 ]
Bilal, Muhammad [5 ]
Cui, Yan [6 ]
Kumar, Neeraj [7 ,8 ,9 ]
Dou, Ruihan [10 ]
Su, Feng [11 ]
Xu, Xiaolong [1 ,2 ,3 ,4 ]
机构
[1] Nanjing Univ Informat Sci & Technol, Sch Comp Sci, Nanjing 210044, Peoples R China
[2] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing 210023, Peoples R China
[3] Nanjing Univ Informat Sci & Technol, Engn Res Ctr Digital Forens, Minist Educ, Nanjing 210044, Peoples R China
[4] Nanjing Univ Informat Sci & Technol, Jiangsu Collaborat Innovat Ctr Atmospher Environm, Nanjing 210044, Peoples R China
[5] Hankuk Univ Foreign Studies, Dept Comp & Elect Syst Engn, Yongin 17035, South Korea
[6] Nanjing Normal Univ Special Educ, Coll Math & Informat Sci, Nanjing 210038, Peoples R China
[7] Univ Petr & Energy Studies, Sch Comp Sci, Dehra Dun 248007, India
[8] Asia Univ, Dept Comp Sci & Informat Engn, Taichung 41354, Taiwan
[9] King Abdulaziz Univ, Jeddah 22254, Saudi Arabia
[10] Univ Waterloo, Fac Math, Waterloo, ON N2L 3G1, Canada
[11] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing 210023, Peoples R China
基金
中国博士后科学基金; 美国国家科学基金会;
关键词
Anomaly detection; inpainting; Swin Transformer; Unet;
D O I
10.1109/TII.2022.3199228
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The intelligent detection process for industrial anomalies employs artificial intelligence methods to classify images that deviate from a normal appearance. Traditional convolutional neural network (CNN)-based anomaly detection algorithms mainly use the network to restructure abnormal areas and detect anomalies by calculating the errors between the original image and reconstructed image. However, the traditional CNNs struggle to extract global context information, resulting in poor anomaly detection performance. Thus, a masked Swin Transformer Unet (MSTUnet) for anomaly detection is proposed. To solve the problem of insufficient abnormal samples in the training phase, an anomaly simulation and mask strategy is first applied on anomaly-free samples to generate a simulated anomaly and, then, the Swin Transformer's powerful global learning ability is used to inpaint the masked area. Finally, a convolution-based Unet network is used for end-to-end anomaly detection. Experimental results on industrial dataset MVTec AD show that MSTUnet achieves superior anomaly detection and localization performance.
引用
收藏
页码:2200 / 2209
页数:10
相关论文
共 50 条
  • [1] SUNet: Swin Transformer UNet for Image Denoising
    Fan, Chi-Mao
    Liu, Tsung-Jung
    Liu, Kuan-Hsien
    2022 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS 22), 2022, : 2333 - 2337
  • [2] SWUNet: Swin Transformer Based UNet for Hyperspectral Reconstruction
    Hussain, Sadia
    Lall, Brejesh
    Workshop on Hyperspectral Image and Signal Processing, Evolution in Remote Sensing, 2023,
  • [3] Swin Transformer UNet for Very High Resolution Image Dehazing
    Bian, Yuxin
    Zhang, Enguang
    Wang, Jiayan
    Xie, Rixin
    Jiang, Shenlu
    SENSORS AND MATERIALS, 2022, 34 (11) : 4029 - 4037
  • [4] Ground-based image deconvolution with Swin Transformer UNet
    Akhaury, U.
    Jablonka, P.
    Starck, J.-L.
    Courbin, F.
    Astronomy and Astrophysics, 2024, 688
  • [5] Ground-based image deconvolution with Swin Transformer UNet
    Akhaury, U.
    Jablonka, P.
    Starck, J. -l.
    Courbin, F.
    ASTRONOMY & ASTROPHYSICS, 2024, 688
  • [6] CTS-Unet : Urban change detection by convolutional Siamese concatenate network with Swin transformer
    Heidary, Farnoosh
    Yazdi, Mehran
    Setoodeh, Peyman
    Dehghani, Maryam
    ADVANCES IN SPACE RESEARCH, 2023, 72 (10) : 4272 - 4281
  • [7] Combining Swin Transformer With UNet for Remote Sensing Image Semantic Segmentation
    Fan, Lili
    Zhou, Yu
    Liu, Hongmei
    Li, Yunjie
    Cao, Dongpu
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61 : 1 - 11
  • [8] Self-Supervised Masked Convolutional Transformer Block for Anomaly Detection
    Madan, Neelu
    Ristea, Nicolae-Catalin
    Ionescu, Radu Tudor
    Nasrollahi, Kamal
    Khan, Fahad Shahbaz
    Moeslund, Thomas B.
    Shah, Mubarak
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (01) : 525 - 542
  • [9] Swin Transformer Embedding UNet for Remote Sensing Image Semantic Segmentation
    He, Xin
    Zhou, Yong
    Zhao, Jiaqi
    Zhang, Di
    Yao, Rui
    Xue, Yong
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [10] Attention Swin Transformer UNet for Landslide Segmentation in Remotely Sensed Images
    Liu, Bingxue
    Wang, Wei
    Wu, Yuming
    Gao, Xing
    REMOTE SENSING, 2024, 16 (23)