CViT: A Convolution Vision Transformer for Video Abnormal Behavior Detection and Localization

被引:0
|
作者
Roka S. [1 ]
Diwakar M. [1 ,2 ]
机构
[1] CSE Department, Graphic Era deemed to be University, Dehradun
[2] Graphic Era Hill University, Dehradun
关键词
Abnormal behavior; Abnormality; Anomaly detection; AUC; EER; Normal; Transformer; YOLO;
D O I
10.1007/s42979-023-02294-y
中图分类号
学科分类号
摘要
Video anomaly detection is a critical task because of the rare, irregular, and unbounded nature of abnormal events. Currently, most approaches only rely on CNN for such tasks, but due to spatial inductive bias, it can extract only local features from images which is insufficient for video anomaly detection. Recently, transformer-based approaches are getting popular due to their global self-attention mechanism and are considered alternatives to CNN convolution for sequence-to-sequence anomaly detection. Unfortunately, because of a lack of inadequate low-level information, it has limited localization abilities. In this paper, we have proposed a new approach using the CViT block. We design our approach by fusing U-Net and transformer and modified encoder by stacking the CViT block one after the other. This type of combination permits our model to extract richer local and global features from RGB frames. Our approach contains two modules: anomaly detection module is used to detect abnormal frames using PSNR and anomaly score. Whereas the anomaly localization module accepts only a list of abnormal frames and contains the object detection algorithm YOLO to highlight abnormal objects. Our approach was first evaluated by our own custom dataset GEU and for comparison, we use standard benchmark datasets UCSD, CUHK Avenue, and ShanghaiTech. Comparative results depict better performance of our approach in detecting abnormal events. © 2023, The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd.
引用
收藏
相关论文
共 50 条
  • [21] Anomalous Crowd Behavior Detection and Localization in Video Surveillance
    Chen, Chunyu
    Shao, Yu
    2014 IEEE INTERNATIONAL CONFERENCE ON CONTROL SCIENCE AND SYSTEMS ENGINEERING, 2014, : 190 - 194
  • [22] Research on Video Abnormal Behavior Detection Based on Deep Learning
    Peng Jiali
    Zhao Yingliang
    Wang Liming
    LASER & OPTOELECTRONICS PROGRESS, 2021, 58 (06)
  • [23] Topology Information Guided Video Abnormal Behavior Detection Method
    Chen, Mingyi
    Li, Hongjun
    Computer Engineering and Applications, 2024, 60 (16) : 228 - 235
  • [24] Research on Detection Method of Abnormal Behavior of People in Video Surveillance
    Zhai, Bo
    2018 5TH INTERNATIONAL CONFERENCE ON ELECTRICAL & ELECTRONICS ENGINEERING AND COMPUTER SCIENCE (ICEEECS 2018), 2018, : 289 - 293
  • [25] VT-ADL: A Vision Transformer Network for Image Anomaly Detection and Localization
    Mishra, Pankaj
    Verk, Riccardo
    Fornasier, Daniele
    Piciarelli, Claudio
    Foresti, Gian Luca
    PROCEEDINGS OF 2021 IEEE 30TH INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS (ISIE), 2021,
  • [26] Anomaly Detection and Localization in Optical Networks Using Vision Transformer and SOP Monitoring
    Abdelli, K.
    Lonardi, M.
    Gripp, J.
    Correa, D.
    Olsson, S.
    Boitier, F.
    Layec, P.
    2024 OPTICAL FIBER COMMUNICATIONS CONFERENCE AND EXHIBITION, OFC, 2024,
  • [27] VITRANSPAD: VIDEO TRANSFORMER USING CONVOLUTION AND SELF-ATTENTION FOR FACE PRESENTATION ATTACK DETECTION
    Ming, Zuheng
    Yu, Zitong
    Al-Ghadi, Musab
    Visani, Muriel
    Luqman, Muhammad Muzzamil
    Burie, Jean-Christophe
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 4248 - 4252
  • [28] Fast and accurate detection and localization of abnormal behavior in crowded scenes
    Mohammad Sabokrou
    Mahmood Fathy
    Zahra Moayed
    Reinhard Klette
    Machine Vision and Applications, 2017, 28 : 965 - 985
  • [29] Fast and accurate detection and localization of abnormal behavior in crowded scenes
    Sabokrou, Mohammad
    Fathy, Mahmood
    Moayed, Zahra
    Klette, Reinhard
    MACHINE VISION AND APPLICATIONS, 2017, 28 (08) : 965 - 985
  • [30] SViT: A Spectral Vision Transformer for the Detection of REM Sleep Behavior Disorder
    Gunter, Katarina Mary
    Brink-Kjaer, Andreas
    Mignot, Emmanuel
    Sorensen, Helge B. D.
    During, Emmanuel
    Jennum, Poul
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2023, 27 (09) : 4285 - 4292