Transformer-enabled weakly supervised abnormal event detection in intelligent video surveillance systems

Cited by: 0
Authors
Paulraj, Shalmiya [1]
Vairavasundaram, Subramaniyaswamy [2]
Affiliations
[1] SASTRA Deemed Univ, Sch Comp, Thanjavur 613401, India
[2] Vellore Inst Technol, Sch Comp Sci & Engn, Vellore 632014, India
Keywords
Artificial intelligence; Abnormal event detection; Computer vision; Transformer models; Global self-attention; Intelligent video surveillance; Real-time monitoring
DOI
10.1016/j.engappai.2024.109496
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Weakly supervised Video Anomaly Detection (VAD) operates with only limited video-level annotations. It plays a pivotal role in surveillance and security applications such as public safety, patient monitoring, and autonomous vehicles. VAD also extends to industrial settings, where it is instrumental in safeguarding worker safety, enabling real-time production quality monitoring, and supporting predictive maintenance. These diverse applications highlight the versatility of VAD and its potential to transform processes across industries, making it an essential tool beyond traditional surveillance. Most existing studies focus on critical shortcomings of VAD, such as high false alarm rates and misdetection. These challenges can be addressed effectively by capturing the intricate spatiotemporal patterns within video data. The proposed work, Swin Transformer-based Hybrid Temporal Adaptive Module (ST-HTAM) Abnormal Event Detection, therefore introduces an intuitive temporal module and leverages the strengths of Swin (shifted window-based) Transformers for spatial analysis. The novelty of this work lies in the hybridization of global self-attention and Convolutional Long Short-Term Memory (C-LSTM) networks, which are renowned for capturing both global and local temporal dependencies. By extracting these spatial and temporal components, ST-HTAM offers a comprehensive understanding of anomalous events and enhances the accuracy and robustness of Weakly Supervised VAD (WS-VAD). Finally, an anomaly scoring mechanism is employed in the classification step to detect anomalies in test video data. The proposed system is tailored to operate in real time and highlights the dual focus on sophisticated Artificial Intelligence (AI) techniques and their impactful use cases across diverse domains. Comprehensive experiments on benchmark datasets show that ST-HTAM substantially outperforms state-of-the-art approaches. Code is available at https://github.com/Shalmiyapaulraj78/STHTAM-VAD.
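To make two ideas from the abstract concrete (global self-attention over the temporal axis, and a final anomaly scoring step), the following is a minimal pure-Python sketch. It is not the authors' implementation, which is in the repository linked above. All function names, the identity query/key/value projections, the linear-plus-sigmoid scorer, and the top-k averaging used to obtain a video-level score are illustrative assumptions; the Swin backbone and the C-LSTM branch are omitted entirely.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def global_self_attention(features):
    """Scaled dot-product self-attention across all snippets of one video.

    Assumption: identity projections for queries, keys, and values, so each
    output row is a convex combination of the input snippet features.
    """
    d = len(features[0])
    out = []
    for q in features:
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d) for k in features]
        weights = softmax(scores)
        out.append([sum(w * v[j] for w, v in zip(weights, features)) for j in range(d)])
    return out


def snippet_scores(features, weights, bias):
    """Hypothetical scorer: a linear layer followed by a sigmoid, per snippet."""
    return [
        1.0 / (1.0 + math.exp(-(sum(w * x for w, x in zip(weights, f)) + bias)))
        for f in features
    ]


def video_score(scores, k=2):
    """Assumed aggregation: mean of the k highest snippet scores.

    With only video-level labels, a score like this can be compared against
    the video label during training.
    """
    top = sorted(scores, reverse=True)[:k]
    return sum(top) / len(top)


# Three snippets with 2-dimensional (made-up) features.
feats = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
attended = global_self_attention(feats)
scores = snippet_scores(attended, weights=[0.5, -0.5], bias=0.0)
video_level = video_score(scores, k=2)
```

In this sketch every snippet attends to every other snippet, which is what makes the attention "global"; a local temporal branch such as a C-LSTM would instead process neighbouring snippets sequentially.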
Pages: 16
Related papers
50 in total
  • [31] Intelligent Transport Surveillance Memory Enhanced Method for Detection of Abnormal Behavior in Video
    Zhang, Deng-Hui
    Journal of Advanced Transportation, 2022, 2022
  • [32] Unsupervised Learning Approach for Abnormal Event Detection in Surveillance Video by Hybrid Autoencoder
    Zhou, Fuqiang
    Wang, Lin
    Li, Zuoxin
    Zuo, Wangxia
    Tan, Haishu
    NEURAL PROCESSING LETTERS, 2020, 52 (02) : 961 - 975
  • [34] Audio Pyramid Transformer with Domain Adaption for Weakly Supervised Sound Event Detection and Audio Classification
    Xin, Yifei
    Yang, Dongchao
    Zou, Yuexian
    INTERSPEECH 2022, 2022, : 1546 - 1550
  • [35] Scenario-Guided Transformer-Enabled Multi-Modal Unknown Event Classification for Air Transport
    Yang, Yang
    Zhang, Yishan
    Qian, Shengsheng
    Cai, Kaiquan
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, : 21658 - 21671
  • [36] Weakly Supervised Video Anomaly Detection via Self-Guided Temporal Discriminative Transformer
    Huang, Chao
    Liu, Chengliang
    Wen, Jie
    Wu, Lian
    Xu, Yong
    Jiang, Qiuping
    Wang, Yaowei
    IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (05) : 3197 - 3210
  • [37] Relabeling Abnormal Videos via Intra-Video Label Propagation for Weakly Supervised Video Anomaly Detection
    Thou, Wenhao
    Li, Yingxuan
    Zhao, Jiancheng
    Zhao, Chunhui
    2024 14TH ASIAN CONTROL CONFERENCE, ASCC 2024, 2024, : 1200 - 1205
  • [38] The Impact of Video Transcoding Parameters on Event Detection for Surveillance Systems
    Kafetzakis, Emmanouil
    Xilouris, Christos
    Kourtis, Michail Alexandros
    Nieto, Marcos
    Jargalsaikhan, Iveel
    Little, Suzanne
    2013 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2013, : 333 - 338
  • [39] Weakly Supervised Video Salient Object Detection
    Zhao, Wangbo
    Zhang, Jing
    Li, Long
    Barnes, Nick
    Liu, Nian
    Han, Junwei
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 16821 - 16830
  • [40] Overlooked Video Classification in Weakly Supervised Video Anomaly Detection
    Tan, Weijun
    Yao, Qi
    Liu, Jingfeng
    2024 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS, WACVW 2024, 2024, : 212 - 220