Weakly Supervised Video Anomaly Detection via Transformer-Enabled Temporal Relation Learning

被引:22
|
作者
Zhang, Dasheng [1 ]
Huang, Chao [2 ]
Liu, Chengliang [2 ]
Xu, Yong [2 ,3 ]
机构
[1] Chongqing Univ, Sch Artificial Intelligence, Chongqing 401135, Peoples R China
[2] Harbin Inst Technol, Shenzhen Key Lab Visual Object Detect & Recognit, Shenzhen 518055, Peoples R China
[3] Peng Cheng Lab, Shenzhen 518055, Peoples R China
基金
国家重点研发计划;
关键词
Feature extraction; Transformers; Task analysis; Anomaly detection; Training; Surveillance; Training data; Deep learning; video anomaly detection; vision transformer; weakly-supervised learning;
D O I
10.1109/LSP.2022.3175092
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Weakly supervised video anomaly detection is a challenging problem due to the lack of frame-level labels in training videos. Most previous works typically tackle this task with the multiple instance learning paradigm, which divides a video into multiple snippets and trains a snippet classifier to distinguish anomalies from normal snippets via video-level supervision information. Although existing approaches achieve remarkable progresses, these solutions are still limited in the insufficient representations. In this paper, we propose a novel weakly supervised temporal relation learning framework for anomaly detection, which efficiently explores the temporal relation between snippets and enhances the discriminative powers of features using only video-level labelled videos. To this end, we design a transformer-enabled feature encoder to convert the input task-agnostic features into discriminative task-specific features by mining the semantic correlation and position relation between video snippets. As a result, our model can make a more accurate anomaly detection for current video snippet based on the learned discriminative features. Experimental results indicate that the proposed method is superior to existing state-of-the-art approaches, which demonstrates the effectiveness of our model.
引用
收藏
页码:1197 / 1201
页数:5
相关论文
共 50 条
  • [31] Attention-based framework for weakly supervised video anomaly detection
    Hualin Ma
    Liyan Zhang
    The Journal of Supercomputing, 2022, 78 : 8409 - 8429
  • [32] Attention-based framework for weakly supervised video anomaly detection
    Ma, Hualin
    Zhang, Liyan
    JOURNAL OF SUPERCOMPUTING, 2022, 78 (06): : 8409 - 8429
  • [33] Inter-Clip Feature Similarity Based Weakly Supervised Video Anomaly Detection via Multi-Scale Temporal MLP
    Zhong, Yuanhong
    Zhu, Ruyue
    Yan, Ge
    Gan, Ping
    Shen, Xuerui
    Zhu, Dong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (02) : 1961 - 1970
  • [34] Weakly-Supervised Video Anomaly Detection with MTDA-Net
    Wu, Huixin
    Yang, Mengfan
    Wei, Fupeng
    Shi, Ge
    Jiang, Wei
    Qiao, Yaqiong
    Dong, Hangcheng
    ELECTRONICS, 2023, 12 (22)
  • [35] Multimodal and multiscale feature fusion for weakly supervised video anomaly detection
    Sun, Wenwen
    Cao, Lin
    Guo, Yanan
    Du, Kangning
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [36] Weakly-Supervised Video Anomaly Detection With Snippet Anomalous Attention
    Fan, Yidan
    Yu, Yongxin
    Lu, Wenhuan
    Han, Yahong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (07) : 5480 - 5492
  • [37] Spiking Reinforcement Learning for Weakly-Supervised Anomaly Detection
    Jin, Ao
    Wu, Zhichao
    Zhu, Li
    Xia, Qianchen
    Yang, Xin
    NEURAL INFORMATION PROCESSING, ICONIP 2023, PT V, 2024, 14451 : 175 - 187
  • [38] TFD-Net: Transformer Deviation Network for Weakly Supervised Anomaly Detection
    Gan, Hongping
    Zheng, Hejie
    Wu, Zhangfa
    Ma, Chunyan
    Liu, Jie
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2025, 22 (01): : 941 - 954
  • [39] Weakly-supervised anomaly detection in video surveillance via graph convolutional label noise cleaning
    Li, Nannan
    Zhong, Jia-Xing
    Shu, Xiujun
    Guo, Huiwen
    NEUROCOMPUTING, 2022, 481 : 154 - 167
  • [40] Transformer with Spatio-Temporal Representation for Video Anomaly Detection
    Sun, Xiaohu
    Chen, Jinyi
    Shen, Xulin
    Li, Hongjun
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, S+SSPR 2022, 2022, 13813 : 213 - 222