Weakly Supervised Video Anomaly Detection via Transformer-Enabled Temporal Relation Learning

Cited by: 22

Authors:

Zhang, Dasheng [1]

Huang, Chao [2]

Liu, Chengliang [2]

Xu, Yong [2,3]

Affiliations:

[1] Chongqing Univ, Sch Artificial Intelligence, Chongqing 401135, Peoples R China

[2] Harbin Inst Technol, Shenzhen Key Lab Visual Object Detect & Recognit, Shenzhen 518055, Peoples R China

[3] Peng Cheng Lab, Shenzhen 518055, Peoples R China

Source:

IEEE SIGNAL PROCESSING LETTERS | 2022 / Vol. 29

Funding:

National Key Research and Development Program of China;

Keywords:

Feature extraction; Transformers; Task analysis; Anomaly detection; Training; Surveillance; Training data; Deep learning; video anomaly detection; vision transformer; weakly-supervised learning;

DOI:

10.1109/LSP.2022.3175092

Chinese Library Classification:

TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];

Discipline classification codes:

0808 ; 0809 ;

Abstract:

Weakly supervised video anomaly detection is a challenging problem due to the lack of frame-level labels in training videos. Most previous works tackle this task with the multiple instance learning paradigm, which divides a video into multiple snippets and trains a snippet classifier to distinguish anomalous snippets from normal ones using video-level supervision. Although existing approaches have achieved remarkable progress, they are still limited by insufficient feature representations. In this paper, we propose a novel weakly supervised temporal relation learning framework for anomaly detection, which efficiently explores the temporal relations between snippets and enhances the discriminative power of features using only videos with video-level labels. To this end, we design a transformer-enabled feature encoder that converts the input task-agnostic features into discriminative task-specific features by mining the semantic correlation and positional relation between video snippets. As a result, our model can detect anomalies in the current video snippet more accurately based on the learned discriminative features. Experimental results indicate that the proposed method outperforms existing state-of-the-art approaches, which demonstrates the effectiveness of our model.
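
The multiple instance learning setup and the idea of relating snippets to one another can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation: the abstract does not give the architecture or the loss, so the single-head attention with identity projections (and no positional encoding), the linear snippet classifier, the max pooling, and the hinge-style ranking loss are all assumptions chosen for illustration, and every function name is hypothetical.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(features):
    """Single-head dot-product self-attention with identity projections.

    features: list of T snippet vectors, each a list of D floats.
    Returns T vectors; each is a weighted mix of all snippet vectors,
    so every snippet representation depends on the other snippets.
    (Assumption: no learned projections and no positional encoding.)
    """
    dim = len(features[0])
    out = []
    for q in features:
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(dim)
                  for k in features]
        weights = softmax(scores)
        out.append([sum(w * v[d] for w, v in zip(weights, features))
                    for d in range(dim)])
    return out


def snippet_scores(features, weights, bias=0.0):
    """Hypothetical linear snippet classifier followed by a sigmoid."""
    return [1.0 / (1.0 + math.exp(-(sum(w * x for w, x in zip(weights, f)) + bias)))
            for f in features]


def video_score(scores):
    """MIL pooling: a video is as anomalous as its most anomalous snippet."""
    return max(scores)


def mil_ranking_loss(abnormal_scores, normal_scores, margin=1.0):
    """Hinge loss that pushes the top abnormal score above the top normal score.

    Only video-level labels are needed: which video is abnormal, not which snippet.
    """
    return max(0.0, margin - video_score(abnormal_scores) + video_score(normal_scores))


# Usage: two toy videos with three 2-D snippet features each (made-up numbers).
abnormal_video = [[0.1, 0.2], [2.0, 1.5], [0.2, 0.1]]
normal_video = [[0.1, 0.1], [0.2, 0.2], [0.1, 0.3]]
classifier_weights = [1.0, 1.0]  # hypothetical, untrained
loss = mil_ranking_loss(
    snippet_scores(self_attention(abnormal_video), classifier_weights),
    snippet_scores(self_attention(normal_video), classifier_weights),
)
```

In a trained system the classifier weights and attention projections would be learned by minimizing such a loss over pairs of abnormal and normal videos; here they are fixed only to keep the sketch self-contained.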


Pages: 1197 - 1201

Page count: 5

50 records in total

[31] Attention-based framework for weakly supervised video anomaly detection
Hualin Ma
Liyan Zhang
The Journal of Supercomputing, 2022, 78 : 8409 - 8429
[32] Attention-based framework for weakly supervised video anomaly detection
Ma, Hualin
Zhang, Liyan
JOURNAL OF SUPERCOMPUTING, 2022, 78 (06) : 8409 - 8429
[33] Inter-Clip Feature Similarity Based Weakly Supervised Video Anomaly Detection via Multi-Scale Temporal MLP
Zhong, Yuanhong
Zhu, Ruyue
Yan, Ge
Gan, Ping
Shen, Xuerui
Zhu, Dong
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (02) : 1961 - 1970
[34] Weakly-Supervised Video Anomaly Detection with MTDA-Net
Wu, Huixin
Yang, Mengfan
Wei, Fupeng
Shi, Ge
Jiang, Wei
Qiao, Yaqiong
Dong, Hangcheng
ELECTRONICS, 2023, 12 (22)
[35] Multimodal and multiscale feature fusion for weakly supervised video anomaly detection
Sun, Wenwen
Cao, Lin
Guo, Yanan
Du, Kangning
SCIENTIFIC REPORTS, 2024, 14 (01)
[36] Weakly-Supervised Video Anomaly Detection With Snippet Anomalous Attention
Fan, Yidan
Yu, Yongxin
Lu, Wenhuan
Han, Yahong
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (07) : 5480 - 5492
[37] Spiking Reinforcement Learning for Weakly-Supervised Anomaly Detection
Jin, Ao
Wu, Zhichao
Zhu, Li
Xia, Qianchen
Yang, Xin
NEURAL INFORMATION PROCESSING, ICONIP 2023, PT V, 2024, 14451 : 175 - 187
[38] TFD-Net: Transformer Deviation Network for Weakly Supervised Anomaly Detection
Gan, Hongping
Zheng, Hejie
Wu, Zhangfa
Ma, Chunyan
Liu, Jie
IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2025, 22 (01) : 941 - 954
[39] Weakly-supervised anomaly detection in video surveillance via graph convolutional label noise cleaning
Li, Nannan
Zhong, Jia-Xing
Shu, Xiujun
Guo, Huiwen
NEUROCOMPUTING, 2022, 481 : 154 - 167
[40] Transformer with Spatio-Temporal Representation for Video Anomaly Detection
Sun, Xiaohu
Chen, Jinyi
Shen, Xulin
Li, Hongjun
STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, S+SSPR 2022, 2022, 13813 : 213 - 222
