Progressive prediction: Video anomaly detection via multi-grained prediction

被引:1
|
作者
Zeng, Xianlin [1 ]
Jiang, Yalong [2 ]
Wang, Yufeng [2 ]
Fu, Qiang [2 ]
Ding, Wenrui [2 ]
机构
[1] Beihang Univ, Sch Elect & Informat Engn, Beijing, Peoples R China
[2] Beihang Univ, Unmanned Syst Res Inst, Beijing 100191, Peoples R China
基金
北京市自然科学基金;
关键词
computer vision; unsupervised learning; video signal processing; video surveillance; IDENTIFICATION; ALGAE; NETWORKS;
D O I
10.1049/ipr2.13117
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video Anomaly Detection (VAD) has been an active research field for several decades. However, most existing approaches merely extract a single type of feature from videos and define a single paradigm to indicate the extent of abnormalities. A coarse-to-fine three-level prediction is built by integrating different levels of spatio-temporal representations, better highlighting the difference between normal and abnormal behaviors. First, an object-level trajectory prediction is proposed to model human historical position using a graph transformer network. Subsequently, skeleton-level prediction is achieved by incorporating the positional information from the trajectory prediction. More importantly, based on the predicted skeleton, a skeleton-guided pixel-level region prediction is performed. A novel Skeleton Conditioned Generative Adversarial Network (SCGAN) is designed to explore the correlation between skeleton-level and pixel-level motion prediction. Benefiting from SCGAN, the prediction of human regions is contributed by both coarse-grained and fine-grained motion features. This three-level prediction, namely Progressive Prediction Video Anomaly Detection (P3VAD), enlarges the prediction error on irregular motion patterns. Besides, a pixel-level analysis method is proposed to achieve Background-bias Elimination (BE) and denoise the predicted region. Experimental results validate the effectiveness of P3VAD on the four benchmark datasets (ShanghaiTech, CUHK Avenue, IITB-Corridor, and ADOC). This three-level prediction, namely Progressive Prediction Video Anomaly Detection (P3VAD), enlarges the prediction error on irregular motion patterns. This is the first effort to progressively combine three-level predictions from coarse to fine-grained for VAD. We demonstrate the effectiveness of our framework by conducting an extensive experimental evaluation on the four publicly large-scale benchmark datasets in both micro-AUC and macro-AUC metrics. image
引用
收藏
页码:2568 / 2583
页数:16
相关论文
共 50 条
  • [31] MPE: Multi-frame prediction error-based video anomaly detection framework for robust anomaly inference
    Kim, Yujun
    Kim, Young-Gab
    PATTERN RECOGNITION, 2025, 164
  • [32] Video-Text Retrieval by Supervised Sparse Multi-Grained Learning
    Wang, Yimu
    Shi, Peng
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 633 - 649
  • [33] Multi-Grained Deep Feature Learning for Robust Pedestrian Detection
    Lin, Chunze
    Lu, Jiwen
    Zhou, Jie
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (12) : 3608 - 3621
  • [34] Clinical Trial Retrieval via Multi-grained Similarity Learning
    Luo, Junyu
    Qian, Cheng
    Glass, Lucas
    Ma, Fenglong
    PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 2950 - 2954
  • [35] Surveillance video anomaly detection via non-local U-Net frame prediction
    Zhang, Qianqian
    Feng, Guorui
    Wu, Hanzhou
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (19) : 27073 - 27088
  • [36] Improving apparel detection with category grouping and multi-grained branches
    Qing Tian
    Sampath Chanda
    Amit Kumar K C
    Douglas Gray
    Multimedia Tools and Applications, 2023, 82 : 7383 - 7400
  • [37] Improving apparel detection with category grouping and multi-grained branches
    Tian, Qing
    Chanda, Sampath
    Kumar, Amit K. C.
    Gray, Douglas
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (05) : 7383 - 7400
  • [38] Surveillance video anomaly detection via non-local U-Net frame prediction
    Qianqian Zhang
    Guorui Feng
    Hanzhou Wu
    Multimedia Tools and Applications, 2022, 81 : 27073 - 27088
  • [39] Margin Learning Embedded Prediction for Video Anomaly Detection with A Few Anomalies
    Liu, Wen
    Luo, Weixin
    Li, Zhengxin
    Zhao, Peilin
    Gao, Shenghua
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 3023 - 3030
  • [40] Spatio-temporal prediction and reconstruction network for video anomaly detection
    Liu, Ting
    Zhang, Chengqing
    Niu, Xiaodong
    Wang, Liming
    PLOS ONE, 2022, 17 (05):