Progressive prediction: Video anomaly detection via multi-grained prediction

Cited by: 1
Authors
Zeng, Xianlin [1]
Jiang, Yalong [2]
Wang, Yufeng [2]
Fu, Qiang [2]
Ding, Wenrui [2]
Affiliations
[1] Beihang Univ, Sch Elect & Informat Engn, Beijing, Peoples R China
[2] Beihang Univ, Unmanned Syst Res Inst, Beijing 100191, Peoples R China
Funding
Beijing Natural Science Foundation
Keywords
computer vision; unsupervised learning; video signal processing; video surveillance; IDENTIFICATION; ALGAE; NETWORKS;
DOI
10.1049/ipr2.13117
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Video Anomaly Detection (VAD) has been an active research field for several decades. However, most existing approaches extract only a single type of feature from videos and define a single paradigm to indicate the extent of abnormality. Here, a coarse-to-fine three-level prediction is built by integrating different levels of spatio-temporal representations, better highlighting the difference between normal and abnormal behaviors. First, an object-level trajectory prediction is proposed to model historical human positions using a graph transformer network. Subsequently, skeleton-level prediction is achieved by incorporating the positional information from the trajectory prediction. More importantly, based on the predicted skeleton, a skeleton-guided pixel-level region prediction is performed. A novel Skeleton Conditioned Generative Adversarial Network (SCGAN) is designed to explore the correlation between skeleton-level and pixel-level motion prediction. Benefiting from SCGAN, the prediction of human regions draws on both coarse-grained and fine-grained motion features. This three-level prediction, namely Progressive Prediction Video Anomaly Detection (P3VAD), enlarges the prediction error on irregular motion patterns. Besides, a pixel-level analysis method is proposed to achieve Background-bias Elimination (BE) and denoise the predicted region. Experimental results validate the effectiveness of P3VAD on four benchmark datasets (ShanghaiTech, CUHK Avenue, IITB-Corridor, and ADOC). This is the first effort to progressively combine three-level predictions from coarse to fine-grained for VAD. The effectiveness of the framework is demonstrated through an extensive experimental evaluation on the four large-scale public benchmark datasets, using both micro-AUC and macro-AUC metrics.
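The abstract reports results in both micro-AUC and macro-AUC. As a rough illustration of how these two frame-level metrics are commonly distinguished in VAD work (micro pools the frames of all test videos before computing one AUC; macro computes an AUC per video and averages), the following is a minimal sketch. It is not the authors' evaluation code: the function names `auc`, `micro_auc`, `macro_auc` and the toy scores are assumptions made for illustration only.

```python
from typing import List, Sequence, Tuple

# One video = (per-frame labels, per-frame anomaly scores); 1 marks an abnormal frame.
Video = Tuple[Sequence[int], Sequence[float]]


def auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """ROC AUC computed as the fraction of (abnormal, normal) frame pairs
    in which the abnormal frame scores higher; ties count as 0.5."""
    pos = [s for l, s in zip(labels, scores) if l == 1]
    neg = [s for l, s in zip(labels, scores) if l == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one normal and one abnormal frame")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def micro_auc(videos: List[Video]) -> float:
    """Concatenate the frames of all videos, then compute a single AUC."""
    labels = [l for v_labels, _ in videos for l in v_labels]
    scores = [s for _, v_scores in videos for s in v_scores]
    return auc(labels, scores)


def macro_auc(videos: List[Video]) -> float:
    """Compute the AUC of each video separately, then average."""
    per_video = [auc(v_labels, v_scores) for v_labels, v_scores in videos]
    return sum(per_video) / len(per_video)


# Hypothetical toy data: the second video has a higher overall score level.
videos: List[Video] = [
    ([0, 0, 1, 1], [0.10, 0.20, 0.80, 0.90]),
    ([0, 1, 0, 1], [0.85, 0.95, 0.82, 0.99]),
]

print("micro-AUC:", micro_auc(videos))
print("macro-AUC:", macro_auc(videos))
```

In this toy case each video is ranked perfectly on its own, so the macro-AUC is 1.0, while pooling the frames lowers the micro-AUC to 0.875 because normal frames of the second video outscore an abnormal frame of the first. This is one reason the two metrics are often reported together.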
Pages: 2568-2583
Number of pages: 16