Spatio-temporal graph-based self-labeling for video anomaly detection

被引:0
|
作者
Xing, Meng [1 ,2 ]
Feng, Zhiyong [3 ]
Su, Yong [4 ]
Zhang, Yiming [3 ]
Oh, Changjae [5 ]
Gribova, Valeriya [6 ]
Filaretoy, Vladimir Fedorovich [6 ]
Huang, Deshuang [1 ,7 ]
机构
[1] Ningbo Inst Digital Twin, Eastern Inst Technol, 568 Tongxin Rd,Zhuangshi St, Ningbo 315201, Zhejiang, Peoples R China
[2] Univ Sci & Technol China, Sch Informat Sci & Technol, 96 JinZhai Rd, Hefei 230026, Anhui, Peoples R China
[3] Tianjin Univ, Coll Intelligence & Comp, 135 Yaguan Rd,Haihe Educ Pk, Tianjin 300350, Peoples R China
[4] Tianjin Normal Univ, Tianjin Key Lab Wireless Mobile Commun & Power Tra, 393 Binshui West Rd, Tianjin 300387, Peoples R China
[5] Queen Mary Univ London, Ctr Intelligent Sensing, Mile End Rd, London E1 4NS, England
[6] Russian Acad Sci, Inst Automat & Control Proc, Far Eastern Branch, Radio St 5, Vladivostok 690041, Primorsky Krai, Russia
[7] Shanghai East Hosp, Inst Regenerat Med, 150 Jimo Rd, Shanghai 200120, Peoples R China
基金
中国博士后科学基金; 美国国家科学基金会;
关键词
VAD; ST-graph; Self-labeling; Not-normal space; Object-level criterion; ABNORMAL EVENT DETECTION; CONVOLUTIONAL NETWORKS;
D O I
10.1016/j.neucom.2025.129576
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video anomaly detection (VAD) aims to identify abnormal events in a video sequence. Existing methods achieve VAD by learning the decision boundary between the normal space and the abnormal space pre-defined in the training data. However, these methods trend to neglect the distribution gap between the pre-defined abnormal space and the real one, which lead to overfitting on the normal space or bias toward the pre-defined abnormal space. In this paper, we propose a spatio-temporal graph-based self-labeling method that not only focuses on the pre-defined abnormal space but considers the real abnormal space, enabling it to capture the decision boundary between the normal space and a complementary space, called as the not-normal space. We first construct a spatio-temporal graph (ST-Graph) based on the objects of input video and utilize a spatio-temporal graph convolution network (ST-GCN) to model the interaction between objects. We then propose a self-labeling- based learning mechanism that encourages the proposed ST-GCN to record the normal events while abstaining from labeling the pseudo-abnormal events, thereby aggregating the pre-defined and real abnormal spaces into not-normal space. To evaluate the model performance on localizing anomalous objects and capturing interactions between objects, we further introduce an object-level criterion that bridges frame-level and pixel- level criteria. Our method is validated on three datasets and achieves state-of-the-art frame-level AUC results on Avenue (92.5%), and outperforms existing ST-Graph-based methods on UCSD Ped2 (96.5%) and ShanghaiTech (76.8%).
引用
收藏
页数:10
相关论文
共 50 条
  • [31] Dynamic Spatio-Temporal Graph-Based CNNs for Traffic Flow Prediction
    Chen, Ken
    Chen, Fei
    Lai, Baisheng
    Jin, Zhongming
    Liu, Yong
    Li, Kai
    Wei, Long
    Wang, Pengfei
    Tang, Yandong
    Huang, Jianqiang
    Hua, Xian-Sheng
    IEEE ACCESS, 2020, 8 : 185136 - 185145
  • [32] Online Anomaly Detection of Wind Turbines Based on Hierarchical Spatio-temporal Graph Neural Network
    Zheng Y.
    Wang C.
    Liu B.
    Yang J.
    Huang C.
    Dianli Xitong Zidonghua/Automation of Electric Power Systems, 2024, 48 (05): : 107 - 119
  • [33] A Hierarchical Spatio-Temporal Graph Convolutional Neural Network for Anomaly Detection in Videos
    Zeng, Xianlin
    Jiang, Yalong
    Ding, Wenrui
    Li, Hongguang
    Hao, Yafeng
    Qiu, Zifeng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (01) : 200 - 212
  • [34] Self-labeling video prediction
    Zhang, Wendong
    Wang, Yunbo
    Yang, Xiaokang
    DISPLAYS, 2023, 79
  • [35] HIERARCHICAL ACTIVITY DISCOVERY WITHIN SPATIO-TEMPORAL CONTEXT FOR VIDEO ANOMALY DETECTION
    Xu, Dan
    Wu, Xinyu
    Song, Dezhen
    Li, Nannan
    Chen, Yen-Lun
    2013 20TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2013), 2013, : 3597 - 3601
  • [36] Bidirectional Spatio-Temporal Feature Learning With Multiscale Evaluation for Video Anomaly Detection
    Zhong, Yuanhong
    Chen, Xia
    Hu, Yongting
    Tang, Panliang
    Ren, Fan
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (12) : 8285 - 8296
  • [37] Spectrum Anomaly Detection Based on Spatio-Temporal Network Prediction
    Peng, Chuang
    Hu, Weilin
    Wang, Lunwen
    ELECTRONICS, 2022, 11 (11)
  • [38] Scale-Aware Spatio-Temporal Relation Learning for Video Anomaly Detection
    Li, Guoqiu
    Cai, Guanxiong
    Zeng, Xingyu
    Zhao, Rui
    COMPUTER VISION - ECCV 2022, PT IV, 2022, 13664 : 333 - 350
  • [39] Anomaly Detection Using Spatio-Temporal Context Learned by Video Clip Sorting
    Shao, Wen
    Kawakami, Rei
    Naemura, Takeshi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2022, E105D (05) : 1094 - 1102
  • [40] Spatio-temporal Anomaly Detection in Traffic Data
    Wang, Qing
    Lv, Weifeng
    Du, Bowen
    ISCSIC'18: PROCEEDINGS OF THE 2ND INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND INTELLIGENT CONTROL, 2018,