Scale-Aware Spatio-Temporal Relation Learning for Video Anomaly Detection

Cited by: 15
Authors
Li, Guoqiu [1]
Cai, Guanxiong [2]
Zeng, Xingyu [2]
Zhao, Rui [2, 3]
Affiliations
[1] Tsinghua Univ, Beijing, Peoples R China
[2] SenseTime Res, Shanghai, Peoples R China
[3] Shanghai Jiao Tong Univ, Qing Yuan Res Inst, Shanghai, Peoples R China
Source
Keywords
Scale-aware; Weakly-supervised video anomaly detection; Spatio-temporal relation modeling
DOI
10.1007/978-3-031-19772-7_20
Chinese Library Classification (CLC)
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Recent progress in video anomaly detection (VAD) has shown that feature discrimination is the key to effectively distinguishing anomalies from normal events. We observe that many anomalous events occur in limited local regions, and severe background noise increases the difficulty of feature learning. In this paper, we propose a scale-aware weakly supervised learning approach to capture local and salient anomalous patterns from the background, using only coarse video-level labels as supervision. We achieve this by segmenting frames into non-overlapping patches and then capturing inconsistencies among different regions through our patch spatial relation (PSR) module, which consists of self-attention mechanisms and dilated convolutions. To address the scale variation of anomalies and enhance the robustness of our method, a multi-scale patch aggregation method is further introduced to enable local-to-global spatial perception by merging features of patches with different scales. Considering the importance of temporal cues, we extend the relation modeling from the spatial domain to the spatio-temporal domain with the help of the existing video temporal relation network to effectively encode the spatio-temporal dynamics in the video. Experimental results show that our proposed method achieves new state-of-the-art performance on the UCF-Crime and ShanghaiTech benchmarks. Code is available at https://github.com/nutuniv/SSRL.
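The patch-based idea in the abstract can be illustrated with a small NumPy sketch. This is not the authors' implementation (see the repository linked above for that): it is a toy example, assuming a frame is already given as an (H, W, C) feature array, that patch features are obtained by simple averaging, that the relation step is plain scaled dot-product self-attention without learned projections or dilated convolutions, and that scales are merged by concatenation. The function names split_into_patches, self_attention, and multi_scale_descriptor are hypothetical.

```python
import numpy as np


def split_into_patches(frame, patch):
    """Split an (H, W, C) array into non-overlapping patch x patch tiles.

    Returns an array of shape (num_patches, patch, patch, C), ordered
    row by row across the patch grid.
    """
    h, w, c = frame.shape
    if h % patch or w % patch:
        raise ValueError("frame height and width must be divisible by patch")
    gh, gw = h // patch, w // patch
    # (gh, patch, gw, patch, C) -> (gh, gw, patch, patch, C)
    tiles = frame.reshape(gh, patch, gw, patch, c).transpose(0, 2, 1, 3, 4)
    return tiles.reshape(gh * gw, patch, patch, c)


def self_attention(x):
    """Scaled dot-product self-attention over the rows of x, shape (N, D).

    Assumption: no learned query/key/value projections; this only shows
    how each patch feature is re-expressed as a mix of all patch features.
    """
    scores = x @ x.T / np.sqrt(x.shape[1])
    scores = scores - scores.max(axis=1, keepdims=True)  # numerical stability
    weights = np.exp(scores)
    weights /= weights.sum(axis=1, keepdims=True)
    return weights @ x


def multi_scale_descriptor(frame, patch_sizes=(2, 4)):
    """Concatenate one attention-refined descriptor per patch size.

    Assumption: mean pooling inside each patch and across patches stands
    in for the learned feature extractor and aggregation in the paper.
    """
    per_scale = []
    for p in patch_sizes:
        patches = split_into_patches(frame, p)
        feats = patches.mean(axis=(1, 2))      # (num_patches, C)
        related = self_attention(feats)        # (num_patches, C)
        per_scale.append(related.mean(axis=0)) # (C,)
    return np.concatenate(per_scale)


if __name__ == "__main__":
    frame = np.random.default_rng(0).random((8, 8, 3))
    print(multi_scale_descriptor(frame).shape)
```

Smaller patch sizes keep local detail while larger ones see more context, which is the intuition behind merging patches of different scales; the real model learns these features rather than averaging them.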
Pages: 333-350
Page count: 18