Video Anomaly Detection Based on Space-Time Fusion Graph Network Learning

被引:0
|
作者
Zhou H. [1 ]
Zhan Y. [1 ]
Mao Q. [1 ]
机构
[1] School of Computer Science and Telecommunication Engineering, Jiangsu University, Zhenjiang
来源
Zhan, Yongzhao (yzzhan@ujs.edu.cn) | 1600年 / Science Press卷 / 58期
基金
中国国家自然科学基金;
关键词
Adaptive weighting; Graph convolutional network; Spatial similarity graph; Temporal trend graph; Video anomaly detection;
D O I
10.7544/issn1000-1239.2021.20200264
中图分类号
学科分类号
摘要
There are strong correlations among spatial-temporal features of abnormal events in videos. Aiming at the problem of performance for abnormal event detection caused by these correlations, a video anomaly detection method based on space-time fusion graph network learning is proposed. In this method, spatial similarity graph and temporal trend graph for the segments are constructed in terms of the features of the segments. The spatial similarity graph is built dynamically by treating the features of the video segments as the vertexes in graph. In this graph, the weights of edges are dynamically formed by taking the relationship between vertex and its Top-k similarity vertexes into account. The temporal trend graph is built by taking the time distance for m sequential segments into account. The space-time fusion graph convolutional network is constructed by adaptively weighting the spatial similarity graph and temporal trend graph. The video embedding features are learnt and generated by using this graph convolutional network. A graph sparse regularization is added to the ranking loss, in order to reduce the over-smoothing effect of graph model and improve detection performance. The experiments are conducted on two challenging video datasets: UCF-Crime(University of Central Florida crime dataset) and ShanghaiTech. ROC(receiver operating characteristic curve) and AUC (area under curve) are taken as performance metrics. Our method obtains the AUC score of 80.76% rising by 5.35% compared with the baseline on UCF-Crime dataset, and also gets the score of 89.88% rising by 5.44% compared with SOTA(state of the art) weakly supervised algorithm on ShanghaiTech. The experimental results show that our proposed method can improve the performance of video abnormal event detection effectively. © 2021, Science Press. All right reserved.
引用
收藏
页码:48 / 59
页数:11
相关论文
共 31 条
  • [1] Pawar K, Attar V., Deep learning approaches for video-based anomalous activity detection, World Wide Web, 22, 2, pp. 571-601, (2019)
  • [2] Li Weixin, Mahadevan V, Vasconcelos N., Anomaly detection and localization in crowded scenes, IEEE Transactions on Pattern Analysis and Machine Intelligence, 36, 1, pp. 18-32, (2014)
  • [3] Wang Xinggang, Yan Yongluan, Tang Peng, Et al., Revisiting multiple instance neural networks, Pattern Recognition, 74, pp. 15-24, (2018)
  • [4] Zhou Zhihua, Sun Yuyin, Li Yufeng, Multi-instance learning by treating instances as non-i.i.d. samples, Proc of the 26th Annual Int Conf on Machine Learning, pp. 1249-1256, (2009)
  • [5] Saligrama V, Konrad J, Jodoin P M., Video anomaly identification, IEEE Signal Process, 27, 5, pp. 18-33, (2010)
  • [6] Leyva R, Sanchez V, Li C., Video anomaly detection with compact feature sets for online performance, IEEE Transactions on Image Process, 26, 7, pp. 3463-3478, (2017)
  • [7] Lu Cewu, Shi Jiangping, Jia Jiaya, Abnormal event detection at 150 FPS in MATLAB, Proc of the IEEE Int Conf on Computer Vision, pp. 2720-2727, (2013)
  • [8] Hasan M, Choi J, Neumann J, Et al., Learning temporal regularity in video sequences, Proc of the IEEE Int Conf on Computer Vision and Pattern Recognition, pp. 733-742, (2016)
  • [9] Sabokrou M, Fayyaz M, Fathy M, Et al., Deep-cascade: Cascading 3D deep neural networks for fast anomaly detection and localization in crowded scenes, IEEE Transactions on Image Processing, 26, 4, pp. 1992-2004, (2017)
  • [10] Xu Dan, Ricci E, Yan Yan, Et al., Learning deep repre-sentations of appearance and motion for anomalous event detection, Proc of the British Machine Vision Conf, (2015)