Learning Spatiotemporal Features With 3DCNN and ConvGRU for Video Anomaly Detection

被引:0
|
作者
Wang, Xin [1 ]
Xie, Weixin [1 ]
Song, Jiayi [1 ]
机构
[1] Shenzhen Univ, ATR Natl Key Lab Def Technol, Shenzhen, Peoples R China
关键词
3DCNN; ConvGRU; Video anomaly detection;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Video anomaly detection aims to analyze the abnormal events or behaviors from massive monitoring video data, which is extremely challenging due to the ambiguous definition of abnormal behavior and the complex monitoring scene. Feature representation based on the hand-crafted of video local spatial area is more complicated, and it is difficult to learn the essential feature from the input video. In this paper, a deep autoencoder network combined with 3DCNN and ConvGRU is proposed to learn the spatiotemporal features for video anomaly. Firstly, 3DCNN and bidirectional ConvGRU are used to encode the local-global spatial features and short-long-term temporal features in the spatiotemporal dimension. Secondly, the reconstruction branch is introduced to reconstruct video frames, while the prediction branch is utilized to make the encoder to learn the better spatiotemporal feature at the training phase. In addition, the regularization of adjacent frames in a loss function is carried on to improve the temporal feature. The weights of the C3D model trained by action recognition are transferred to 3DCNN to prevent model over fitting. Experiments on real anomaly datasets shows the effectiveness of our proposed deep model.
引用
收藏
页码:474 / 479
页数:6
相关论文
共 50 条
  • [1] Learning Spatiotemporal Features using 3DCNN and Convolutional LSTM for Gesture Recognition
    Zhang, Liang
    Zhu, Guangming
    Shen, Peiyi
    Song, Juan
    Shah, Syed Afaq
    Bennamoun, Mohammed
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, : 3120 - 3128
  • [2] ConvGRU-CNN: Spatiotemporal Deep Learning for Real-World Anomaly Detection in Video Surveillance System
    Gandapur, Maryam Qasim
    Verdu, Elena
    INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2023, 8 (04): : 88 - 95
  • [3] Spatiotemporal Representation Learning for Video Anomaly Detection
    Li, Zhaoyan
    Li, Yaoshun
    Gao, Zhisheng
    IEEE ACCESS, 2020, 8 (08): : 25531 - 25542
  • [4] Video Saliency Detection by using an Enhance Methodology Involving a Combination of 3DCNN with Histograms
    Kumar, Suresh R.
    Mahalakshmi, P.
    Jothilakshmi, R.
    Kavitha, M. S.
    Balamuralitharan, S.
    INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL, 2022, 17 (02)
  • [5] 3DCNN landslide susceptibility considering spatial-factor features
    Liu, Mengmeng
    Liu, Jiping
    Xu, Shenghua
    Chen, Cai
    Bao, Shuai
    Wang, Zhuolu
    Du, Jun
    FRONTIERS IN ENVIRONMENTAL SCIENCE, 2023, 11
  • [6] Target Detection in Clutter Regions Based on 3DCNN for HFSWR
    Zhong, Jiangnan
    Zhang, Ling
    Li, Cheng
    Niu, Jiong
    Liu, Zhaokai
    Wang, Cheng
    Li, Zongtai
    OCEANS 2024 - SINGAPORE, 2024,
  • [7] VIDEO ANOMALY DETECTION IN SPATIOTEMPORAL CONTEXT
    Jiang, Fan
    Yuan, Junsong
    Tsaftaris, Sotirios A.
    Katsaggelos, Aggelos K.
    2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, : 705 - 708
  • [8] A Lightweight Driver Drowsiness Detection System Using 3DCNN With LSTM
    Alameen, Sara A.
    Alhothali, Areej M.
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2023, 44 (01): : 895 - 912
  • [9] Hand Gesture Recognition for Sign Languages Using 3DCNN for Efficient Detection
    Elangovan, Taranya
    Annie, R. Arockia Xavier
    Sundaresan, Keerthana
    Pradhakshya, J. D.
    COMPUTER METHODS, IMAGING AND VISUALIZATION IN BIOMECHANICS AND BIOMEDICAL ENGINEERING II, 2023, 38 : 215 - 233
  • [10] Pedestrian Detection from Sparse Point-Cloud using 3DCNN
    Tatebe, Yoshiki
    Deguchi, Daisuke
    Kawanishi, Yasutomo
    Ide, Ichiro
    Murase, Hiroshi
    Sakai, Utsushi
    2018 INTERNATIONAL WORKSHOP ON ADVANCED IMAGE TECHNOLOGY (IWAIT), 2018,