Learning Spatiotemporal Features With 3DCNN and ConvGRU for Video Anomaly Detection

被引:0
|
作者
Wang, Xin [1 ]
Xie, Weixin [1 ]
Song, Jiayi [1 ]
机构
[1] Shenzhen Univ, ATR Natl Key Lab Def Technol, Shenzhen, Peoples R China
关键词
3DCNN; ConvGRU; Video anomaly detection;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Video anomaly detection aims to analyze the abnormal events or behaviors from massive monitoring video data, which is extremely challenging due to the ambiguous definition of abnormal behavior and the complex monitoring scene. Feature representation based on the hand-crafted of video local spatial area is more complicated, and it is difficult to learn the essential feature from the input video. In this paper, a deep autoencoder network combined with 3DCNN and ConvGRU is proposed to learn the spatiotemporal features for video anomaly. Firstly, 3DCNN and bidirectional ConvGRU are used to encode the local-global spatial features and short-long-term temporal features in the spatiotemporal dimension. Secondly, the reconstruction branch is introduced to reconstruct video frames, while the prediction branch is utilized to make the encoder to learn the better spatiotemporal feature at the training phase. In addition, the regularization of adjacent frames in a loss function is carried on to improve the temporal feature. The weights of the C3D model trained by action recognition are transferred to 3DCNN to prevent model over fitting. Experiments on real anomaly datasets shows the effectiveness of our proposed deep model.
引用
收藏
页码:474 / 479
页数:6
相关论文
共 50 条
  • [21] An intelligent adaptive learning framework for fake video detection using spatiotemporal features
    Allada Koteswaramma
    M. Babu Rao
    G. Jaya Suma
    Signal, Image and Video Processing, 2024, 18 : 2231 - 2241
  • [22] DBPNDNet: dual-branch networks using 3DCNN toward pulmonary nodule detection
    Jian, Muwei
    Jin, Haodong
    Zhang, Linsong
    Wei, Benzheng
    Yu, Hui
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2024, 62 (02) : 563 - 573
  • [23] Transfer learning for video anomaly detection
    Bansod, Suprit
    Nandedkar, Abhijeet
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 36 (03) : 1967 - 1975
  • [24] Learning deep spatiotemporal features for video captioning
    Daskalakis, Eleftherios
    Tzelepi, Maria
    Tefas, Anastasios
    PATTERN RECOGNITION LETTERS, 2018, 116 : 143 - 149
  • [25] DBPNDNet: dual-branch networks using 3DCNN toward pulmonary nodule detection
    Muwei Jian
    Haodong Jin
    Linsong Zhang
    Benzheng Wei
    Hui Yu
    Medical & Biological Engineering & Computing, 2024, 62 : 563 - 573
  • [26] Multi-class Classification of Alzheimer's Disease using 3DCNN Features and Multilayer Perceptron
    Raju, Manu
    Gopi, Varun P.
    Anitha, V. S.
    2021 SIXTH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2021, : 368 - 373
  • [27] Fast anomaly detection in video surveillance system using robust spatiotemporal and deep learning methods
    Kotkar, Vijay A. A.
    Sucharita, V.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (22) : 34259 - 34286
  • [28] Deep Multi-view Representation Learning for Video Anomaly Detection Using Spatiotemporal Autoencoders
    Deepak, K.
    Srivathsan, G.
    Roshan, S.
    Chandrakala, S.
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2021, 40 (03) : 1333 - 1349
  • [29] Fast anomaly detection in video surveillance system using robust spatiotemporal and deep learning methods
    Vijay A. Kotkar
    V. Sucharita
    Multimedia Tools and Applications, 2023, 82 : 34259 - 34286
  • [30] Deep Multi-view Representation Learning for Video Anomaly Detection Using Spatiotemporal Autoencoders
    K. Deepak
    G. Srivathsan
    S. Roshan
    S. Chandrakala
    Circuits, Systems, and Signal Processing, 2021, 40 : 1333 - 1349