Learning Spatiotemporal Features With 3DCNN and ConvGRU for Video Anomaly Detection

被引:0
|
作者
Wang, Xin [1 ]
Xie, Weixin [1 ]
Song, Jiayi [1 ]
机构
[1] Shenzhen Univ, ATR Natl Key Lab Def Technol, Shenzhen, Peoples R China
关键词
3DCNN; ConvGRU; Video anomaly detection;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Video anomaly detection aims to analyze the abnormal events or behaviors from massive monitoring video data, which is extremely challenging due to the ambiguous definition of abnormal behavior and the complex monitoring scene. Feature representation based on the hand-crafted of video local spatial area is more complicated, and it is difficult to learn the essential feature from the input video. In this paper, a deep autoencoder network combined with 3DCNN and ConvGRU is proposed to learn the spatiotemporal features for video anomaly. Firstly, 3DCNN and bidirectional ConvGRU are used to encode the local-global spatial features and short-long-term temporal features in the spatiotemporal dimension. Secondly, the reconstruction branch is introduced to reconstruct video frames, while the prediction branch is utilized to make the encoder to learn the better spatiotemporal feature at the training phase. In addition, the regularization of adjacent frames in a loss function is carried on to improve the temporal feature. The weights of the C3D model trained by action recognition are transferred to 3DCNN to prevent model over fitting. Experiments on real anomaly datasets shows the effectiveness of our proposed deep model.
引用
收藏
页码:474 / 479
页数:6
相关论文
共 50 条
  • [41] SPATIOTEMPORAL UTILIZATION OF DEEP FEATURES FOR VIDEO SALIENCY DETECTION
    Le, Trung-Nghia
    Sugimoto, Akihiro
    2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2017,
  • [42] Applying 3DCNN to Recognize the weighted Time-frequency Features by Phase Locking Value for Epilepsy Prediction
    Li, Mingai
    Zhang, Ziyue
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 3617 - 3621
  • [43] Perceptual metric learning for video anomaly detection
    Ramachandra, Bharathkumar
    Jones, Michael
    Vatsavai, Ranga Raju
    MACHINE VISION AND APPLICATIONS, 2021, 32 (03)
  • [44] Perceptual metric learning for video anomaly detection
    Bharathkumar Ramachandra
    Michael Jones
    Ranga Raju Vatsavai
    Machine Vision and Applications, 2021, 32
  • [45] Video anomaly detection guided by clustering learning
    Qiu, Shaoming
    Ye, Jingfeng
    Zhao, Jiancheng
    He, Lei
    Liu, Liangyu
    E, Bicong
    Huang, Xinchen
    PATTERN RECOGNITION, 2024, 153
  • [46] Light-weight 3DCNN for DeepFakes, FaceSwap and Face2Face facial forgery detection
    Kohli, Aditi
    Gupta, Abhinav
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (22) : 31391 - 31403
  • [47] Video anomaly detection with autoregressive modeling of covariance features
    Ali Enver Bilecen
    Alp Ozalp
    M. Sami Yavuz
    Huseyin Ozkan
    Signal, Image and Video Processing, 2022, 16 : 1027 - 1034
  • [48] Video anomaly detection with autoregressive modeling of covariance features
    Bilecen, Ali Enver
    Ozalp, Alp
    Yavuz, M. Sami
    Ozkan, Huseyin
    SIGNAL IMAGE AND VIDEO PROCESSING, 2022, 16 (04) : 1027 - 1034
  • [49] Enhancing Latent Features for Unsupervised Video Anomaly Detection
    Zhou, Linmao
    Chang, Hong
    Kang, Nan
    Zhao, Xiangjun
    Ma, Bingpeng
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2021, PT II, 2021, 13020 : 299 - 310
  • [50] P3CMQA: Single-Model Quality Assessment Using 3DCNN with Profile-Based Features
    Takei, Yuma
    Ishida, Takashi
    BIOENGINEERING-BASEL, 2021, 8 (03):