Learning Spatiotemporal Features With 3DCNN and ConvGRU for Video Anomaly Detection

被引：0

作者：

Wang, Xin ^{[1
]}

Xie, Weixin ^{[1
]}

Song, Jiayi ^{[1
]}

机构：

[1] Shenzhen Univ, ATR Natl Key Lab Def Technol, Shenzhen, Peoples R China

来源：

PROCEEDINGS OF 2018 14TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP) | 2018年

关键词：

3DCNN; ConvGRU; Video anomaly detection;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Video anomaly detection aims to analyze the abnormal events or behaviors from massive monitoring video data, which is extremely challenging due to the ambiguous definition of abnormal behavior and the complex monitoring scene. Feature representation based on the hand-crafted of video local spatial area is more complicated, and it is difficult to learn the essential feature from the input video. In this paper, a deep autoencoder network combined with 3DCNN and ConvGRU is proposed to learn the spatiotemporal features for video anomaly. Firstly, 3DCNN and bidirectional ConvGRU are used to encode the local-global spatial features and short-long-term temporal features in the spatiotemporal dimension. Secondly, the reconstruction branch is introduced to reconstruct video frames, while the prediction branch is utilized to make the encoder to learn the better spatiotemporal feature at the training phase. In addition, the regularization of adjacent frames in a loss function is carried on to improve the temporal feature. The weights of the C3D model trained by action recognition are transferred to 3DCNN to prevent model over fitting. Experiments on real anomaly datasets shows the effectiveness of our proposed deep model.

引用

页码：474 / 479

页数：6

共 50 条

[41] SPATIOTEMPORAL UTILIZATION OF DEEP FEATURES FOR VIDEO SALIENCY DETECTION
Le, Trung-Nghia
Sugimoto, Akihiro
2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2017,
[42] Applying 3DCNN to Recognize the weighted Time-frequency Features by Phase Locking Value for Epilepsy Prediction
Li, Mingai
Zhang, Ziyue
2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 3617 - 3621
[43] Perceptual metric learning for video anomaly detection
Ramachandra, Bharathkumar
Jones, Michael
Vatsavai, Ranga Raju
MACHINE VISION AND APPLICATIONS, 2021, 32 (03)
[44] Perceptual metric learning for video anomaly detection
Bharathkumar Ramachandra
Michael Jones
Ranga Raju Vatsavai
Machine Vision and Applications, 2021, 32
[45] Video anomaly detection guided by clustering learning
Qiu, Shaoming
Ye, Jingfeng
Zhao, Jiancheng
He, Lei
Liu, Liangyu
E, Bicong
Huang, Xinchen
PATTERN RECOGNITION, 2024, 153
[46] Light-weight 3DCNN for DeepFakes, FaceSwap and Face2Face facial forgery detection
Kohli, Aditi
Gupta, Abhinav
MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (22) : 31391 - 31403
[47] Video anomaly detection with autoregressive modeling of covariance features
Ali Enver Bilecen
Alp Ozalp
M. Sami Yavuz
Huseyin Ozkan
Signal, Image and Video Processing, 2022, 16 : 1027 - 1034
[48] Video anomaly detection with autoregressive modeling of covariance features
Bilecen, Ali Enver
Ozalp, Alp
Yavuz, M. Sami
Ozkan, Huseyin
SIGNAL IMAGE AND VIDEO PROCESSING, 2022, 16 (04) : 1027 - 1034
[49] Enhancing Latent Features for Unsupervised Video Anomaly Detection
Zhou, Linmao
Chang, Hong
Kang, Nan
Zhao, Xiangjun
Ma, Bingpeng
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2021, PT II, 2021, 13020 : 299 - 310
[50] P3CMQA: Single-Model Quality Assessment Using 3DCNN with Profile-Based Features
Takei, Yuma
Ishida, Takashi
BIOENGINEERING-BASEL, 2021, 8 (03):

← 1 2 3 4 5 →