Learning Spatiotemporal Features With 3DCNN and ConvGRU for Video Anomaly Detection

被引：0

作者：

Wang, Xin ^{[1
]}

Xie, Weixin ^{[1
]}

Song, Jiayi ^{[1
]}

机构：

[1] Shenzhen Univ, ATR Natl Key Lab Def Technol, Shenzhen, Peoples R China

来源：

PROCEEDINGS OF 2018 14TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP) | 2018年

关键词：

3DCNN; ConvGRU; Video anomaly detection;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Video anomaly detection aims to analyze the abnormal events or behaviors from massive monitoring video data, which is extremely challenging due to the ambiguous definition of abnormal behavior and the complex monitoring scene. Feature representation based on the hand-crafted of video local spatial area is more complicated, and it is difficult to learn the essential feature from the input video. In this paper, a deep autoencoder network combined with 3DCNN and ConvGRU is proposed to learn the spatiotemporal features for video anomaly. Firstly, 3DCNN and bidirectional ConvGRU are used to encode the local-global spatial features and short-long-term temporal features in the spatiotemporal dimension. Secondly, the reconstruction branch is introduced to reconstruct video frames, while the prediction branch is utilized to make the encoder to learn the better spatiotemporal feature at the training phase. In addition, the regularization of adjacent frames in a loss function is carried on to improve the temporal feature. The weights of the C3D model trained by action recognition are transferred to 3DCNN to prevent model over fitting. Experiments on real anomaly datasets shows the effectiveness of our proposed deep model.

引用

页码：474 / 479

页数：6

共 50 条

[21] An intelligent adaptive learning framework for fake video detection using spatiotemporal features
Allada Koteswaramma
M. Babu Rao
G. Jaya Suma
Signal, Image and Video Processing, 2024, 18 : 2231 - 2241
[22] DBPNDNet: dual-branch networks using 3DCNN toward pulmonary nodule detection
Jian, Muwei
Jin, Haodong
Zhang, Linsong
Wei, Benzheng
Yu, Hui
MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2024, 62 (02) : 563 - 573
[23] Transfer learning for video anomaly detection
Bansod, Suprit
Nandedkar, Abhijeet
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 36 (03) : 1967 - 1975
[24] Learning deep spatiotemporal features for video captioning
Daskalakis, Eleftherios
Tzelepi, Maria
Tefas, Anastasios
PATTERN RECOGNITION LETTERS, 2018, 116 : 143 - 149
[25] DBPNDNet: dual-branch networks using 3DCNN toward pulmonary nodule detection
Muwei Jian
Haodong Jin
Linsong Zhang
Benzheng Wei
Hui Yu
Medical & Biological Engineering & Computing, 2024, 62 : 563 - 573
[26] Multi-class Classification of Alzheimer's Disease using 3DCNN Features and Multilayer Perceptron
Raju, Manu
Gopi, Varun P.
Anitha, V. S.
2021 SIXTH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2021, : 368 - 373
[27] Fast anomaly detection in video surveillance system using robust spatiotemporal and deep learning methods
Kotkar, Vijay A. A.
Sucharita, V.
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (22) : 34259 - 34286
[28] Deep Multi-view Representation Learning for Video Anomaly Detection Using Spatiotemporal Autoencoders
Deepak, K.
Srivathsan, G.
Roshan, S.
Chandrakala, S.
CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2021, 40 (03) : 1333 - 1349
[29] Fast anomaly detection in video surveillance system using robust spatiotemporal and deep learning methods
Vijay A. Kotkar
V. Sucharita
Multimedia Tools and Applications, 2023, 82 : 34259 - 34286
[30] Deep Multi-view Representation Learning for Video Anomaly Detection Using Spatiotemporal Autoencoders
K. Deepak
G. Srivathsan
S. Roshan
S. Chandrakala
Circuits, Systems, and Signal Processing, 2021, 40 : 1333 - 1349

← 1 2 3 4 5 →