Anomaly recognition from surveillance videos using 3D convolution neural network

Cited by: 42

Authors:

Maqsood, Ramna [1]

Bajwa, Usama Ijaz [1]

Saleem, Gulshan [1]

Raza, Rana Hammad [2]

Anwar, Muhammad Waqas [1]

Affiliations:

[1] COMSATS Univ Islamabad, Dept Comp Sci, Lahore Campus 1-5 KM Def Rd Off Raiwind Rd, Lahore, Pakistan

[2] Natl Univ Sci & Technol NUST NUST PNEC, Habib Ibrahim Rehmatullah Rd, Sindh, Pakistan

Source:

MULTIMEDIA TOOLS AND APPLICATIONS | 2021 / Vol. 80 / Issue 12

Keywords:

Anomalous activity recognition; 3DConvNets; Spatial augmentation; Spatial annotation;

DOI:

10.1007/s11042-021-10570-3

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

Anomalous activity recognition deals with identifying patterns and events that deviate from the normal stream. In a surveillance setting, these events range from abuse and fighting to road accidents and snatching. Because anomalous events occur sparsely, recognizing them in surveillance videos is a challenging research task. Reported approaches can be broadly categorized as handcrafted or deep learning-based. Most reported studies address binary classification, i.e., anomaly detection in surveillance videos, and do not distinguish among specific anomalous events such as abuse, fighting, road accidents, shooting, stealing, vandalism, and robbery. This paper therefore aims to provide an effective framework for recognizing different real-world anomalies in videos. The study provides a simple yet effective approach for learning spatiotemporal features using deep 3-dimensional convolutional networks (3D ConvNets) trained on the University of Central Florida (UCF) Crime video dataset. First, frame-level labels are provided for the UCF Crime dataset; then, to extract anomalous spatiotemporal features more efficiently, a fine-tuned 3D ConvNet is proposed. The findings are twofold: 1) there exist specific, detectable, and quantifiable features in the UCF Crime video feed that associate with each other; 2) multiclass learning can improve the generalization capability of the 3D ConvNets by effectively learning frame-level information from the dataset, and results can be further improved by applying spatial augmentation. The proposed study extracts 3D features by providing frame-level information and spatial augmentation to a fine-tuned pre-trained model, namely 3DConvNets. In addition, the learned features are compact, and the proposed approach significantly outperforms state-of-the-art approaches on anomalous activity recognition, achieving an AUC of 82%.
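
The abstract mentions spatial augmentation of clips fed to a 3D ConvNet but does not say which transforms are used. As a minimal sketch only, assuming a random crop plus a random horizontal flip (common choices, not confirmed by the record), the snippet below applies the same spatial transform to every frame of a clip so that temporal consistency is preserved. The function name `augment_clip`, the 16-frame clip length, the 128x171 frame size, and the 112x112 crop are all illustrative assumptions.

```python
import numpy as np


def augment_clip(clip, crop_size, rng):
    """Spatially augment a video clip of shape (T, H, W, C).

    Hypothetical helper: one random crop window and one random horizontal
    flip decision are drawn per clip and applied to all frames alike.
    """
    t, h, w, c = clip.shape
    crop_h, crop_w = crop_size
    if crop_h > h or crop_w > w:
        raise ValueError("crop size must not exceed the frame size")

    # Draw a single crop origin shared by every frame in the clip.
    top = int(rng.integers(0, h - crop_h + 1))
    left = int(rng.integers(0, w - crop_w + 1))
    out = clip[:, top:top + crop_h, left:left + crop_w, :]

    # Flip the whole clip horizontally with probability 0.5.
    if rng.random() < 0.5:
        out = out[:, :, ::-1, :]

    return np.ascontiguousarray(out)


# Assumed clip geometry: 16 RGB frames of 128x171 pixels, cropped to 112x112.
rng = np.random.default_rng(0)
clip = rng.integers(0, 256, size=(16, 128, 171, 3), dtype=np.uint8)
augmented = augment_clip(clip, (112, 112), rng)
print(augmented.shape)  # (16, 112, 112, 3)
```

Drawing the crop window and flip decision once per clip, rather than once per frame, keeps motion cues aligned across frames, which matters when the network convolves over the time axis.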

Pages: 18693-18716

Number of pages: 24

