Anomaly recognition from surveillance videos using 3D convolution neural network

被引:42
|
作者
Maqsood, Ramna [1 ]
Bajwa, Usama Ijaz [1 ]
Saleem, Gulshan [1 ]
Raza, Rana Hammad [2 ]
Anwar, Muhammad Waqas [1 ]
机构
[1] COMSATS Univ Islamabad, Dept Comp Sci, Lahore Campus 1-5 KM Def Rd Off Raiwind Rd, Lahore, Pakistan
[2] Natl Univ Sci & Technol NUST NUST PNEC, Habib Ibrahim Rehmatullah Rd, Sindh, Pakistan
关键词
Anomalous activity recognition; 3DConvNets; Spatial augmentation; Spatial annotation;
D O I
10.1007/s11042-021-10570-3
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Anomalous activity recognition deals with identifying the patterns and events that vary from the normal stream. In a surveillance paradigm, these events range from abuse to fighting and road accidents to snatching, etc. Due to the sparse occurrence of anomalous events, anomalous activity recognition from surveillance videos is a challenging research task. The approaches reported can be generally categorized as handcrafted and deep learning-based. Most of the reported studies address binary classification i.e. anomaly detection from surveillance videos. But these reported approaches did not address other anomalous events e.g. abuse, fight, road accidents, shooting, stealing, vandalism, and robbery, etc. from surveillance videos. Therefore, this paper aims to provide an effective framework for the recognition of different real-world anomalies from videos. This study provides a simple, yet effective approach for learning spatiotemporal features using deep 3-dimensional convolutional networks (3D ConvNets) trained on the University of Central Florida (UCF) Crime video dataset. Firstly, the frame-level labels of the UCF Crime dataset are provided, and then to extract anomalous spatiotemporal features more efficiently a fine-tuned 3D ConvNets is proposed. Findings of the proposed study are twofold 1) There exist specific, detectable, and quantifiable features in UCF Crime video feed that associate with each other 2) Multiclass learning can improve generalizing competencies of the 3D ConvNets by effectively learning frame-level information of dataset and can be leveraged in terms of better results by applying spatial augmentation. The proposed study extracted 3D features by providing frame level information and spatial augmentation to a fine-tuned pre-trained model, namely 3DConvNets. Besides, the learned features are compact enough and the proposed approach outperforms significantly from state of art approaches in terms of accuracy on anomalous activity recognition having 82% AUC.
引用
收藏
页码:18693 / 18716
页数:24
相关论文
共 50 条
  • [31] 3D Deformable Convolution Temporal Reasoning network for action recognition
    Ou, Yangjun
    Chen, Zhenzhong
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 93
  • [32] 2D and 3D Face Recognition Using Convolutional Neural Network
    Hu, Huiying
    Shah, Syed Afaq Ali
    Bennamoun, Mohammed
    Molton, Michael
    TENCON 2017 - 2017 IEEE REGION 10 CONFERENCE, 2017, : 133 - 138
  • [33] A shallow 3D convolutional neural network for violence detection in videos
    Dündar, Naz
    Keçeli, Ali Seydi
    Kaya, Aydın
    Sever, Hayri
    Egyptian Informatics Journal, 2024, 26
  • [34] A shallow 3D convolutional neural network for violence detection in videos
    Dundar, Naz
    Keceli, Ali Seydi
    Kaya, Aydin
    Sever, Hayri
    EGYPTIAN INFORMATICS JOURNAL, 2024, 26
  • [35] Multimodal Biometrics Recognition Using a Deep Convolutional Neural Network with Transfer Learning in Surveillance Videos
    Aung, Hsu Mon Lei
    Pluempitiwiriyawej, Charnchai
    Hamamoto, Kazuhiko
    Wangsiripitak, Somkiat
    COMPUTATION, 2022, 10 (07)
  • [36] 3D Convolutional Neural Network for Action Recognition
    Zhang, Junhui
    Chen, Li
    Tian, Jing
    COMPUTER VISION, PT I, 2017, 771 : 600 - 607
  • [37] Noisy Phoneme Recognition Using 2D Convolution Neural Network
    Ramonaite, Justina
    Korvel, Grazina
    2023 IEEE 10TH JUBILEE WORKSHOP ON ADVANCES IN INFORMATION, ELECTRONIC AND ELECTRICAL ENGINEERING, AIEEE, 2023,
  • [38] Anomaly Event Detection Using Generative Adversarial Network for Surveillance Videos
    Ganokratanaa, Thittaporn
    Aramvith, Supavadee
    Sebe, Nicu
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 1395 - 1399
  • [39] Bayesian Feed Forward Neural Network-Based Efficient Anomaly Detection from Surveillance Videos
    Murugesan, M.
    Thilagamani, S.
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2022, 34 (01): : 389 - 405
  • [40] Fully Automatic Face Recognition from 3D Videos
    Hayat, Munawar
    Bennamoun, Mohammed
    El-Sallam, Amar A.
    2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 1415 - 1418