Anomaly recognition from surveillance videos using 3D convolution neural network

被引:42
|
作者
Maqsood, Ramna [1 ]
Bajwa, Usama Ijaz [1 ]
Saleem, Gulshan [1 ]
Raza, Rana Hammad [2 ]
Anwar, Muhammad Waqas [1 ]
机构
[1] COMSATS Univ Islamabad, Dept Comp Sci, Lahore Campus 1-5 KM Def Rd Off Raiwind Rd, Lahore, Pakistan
[2] Natl Univ Sci & Technol NUST NUST PNEC, Habib Ibrahim Rehmatullah Rd, Sindh, Pakistan
关键词
Anomalous activity recognition; 3DConvNets; Spatial augmentation; Spatial annotation;
D O I
10.1007/s11042-021-10570-3
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Anomalous activity recognition deals with identifying the patterns and events that vary from the normal stream. In a surveillance paradigm, these events range from abuse to fighting and road accidents to snatching, etc. Due to the sparse occurrence of anomalous events, anomalous activity recognition from surveillance videos is a challenging research task. The approaches reported can be generally categorized as handcrafted and deep learning-based. Most of the reported studies address binary classification i.e. anomaly detection from surveillance videos. But these reported approaches did not address other anomalous events e.g. abuse, fight, road accidents, shooting, stealing, vandalism, and robbery, etc. from surveillance videos. Therefore, this paper aims to provide an effective framework for the recognition of different real-world anomalies from videos. This study provides a simple, yet effective approach for learning spatiotemporal features using deep 3-dimensional convolutional networks (3D ConvNets) trained on the University of Central Florida (UCF) Crime video dataset. Firstly, the frame-level labels of the UCF Crime dataset are provided, and then to extract anomalous spatiotemporal features more efficiently a fine-tuned 3D ConvNets is proposed. Findings of the proposed study are twofold 1) There exist specific, detectable, and quantifiable features in UCF Crime video feed that associate with each other 2) Multiclass learning can improve generalizing competencies of the 3D ConvNets by effectively learning frame-level information of dataset and can be leveraged in terms of better results by applying spatial augmentation. The proposed study extracted 3D features by providing frame level information and spatial augmentation to a fine-tuned pre-trained model, namely 3DConvNets. Besides, the learned features are compact enough and the proposed approach outperforms significantly from state of art approaches in terms of accuracy on anomalous activity recognition having 82% AUC.
引用
收藏
页码:18693 / 18716
页数:24
相关论文
共 50 条
  • [41] Violence detection in videos using interest frame extraction and 3D convolutional neural network
    Mahmoodi, Javad
    Nezamabadi-pour, Hossein
    Abbasi-Moghadam, Dariush
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (15) : 20945 - 20961
  • [42] Violence detection in videos using interest frame extraction and 3D convolutional neural network
    Javad Mahmoodi
    Hossein Nezamabadi-pour
    Dariush Abbasi-Moghadam
    Multimedia Tools and Applications, 2022, 81 : 20945 - 20961
  • [43] Convolutional Neural Network for 3D Object Recognition using Volumetric Representation
    Xu, Xiaofan
    Dehghani, Alireza
    Corrigan, David
    Caulfield, Sam
    Moloney, David
    2016 FIRST INTERNATIONAL WORKSHOP ON SENSING, PROCESSING AND LEARNING FOR INTELLIGENT MACHINES (SPLINE), 2016,
  • [44] 3D Object Recognition Using Multi-moment and Neural Network
    Xu Sheng
    Peng Qi-cong
    2008 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS PROCEEDINGS, VOLS 1 AND 2: VOL 1: COMMUNICATION THEORY AND SYSTEM, 2008, : 1119 - 1123
  • [45] 3D convolution neural network-based person identification using gait cycles
    P. Supraja
    Rijo Jackson Tom
    Ravi Shekhar Tiwari
    V. Vijayakumar
    Yan Liu
    Evolving Systems, 2021, 12 : 1045 - 1056
  • [46] Relating brain structure images to personality characteristics using 3D convolution neural network
    Cao, Lixian
    Liang, Yanchun
    Lv, Wei
    Park, Kaechang
    Miura, Yasuhiro
    Shinomiya, Yuki
    Yoshida, Shinichi
    CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2021, 6 (03) : 338 - 346
  • [47] 3D convolution neural network-based person identification using gait cycles
    Supraja, P.
    Tom, Rijo Jackson
    Tiwari, Ravi Shekhar
    Vijayakumar, V.
    Liu, Yan
    EVOLVING SYSTEMS, 2021, 12 (04) : 1045 - 1056
  • [48] An Efficient Anomaly Recognition Framework Using an Attention Residual LSTM in Surveillance Videos
    Ullah, Waseem
    Ullah, Amin
    Hussain, Tanveer
    Khan, Zulfiqar Ahmad
    Baik, Sung Wook
    SENSORS, 2021, 21 (08)
  • [49] High Speed and Accuracy of Animation 3D Pose Recognition Based on an Improved Deep Convolution Neural Network
    Ding, Wei
    Li, Wenfa
    APPLIED SCIENCES-BASEL, 2023, 13 (13):
  • [50] 3D Brain Image Segmentation Using 3D Tiled Convolution Neural Networks
    Haque, Md Mahibul
    Ria, Jobeda Khanam
    Al Mannan, Fahad
    Majumder, Sadman
    Uddin, Reaz
    Abed, Mahjabeen Tamanna
    Alam, Md Ashraful
    PATTERN RECOGNITION AND PREDICTION XXXV, 2024, 13040