Anomaly recognition from surveillance videos using 3D convolution neural network

被引:42
|
作者
Maqsood, Ramna [1 ]
Bajwa, Usama Ijaz [1 ]
Saleem, Gulshan [1 ]
Raza, Rana Hammad [2 ]
Anwar, Muhammad Waqas [1 ]
机构
[1] COMSATS Univ Islamabad, Dept Comp Sci, Lahore Campus 1-5 KM Def Rd Off Raiwind Rd, Lahore, Pakistan
[2] Natl Univ Sci & Technol NUST NUST PNEC, Habib Ibrahim Rehmatullah Rd, Sindh, Pakistan
关键词
Anomalous activity recognition; 3DConvNets; Spatial augmentation; Spatial annotation;
D O I
10.1007/s11042-021-10570-3
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Anomalous activity recognition deals with identifying the patterns and events that vary from the normal stream. In a surveillance paradigm, these events range from abuse to fighting and road accidents to snatching, etc. Due to the sparse occurrence of anomalous events, anomalous activity recognition from surveillance videos is a challenging research task. The approaches reported can be generally categorized as handcrafted and deep learning-based. Most of the reported studies address binary classification i.e. anomaly detection from surveillance videos. But these reported approaches did not address other anomalous events e.g. abuse, fight, road accidents, shooting, stealing, vandalism, and robbery, etc. from surveillance videos. Therefore, this paper aims to provide an effective framework for the recognition of different real-world anomalies from videos. This study provides a simple, yet effective approach for learning spatiotemporal features using deep 3-dimensional convolutional networks (3D ConvNets) trained on the University of Central Florida (UCF) Crime video dataset. Firstly, the frame-level labels of the UCF Crime dataset are provided, and then to extract anomalous spatiotemporal features more efficiently a fine-tuned 3D ConvNets is proposed. Findings of the proposed study are twofold 1) There exist specific, detectable, and quantifiable features in UCF Crime video feed that associate with each other 2) Multiclass learning can improve generalizing competencies of the 3D ConvNets by effectively learning frame-level information of dataset and can be leveraged in terms of better results by applying spatial augmentation. The proposed study extracted 3D features by providing frame level information and spatial augmentation to a fine-tuned pre-trained model, namely 3DConvNets. Besides, the learned features are compact enough and the proposed approach outperforms significantly from state of art approaches in terms of accuracy on anomalous activity recognition having 82% AUC.
引用
收藏
页码:18693 / 18716
页数:24
相关论文
共 50 条
  • [1] Anomaly recognition from surveillance videos using 3D convolution neural network
    Ramna Maqsood
    Usama Ijaz Bajwa
    Gulshan Saleem
    Rana Hammad Raza
    Muhammad Waqas Anwar
    Multimedia Tools and Applications, 2021, 80 : 18693 - 18716
  • [2] Automatic Traffic State Recognition from Road Videos Based on 3D Convolution Neural Network
    Peng B.
    Tang J.
    Zhang Y.
    Cai X.
    Meng F.
    Xinan Jiaotong Daxue Xuebao/Journal of Southwest Jiaotong University, 2021, 56 (01): : 153 - 159
  • [3] Automatic recognition of schizophrenia from facial videos using 3D convolutional neural network
    Huang, Jie
    Zhao, Yanli
    Qu, Wei
    Tian, Zhanxiao
    Tan, Yunlong
    Wang, Zhiren
    Tan, Shuping
    ASIAN JOURNAL OF PSYCHIATRY, 2022, 77
  • [4] Human Action Recognition based on 3D Convolution Neural Networks from RGBD Videos
    Al-Akam, Rawya
    Paulus, Dietrich
    Gharabaghi, Darius
    26. INTERNATIONAL CONFERENCE IN CENTRAL EUROPE ON COMPUTER GRAPHICS, VISUALIZATION AND COMPUTER VISION (WSCG 2018), 2018, 2803 : 18 - 26
  • [5] Using 3D Convolutional Neural Network in Surveillance Videos for Recognizing Human Actions
    Pushparaj, Sathyashrisharmilha
    Arumugam, Sakthivel
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2018, 15 (04) : 693 - 700
  • [6] Human Action Recognition using 3D Convolutional Neural Networks with 3D Motion Cuboids in Surveillance Videos
    Arunnehru, J.
    Chamundeeswari, G.
    Bharathi, S. Prasanna
    INTERNATIONAL CONFERENCE ON ROBOTICS AND SMART MANUFACTURING (ROSMA2018), 2018, 133 : 471 - 477
  • [7] Abnormal Activity Recognition from Surveillance Videos Using Convolutional Neural Network
    Habib, Shabana
    Hussain, Altaf
    Albattah, Waleed
    Islam, Muhammad
    Khan, Sheroz
    Khan, Rehan Ullah
    Khan, Khalil
    SENSORS, 2021, 21 (24)
  • [8] Video Anomaly Detection using Inflated 3D Convolution Network
    Koshti, Dipali
    Kamoji, Supriya
    Kalnad, Nehal
    Sreekumar, Suyash
    Bhujbal, Shreya
    PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT-2020), 2020, : 729 - 733
  • [9] Efficient anomaly recognition using surveillance videos
    Saleem G.
    Bajwa U.I.
    Raza R.H.
    Alqahtani F.H.
    Tolba A.
    Xia F.
    PeerJ Computer Science, 2022, 8
  • [10] Efficient anomaly recognition using surveillance videos
    Saleem, Gulshan
    Bajwa, Usama Ijaz
    Raza, Rana Hammad
    Alqahtani, Fayez Hussain
    Tolba, Amr
    Xia, Feng
    PEERJ COMPUTER SCIENCE, 2022, 8