Multi-view gait recognition system using spatio-temporal features and deep learning

Cited by: 27
Authors
Gul, Saba [1]
Malik, Muhammad Imran [1]
Khan, Gul Muhammad [2]
Shafait, Faisal [1]
Affiliations
[1] Natl Univ Sci & Technol, Sch Elect Engn & Comp Sci, Islamabad, Pakistan
[2] Univ Engn & Technol, Natl Ctr AI, Peshawar, Pakistan
Keywords
3D convolutional deep neural network (3D CNN); Gait biometric; Gait energy image; Person identification; Optimization;
DOI
10.1016/j.eswa.2021.115057
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Systems based on physiological biometrics are ubiquitous but require subject cooperation or high-resolution capture. Gait recognition is a promising avenue for identification and authentication because an individual's stride is distinctive and can be captured unobtrusively. Machine vision systems have been designed to capture the uniqueness of a specific person's stride, but factors such as changes in walking speed, viewpoint, clothing, and carried accessories make gait recognition challenging and open to innovation. Our proposed approach attempts to tackle these problems by training a 3D convolutional deep neural network (3D CNN) to capture the spatio-temporal features of a gait sequence. The proposed 3D CNN architecture tackles gait identification with a holistic approach in the form of gait energy images (GEI), a condensed representation that captures the shape and motion characteristics of the human gait. The network was evaluated on two of the largest publicly available datasets with substantial gender and age diversity: OULP and CASIA-B. Optimization strategies were explored to tune the hyperparameters and improve the performance of the 3D CNN. The optimized 3D CNN and the GEI were able to capture the unique characteristics of an individual's gait cycle irrespective of the challenging covariates. State-of-the-art results were achieved across the multiple views and carrying conditions of the subjects in the CASIA-B dataset, demonstrating the efficacy of the proposed algorithm.
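As an illustration of the GEI representation named in the abstract, the following is a minimal sketch of the standard definition: a GEI is the pixel-wise average of the aligned binary silhouettes over one gait cycle, G(x, y) = (1/N) Σ_t B_t(x, y). It assumes the silhouettes have already been segmented, size-normalized, and centered. The function name `gait_energy_image` and the toy data are hypothetical; this is not the authors' code, and the record does not describe their preprocessing.

```python
import numpy as np


def gait_energy_image(silhouettes):
    """Average aligned binary silhouettes of shape (T, H, W) into one (H, W) GEI.

    Assumption: frames are already segmented, size-normalized and centered.
    """
    frames = np.asarray(silhouettes, dtype=np.float64)
    if frames.ndim != 3 or frames.shape[0] == 0:
        raise ValueError("expected a non-empty array of shape (T, H, W)")
    # Binarize so that every pixel is 0 (background) or 1 (body).
    binary = (frames > 0).astype(np.float64)
    # Pixel-wise mean over the time axis: bright pixels are static body parts,
    # intermediate values mark regions that move during the cycle.
    return binary.mean(axis=0)


# Toy gait cycle: 4 frames of 3x3 pixels (hypothetical data).
cycle = np.zeros((4, 3, 3))
cycle[:, 0, 1] = 1   # "head" pixel present in every frame
cycle[0, 2, 0] = 1   # "leg" pixel present in one frame only
cycle[1, 2, 2] = 1   # another "leg" position in a different frame
gei = gait_energy_image(cycle)
```

In this toy example the always-present pixel averages to 1.0 and each pixel seen in a single frame averages to 0.25, which is how a GEI condenses shape and motion into one image.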
Pages: 9