Multi-view gait recognition system using spatio-temporal features and deep learning

被引:27
|
作者
Gul, Saba [1 ]
Malik, Muhammad Imran [1 ]
Khan, Gul Muhammad [2 ]
Shafait, Faisal [1 ]
机构
[1] Natl Univ Sci & Technol, Sch Elect Engn & Comp Sci, Islamabad, Pakistan
[2] Univ Engn & Technol, Natl Ctr AI, Peshawar, Pakistan
关键词
3D convolutional deep neural network (3D; CNN); Gait bio-metric; Gait energy image; Person identification; Optimization;
D O I
10.1016/j.eswa.2021.115057
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Systems based on physiological biometrics are ubiquitous but requires subject cooperation or high resolution to capture. Gait recognition is a great avenue for identification and authentication due to uniqueness of individual stride in an un-intrusive manner. Machine vision systems have been designed to capture the uniqueness of stride of a specific person but factors such as change in speed of stride, view point, clothes and carrying accessories make gait recognition challenging and open to innovation. Our proposed approach attempts to tackle these problems by capturing the spatio-temporal features of a gait sequence by training a 3D convolutional deep neural network (3D CNN). The proposed 3D CNN architecture tackles gait identification by employing holistic approach in the form of gait energy images (GEI) which is a condensed representation capturing the shape and motion characteristics of the the human gait. The network was evaluated on two of the largest publicly available datasets with substantial gender and age diversity; OULP and CASIA-B. Optimization strategies were explored to tune the hyper-parmeters and improve the performance of the 3D CNN network. The optimized 3D CNN and the GEI were effectively able to capture the unique characteristics of the gait cycle of an individual irrespective of the challenging covariates. State of the art results achieved on the multi-views and multiple carrying conditions of the subjects belonging to CASIA-B dataset demonstrating the efficacy of our proposed algorithm.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Multi-View Gait Recognition With Joint Local Multi-Scale and Global Contextual Spatio-Temporal Features
    Zhai, Wenzhe
    Li, Haomiao
    Zheng, Chaoqun
    Xing, Xianglei
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (02) : 1123 - 1135
  • [2] Deep Multi-view Spatio-Temporal Network for Urban Crime Prediction
    Salama, Usama
    Chen, Xiaocong
    Yao, Lina
    Paik, Hye-Young
    Wang, Xianzhi
    DATABASES THEORY AND APPLICATIONS (ADC 2021), 2021, 12610 : 50 - 61
  • [3] Gait Recognition using Spatio-temporal Silhouette-based Features
    Sabir, Azhin
    Al-jawad, Naseer
    Jassim, Sabah
    MOBILE MULTIMEDIA/IMAGE PROCESSING, SECURITY, AND APPLICATIONS 2013, 2013, 8755
  • [4] Spatio-temporal classification at multiple resolutions using multi-view regularization
    Nayak, Guruprasad
    Ghosh, Rahul
    Jia, Xiaowei
    Mithal, Varun
    Kumar, Vipin
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 4117 - 4120
  • [5] Multi-view Depth Estimation using Epipolar Spatio-Temporal Networks
    Long, Xiaoxiao
    Liu, Lingjie
    Li, Wei
    Theobalt, Christian
    Wang, Wenping
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 8254 - 8263
  • [6] Dense and Accurate Spatio-temporal Multi-view Stereovision
    Courchay, Jerome
    Pons, Jean-Philippe
    Monasse, Pascal
    Keriven, Renaud
    COMPUTER VISION - ACCV 2009, PT II, 2010, 5995 : 11 - +
  • [7] Hierarchical Spatio-Temporal Representation Learning for Gait Recognition
    Wang, Lei
    Liu, Bo
    Liang, Fangfang
    Wang, Bincheng
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 19582 - 19592
  • [8] Attention-aware spatio-temporal learning for multi-view gait-based age estimation and gender classification
    Huang, Binyuan
    Luo, Yongdong
    Xie, Jiahui
    Pan, Jiahui
    Zhou, Chengju
    IET COMPUTER VISION, 2022,
  • [9] Deep spatio-temporal features for multimodal emotion recognition
    Nguyen, Dung
    Nguyen, Kien
    Sridharan, Sridha
    Ghasemi, Afsane
    Dean, David
    Fookes, Clinton
    2017 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2017), 2017, : 1215 - 1223
  • [10] SPATIO-TEMPORAL MULTI-VIEW SYNTHESIS FOR FREE VIEWPOINT TELEVISION
    Kumar, Katta Phani
    Gupta, Sumana
    Venkatesh, K. S.
    2013 3DTV-CONFERENCE: THE TRUE VISION-CAPTURE, TRANSMISSION AND DISPALY OF 3D VIDEO (3DTV-CON), 2013,