Skeleton-based human activity recognition using ConvLSTM and guided feature learning

被引:41
|
作者
Yadav, Santosh Kumar [1 ,2 ,3 ]
Tiwari, Kamlesh [4 ]
Pandey, Hari Mohan [5 ]
Akbar, Shaik Ali [1 ,2 ]
机构
[1] Acad Sci & Innovat Res AcSIR, Ghaziabad 201002, Uttar Pradesh, India
[2] Cent Elect Engn Res Inst CEERI, CSIR, Pilani 333031, Rajasthan, India
[3] DeepBlink LLC, 30 N Gould St Ste R, Sheridan, WY 82801 USA
[4] Birla Inst Technol & Sci Pilani, Dept CSIS, Pilani Campus, Pilani 333031, Rajasthan, India
[5] Edge Hill Univ, Dept Comp Sci, Ormskirk, Lancs, England
关键词
Activity recognition; CNNs; LSTMs; ConvLTM; Skeleton tracking; FALL DETECTION;
D O I
10.1007/s00500-021-06238-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human activity recognition aims to determine actions performed by a human in an image or video. Examples of human activity include standing, running, sitting, sleeping, etc. These activities may involve intricate motion patterns and undesired events such as falling. This paper proposes a novel deep convolutional long short-term memory (ConvLSTM) network for skeletal-based activity recognition and fall detection. The proposed ConvLSTM network is a sequential fusion of convolutional neural networks (CNNs), long short-term memory (LSTM) networks, and fully connected layers. The acquisition system applies human detection and pose estimation to pre-calculate skeleton coordinates from the image/video sequence. The ConvLSTM model uses the raw skeleton coordinates along with their characteristic geometrical and kinematic features to construct the novel guided features. The geometrical and kinematic features are built upon raw skeleton coordinates using relative joint position values, differences between joints, spherical joint angles between selected joints, and their angular velocities. The novel spatiotemporal-guided features are obtained using a trained multi-player CNN-LSTM combination. Classification head including fully connected layers is subsequently applied. The proposed model has been evaluated on the KinectHAR dataset having 130,000 samples with 81 attribute values, collected with the help of a Kinect (v2) sensor. Experimental results are compared against the performance of isolated CNNs and LSTM networks. Proposed ConvLSTM have achieved an accuracy of 98.89% that is better than CNNs and LSTMs having an accuracy of 93.89 and 92.75%, respectively. The proposed system has been tested in realtime and is found to be independent of the pose, facing of the camera, individuals, clothing, etc. The code and dataset will be made publicly available.
引用
收藏
页码:877 / 890
页数:14
相关论文
共 50 条
  • [1] Skeleton-based human activity recognition using ConvLSTM and guided feature learning
    Santosh Kumar Yadav
    Kamlesh Tiwari
    Hari Mohan Pandey
    Shaik Ali Akbar
    Soft Computing, 2022, 26 : 877 - 890
  • [2] Feature difference and feature correlation learning mechanism for skeleton-based action recognition
    Qing, Ruxin
    Jiang, Min
    Kong, Jun
    JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (01)
  • [3] Adaptive Feature Selection With Reinforcement Learning for Skeleton-Based Action Recognition
    Xu, Zheyuan
    Wang, Yingfu
    Jiang, Jiaqin
    Yao, Jian
    Li, Liang
    IEEE ACCESS, 2020, 8 : 213038 - 213051
  • [4] Joint Selection using Deep Reinforcement Learning for Skeleton-based Activity Recognition
    Nikpour, Bahareh
    Armanfard, Narges
    2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2021, : 1056 - 1061
  • [5] Robust Multi-Feature Learning for Skeleton-Based Action Recognition
    Wang, Yingfu
    Xu, Zheyuan
    Li, Li
    Yao, Jian
    IEEE ACCESS, 2019, 7 : 148658 - 148671
  • [6] InfoGCN: Representation Learning for Human Skeleton-based Action Recognition
    Chi, Hyung-gun
    Ha, Myoung Hoon
    Chi, Seunggeun
    Lee, Sang Wan
    Huang, Qixing
    Ramani, Karthik
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 20154 - 20164
  • [7] Skeleton-based human activity recognition for elderly monitoring systems
    Hbali, Youssef
    Hbali, Sara
    Ballihi, Lahoucine
    Sadgal, Mohammed
    IET COMPUTER VISION, 2018, 12 (01) : 16 - 26
  • [8] Skeleton-based STIP feature and discriminant sparse coding for human action recognition
    Ushapreethi, P.
    Priya, Lakshmi G. G.
    INTERNATIONAL JOURNAL OF INTELLIGENT UNMANNED SYSTEMS, 2021, 9 (01) : 43 - 61
  • [9] RBF Models with Shallow and Deep Feature for Skeleton-based Human Gesture Recognition
    Dai-Hai Nguyen
    Quoc-Thang Pham
    Duc-Dung Nguyen
    2017 4TH NAFOSTED CONFERENCE ON INFORMATION AND COMPUTER SCIENCE (NICS), 2017, : 72 - 77
  • [10] Study on the edge computing method for skeleton-based human action feature recognition
    You W.
    Wang X.
    Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument, 2020, 41 (10): : 156 - 164