Skeleton-based human activity recognition using ConvLSTM and guided feature learning

Cited by: 41
Authors
Yadav, Santosh Kumar [1, 2, 3]
Tiwari, Kamlesh [4]
Pandey, Hari Mohan [5]
Akbar, Shaik Ali [1, 2]
Affiliations
[1] Acad Sci & Innovat Res AcSIR, Ghaziabad 201002, Uttar Pradesh, India
[2] Cent Elect Engn Res Inst CEERI, CSIR, Pilani 333031, Rajasthan, India
[3] DeepBlink LLC, 30 N Gould St Ste R, Sheridan, WY 82801 USA
[4] Birla Inst Technol & Sci Pilani, Dept CSIS, Pilani Campus, Pilani 333031, Rajasthan, India
[5] Edge Hill Univ, Dept Comp Sci, Ormskirk, Lancs, England
Keywords
Activity recognition; CNNs; LSTMs; ConvLSTM; Skeleton tracking; Fall detection
DOI
10.1007/s00500-021-06238-7
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Human activity recognition aims to determine the actions performed by a human in an image or video. Examples of human activity include standing, running, sitting, and sleeping. These activities may involve intricate motion patterns and undesired events such as falling. This paper proposes a novel deep convolutional long short-term memory (ConvLSTM) network for skeleton-based activity recognition and fall detection. The proposed ConvLSTM network is a sequential fusion of convolutional neural networks (CNNs), long short-term memory (LSTM) networks, and fully connected layers. The acquisition system applies human detection and pose estimation to pre-calculate skeleton coordinates from the image/video sequence. The ConvLSTM model uses the raw skeleton coordinates along with their characteristic geometrical and kinematic features to construct the novel guided features. The geometrical and kinematic features are built upon the raw skeleton coordinates using relative joint position values, differences between joints, spherical joint angles between selected joints, and their angular velocities. The novel spatiotemporal guided features are obtained using a trained multi-layer CNN-LSTM combination. A classification head consisting of fully connected layers is subsequently applied. The proposed model has been evaluated on the KinectHAR dataset, which has 130,000 samples with 81 attribute values and was collected with a Kinect (v2) sensor. Experimental results are compared against the performance of isolated CNN and LSTM networks. The proposed ConvLSTM achieves an accuracy of 98.89%, which is better than the CNN and LSTM networks, with accuracies of 93.89% and 92.75%, respectively. The proposed system has been tested in real time and is found to be independent of the pose, the facing of the camera, individuals, clothing, etc. The code and dataset will be made publicly available.
Pages: 877-890
Page count: 14
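The abstract names four hand-crafted feature families built on the raw skeleton coordinates: relative joint positions, differences between joints, spherical joint angles, and angular velocities. The sketch below illustrates one plausible way to compute them in plain Python. It is not the authors' implementation: the function names, the choice of root joint, the angle convention (polar angle measured from the +z axis), and the frame interval `dt` are all assumptions made for illustration.

```python
import math

def relative_positions(frame, root=0):
    """Express every joint relative to a chosen root joint (assumed: index 0)."""
    rx, ry, rz = frame[root]
    return [(x - rx, y - ry, z - rz) for x, y, z in frame]

def joint_difference(joint_a, joint_b):
    """Component-wise difference between two joints (vector from a to b)."""
    return tuple(b - a for a, b in zip(joint_a, joint_b))

def spherical_angles(parent, child):
    """Return (theta, phi) for the vector from parent to child.

    theta is the polar angle from the +z axis, phi the azimuth in the
    x-y plane. This convention is an assumption, not taken from the paper.
    """
    dx, dy, dz = joint_difference(parent, child)
    r = math.sqrt(dx * dx + dy * dy + dz * dz)
    if r == 0.0:
        return 0.0, 0.0  # degenerate case: the two joints coincide
    theta = math.acos(max(-1.0, min(1.0, dz / r)))
    phi = math.atan2(dy, dx)
    return theta, phi

def angular_velocity(angle_prev, angle_next, dt):
    """Finite-difference angular velocity between two consecutive frames.

    The angle difference is wrapped to [-pi, pi) so that a jump across
    the +/-pi boundary is not mistaken for a fast rotation.
    """
    diff = (angle_next - angle_prev + math.pi) % (2 * math.pi) - math.pi
    return diff / dt

# Two hypothetical frames with two joints each (x, y, z in metres).
frame_t0 = [(1.0, 1.0, 1.0), (1.0, 1.0, 2.0)]  # second joint straight above the first
frame_t1 = [(1.0, 1.0, 1.0), (2.0, 1.0, 1.0)]  # second joint now along +x
dt = 0.5  # assumed time between the two frames, in seconds

theta0, _ = spherical_angles(*frame_t0)
theta1, _ = spherical_angles(*frame_t1)
omega = angular_velocity(theta0, theta1, dt)  # polar angle changes by pi/2 in 0.5 s
```

In a full pipeline, features like these would be computed per frame, concatenated with the raw coordinates, and passed as a sequence to the CNN-LSTM stack; how the paper arrives at its 81 attributes is not stated in the abstract and is not reproduced here.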