Action recognition using kinematics posture feature on 3D skeleton joint locations

被引:48
|
作者
Ahad, Md Atiqur Rahman [1 ,4 ]
Ahmed, Masud [2 ]
Das Antar, Anindya [3 ]
Makihara, Yasushi [1 ]
Yagi, Yasushi [1 ]
机构
[1] Osaka Univ, Suita, Osaka, Japan
[2] Univ Maryland, Baltimore, MD 21201 USA
[3] Univ Michigan, Ann Arbor, MI 48109 USA
[4] Univ Dhaka, Dhaka, Bangladesh
关键词
Action recognition; Skeleton data; Kinematics posture feature (KPF); Position-based statistical feature (PSF); Joint angle; Joint position; Deep neural network; Ensemble architecture; Convrnn; Benchmark datasets; Linear joint position feature (LJPF); Angular joint position feature (AJPF); DEPTH; FRAMEWORK;
D O I
10.1016/j.patrec.2021.02.013
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Action recognition is a very widely explored research area in computer vision and related fields. We propose Kinematics Posture Feature (KPF) extraction from 3D joint positions based on skeleton data for improving the performance of action recognition. In this approach, we consider the skeleton 3D joints as kinematics sensors. We propose Linear Joint Position Feature (LJPF) and Angular Joint Position Feature (AJPF) based on 3D linear joint positions and angles between bone segments. We then combine these two kinematics features for each video frame for each action to create the KPF feature sets. These feature sets encode the variation of motion in the temporal domain as if each body joint represents kinematics position and orientation sensors. In the next stage, we process the extracted KPF feature descriptor by using a low pass filter, and segment them by using sliding windows with optimized length. This concept resembles the approach of processing kinematics sensor data. From the segmented windows, we compute the Position-based Statistical Feature (PSF). These features consist of temporal domain statistical features (e.g., mean, standard deviation, variance, etc.). These statistical features encode the variation of postures (i.e., joint positions and angles) across the video frames. For performing classification, we explore Support Vector Machine (Linear), RNN, CNNRNN, and ConvRNN model. The proposed PSF feature sets demonstrate prominent performance in both statistical machine learning-and deep learning-based models. For evaluation, we explore five benchmark datasets namely UTKinect-Action3D, Kinect Activity Recognition Dataset (KARD), MSR 3D Action Pairs, Florence 3D, and Office Activity Dataset (OAD). To prevent overfitting, we consider the leave-one-subject-out framework as the experimental setup and perform 10-fold cross-validation. Our approach outperforms several existing methods in these benchmark datasets and achieves very promising classification performance. (c) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页码:216 / 224
页数:9
相关论文
共 50 条
  • [31] Deep learning-based action recognition with 3D skeleton: A survey
    Xing, Yuling
    Zhu, Jia
    CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2021, 6 (01) : 80 - 92
  • [32] A generically Contrastive Spatiotemporal Representation Enhancement for 3D skeleton action recognition
    Zhang, Shaojie
    Yin, Jianqin
    Dang, Yonghao
    PATTERN RECOGNITION, 2025, 164
  • [33] 3D skeleton-based action recognition with convolutional neural networks
    Van-Nam Hoang
    Thi-Lan Le
    Thanh-Hai Tran
    Hai-Vu
    Van-Toi Nguyen
    2019 INTERNATIONAL CONFERENCE ON MULTIMEDIA ANALYSIS AND PATTERN RECOGNITION (MAPR), 2019,
  • [34] Learning Clip Representations for Skeleton-Based 3D Action Recognition
    Ke, Qiuhong
    Bennamoun, Mohammed
    An, Senjian
    Sohel, Ferdous
    Boussaid, Farid
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (06) : 2842 - 2855
  • [35] Action Recognition and Fall Detection System Based on 3D Skeleton Model
    Minh-Tri Tran
    Anh-Khoa Hoang
    Ha Hoang
    PROCEEDINGS OF THE 2024 9TH INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION TECHNOLOGY, ICIIT 2024, 2024, : 86 - 93
  • [36] Skeleton-Based Action Recognition with Joint Coordinates as Feature Using Neural Oblivious Decision Ensembles
    Nasrul'Alam, Fakhrul Aniq Hakimi
    Shapiai, Mohd Ibrahim
    Batool, Uzma
    Ramli, Ahmad Kamal
    Elias, Khairil Ashraf
    NEW TRENDS IN INTELLIGENT SOFTWARE METHODOLOGIES, TOOLS AND TECHNIQUES, 2021, 337 : 380 - 392
  • [37] Action Recognition from 3D Skeleton Sequences using Deep Networks on Lie Group Features
    Rhif, Manel
    Wannous, Hazem
    Farah, Imed Riadh
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 3427 - 3432
  • [38] Local Surface Geometric Feature for 3D human action recognition
    Zhang, Erhu
    Chen, Wanjun
    Zhang, Zhuomin
    Zhang, Yan
    NEUROCOMPUTING, 2016, 208 : 281 - 289
  • [39] Geometric Deep Learning on Skeleton Sequences for 2D/3D Action Recognition
    Friji, Rasha
    Drira, Hassen
    Chaieb, Faten
    PROCEEDINGS OF THE 15TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS, VOL 5: VISAPP, 2020, : 196 - 204
  • [40] The Issues of 3D Hand Gesture and Posture Recognition Using the Kinect
    Boulabiar, Mohamed-Ikbel
    Coppin, Gilles
    Poirier, Franck
    HUMAN-COMPUTER INTERACTION: ADVANCED INTERACTION MODALITIES AND TECHNIQUES, PT II, 2014, 8511 : 205 - 214