Action recognition using kinematics posture feature on 3D skeleton joint locations

被引:48
|
作者
Ahad, Md Atiqur Rahman [1 ,4 ]
Ahmed, Masud [2 ]
Das Antar, Anindya [3 ]
Makihara, Yasushi [1 ]
Yagi, Yasushi [1 ]
机构
[1] Osaka Univ, Suita, Osaka, Japan
[2] Univ Maryland, Baltimore, MD 21201 USA
[3] Univ Michigan, Ann Arbor, MI 48109 USA
[4] Univ Dhaka, Dhaka, Bangladesh
关键词
Action recognition; Skeleton data; Kinematics posture feature (KPF); Position-based statistical feature (PSF); Joint angle; Joint position; Deep neural network; Ensemble architecture; Convrnn; Benchmark datasets; Linear joint position feature (LJPF); Angular joint position feature (AJPF); DEPTH; FRAMEWORK;
D O I
10.1016/j.patrec.2021.02.013
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Action recognition is a very widely explored research area in computer vision and related fields. We propose Kinematics Posture Feature (KPF) extraction from 3D joint positions based on skeleton data for improving the performance of action recognition. In this approach, we consider the skeleton 3D joints as kinematics sensors. We propose Linear Joint Position Feature (LJPF) and Angular Joint Position Feature (AJPF) based on 3D linear joint positions and angles between bone segments. We then combine these two kinematics features for each video frame for each action to create the KPF feature sets. These feature sets encode the variation of motion in the temporal domain as if each body joint represents kinematics position and orientation sensors. In the next stage, we process the extracted KPF feature descriptor by using a low pass filter, and segment them by using sliding windows with optimized length. This concept resembles the approach of processing kinematics sensor data. From the segmented windows, we compute the Position-based Statistical Feature (PSF). These features consist of temporal domain statistical features (e.g., mean, standard deviation, variance, etc.). These statistical features encode the variation of postures (i.e., joint positions and angles) across the video frames. For performing classification, we explore Support Vector Machine (Linear), RNN, CNNRNN, and ConvRNN model. The proposed PSF feature sets demonstrate prominent performance in both statistical machine learning-and deep learning-based models. For evaluation, we explore five benchmark datasets namely UTKinect-Action3D, Kinect Activity Recognition Dataset (KARD), MSR 3D Action Pairs, Florence 3D, and Office Activity Dataset (OAD). To prevent overfitting, we consider the leave-one-subject-out framework as the experimental setup and perform 10-fold cross-validation. Our approach outperforms several existing methods in these benchmark datasets and achieves very promising classification performance. (c) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页码:216 / 224
页数:9
相关论文
共 50 条
  • [11] A Survey on 3D Skeleton-Based Action Recognition Using Learning Method
    Ren, Bin
    Liu, Mengyuan
    Ding, Runwei
    Liu, Hong
    CYBORG AND BIONIC SYSTEMS, 2024, 5
  • [12] TagSleep3D: RF-based 3D Sleep Posture Skeleton Recognition
    Liu, Chen
    Dong, Zixuan
    Huang, Li
    Yan, Wenlong
    Wang, Xin
    Fang, Dingyi
    Chen, Xiaojiang
    PROCEEDINGS OF THE ACM ON INTERACTIVE MOBILE WEARABLE AND UBIQUITOUS TECHNOLOGIES-IMWUT, 2024, 8 (01):
  • [13] SkeleMotion: A New Representation of Skeleton Joint Sequences Based on Motion Information for 3D Action Recognition
    Caetano, Carlos
    Sena, Jessica
    Bremond, Francois
    dos Santos, Jefersson A.
    Schwartz, William Robson
    2019 16TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS), 2019,
  • [14] Kinematics Features for 3D Action Recognition Using Two-Stream CNN
    Wang, Jiangliu
    Liu, Yunhui
    2018 13TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2018, : 1731 - 1736
  • [15] Action Recognition Based on 3D Skeleton and RGB Frame Fusion
    Liu, Guiyu
    Qian, Jiuchao
    Wen, Fei
    Zhu, Xiaoguang
    Ying, Rendong
    Liu, Peilin
    2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 258 - 264
  • [16] Human Action Recognition Based on Quaternion 3D Skeleton Representation
    Xu Haiyang
    Kong Jun
    Jiang Min
    LASER & OPTOELECTRONICS PROGRESS, 2018, 55 (02)
  • [17] Modeling the skeleton-language uncertainty for 3D action recognition
    Wang, Mingdao
    Zhang, Xianlin
    Chen, Siqi
    Li, Xueming
    Zhang, Yue
    NEUROCOMPUTING, 2024, 608
  • [18] Real-Time Arm Gesture Recognition Using 3D Skeleton Joint Data
    Paraskevopoulos, Georgios
    Spyrou, Evaggelos
    Sgouropoulos, Dimitrios
    Giannakopoulos, Theodoros
    Mylonas, Phivos
    ALGORITHMS, 2019, 12 (05)
  • [19] A Multi-Feature Scheme For Posture Recognition With 3D TOF Sensor
    Leone, Alessandro
    Diraco, Giovanni
    Siciliano, Pietro
    2012 IEEE SENSORS PROCEEDINGS, 2012, : 204 - 207
  • [20] Deep Learning-Based Action Recognition Using 3D Skeleton Joints Information
    Tasnim, Nusrat
    Islam, Md. Mahbubul
    Baek, Joong-Hwan
    INVENTIONS, 2020, 5 (03) : 1 - 15