Action recognition using kinematics posture feature on 3D skeleton joint locations

Cited by: 48
Authors
Ahad, Md Atiqur Rahman [1, 4]
Ahmed, Masud [2]
Das Antar, Anindya [3]
Makihara, Yasushi [1]
Yagi, Yasushi [1]
Affiliations
[1] Osaka Univ, Suita, Osaka, Japan
[2] Univ Maryland, Baltimore, MD 21201 USA
[3] Univ Michigan, Ann Arbor, MI 48109 USA
[4] Univ Dhaka, Dhaka, Bangladesh
Keywords
Action recognition; Skeleton data; Kinematics posture feature (KPF); Position-based statistical feature (PSF); Joint angle; Joint position; Deep neural network; Ensemble architecture; ConvRNN; Benchmark datasets; Linear joint position feature (LJPF); Angular joint position feature (AJPF); DEPTH; FRAMEWORK
DOI
10.1016/j.patrec.2021.02.013
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Action recognition is a widely explored research area in computer vision and related fields. We propose Kinematics Posture Feature (KPF) extraction from the 3D joint positions in skeleton data to improve action recognition performance. In this approach, we treat the 3D skeleton joints as kinematics sensors. We propose the Linear Joint Position Feature (LJPF) and the Angular Joint Position Feature (AJPF), based on 3D linear joint positions and the angles between bone segments. We then combine these two kinematics features for each video frame of each action to create the KPF feature sets. These feature sets encode the variation of motion in the temporal domain, as if each body joint were a kinematics position and orientation sensor. In the next stage, we pass the extracted KPF descriptors through a low-pass filter and segment them using sliding windows of optimized length, which resembles the way kinematics sensor data are processed. From the segmented windows, we compute the Position-based Statistical Feature (PSF), which consists of temporal-domain statistical features (e.g., mean, standard deviation, and variance). These statistical features encode the variation of postures (i.e., joint positions and angles) across the video frames. For classification, we explore a linear Support Vector Machine (SVM), RNN, CNNRNN, and ConvRNN models. The proposed PSF feature sets perform strongly in both statistical machine learning- and deep learning-based models. For evaluation, we use five benchmark datasets: UTKinect-Action3D, Kinect Activity Recognition Dataset (KARD), MSR 3D Action Pairs, Florence 3D, and Office Activity Dataset (OAD). To prevent overfitting, we adopt the leave-one-subject-out framework as the experimental setup and perform 10-fold cross-validation.
Our approach outperforms several existing methods on these benchmark datasets and achieves very promising classification performance. (c) 2021 Elsevier B.V. All rights reserved.
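The stages named in the abstract (an angle between bone segments, low-pass filtering, sliding-window segmentation, and per-window statistics) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the moving-average filter standing in for the unspecified low-pass filter, and the values of `k`, `length`, and `step` are all assumptions made for illustration only.

```python
import math
from statistics import mean, pstdev, pvariance


def joint_angle(a, b, c):
    """Angle in radians at joint b between bone segments b->a and b->c.

    Hypothetical helper: the abstract only says the angular feature uses
    angles between bone segments, not how they are computed.
    """
    u = [a[i] - b[i] for i in range(3)]
    v = [c[i] - b[i] for i in range(3)]
    dot = sum(x * y for x, y in zip(u, v))
    norm = math.sqrt(sum(x * x for x in u)) * math.sqrt(sum(y * y for y in v))
    if norm == 0.0:
        return 0.0  # degenerate bone of zero length
    # Clamp to [-1, 1] to guard against floating-point drift.
    return math.acos(max(-1.0, min(1.0, dot / norm)))


def moving_average(signal, k=3):
    """Assumed low-pass filter: trailing mean over up to k samples."""
    out = []
    for i in range(len(signal)):
        window = signal[max(0, i - k + 1): i + 1]
        out.append(sum(window) / len(window))
    return out


def window_stats(signal, length, step):
    """Mean, standard deviation, and variance of each sliding window."""
    feats = []
    for start in range(0, len(signal) - length + 1, step):
        w = signal[start:start + length]
        feats.append((mean(w), pstdev(w), pvariance(w)))
    return feats


if __name__ == "__main__":
    # Made-up (shoulder, elbow, wrist) positions for four frames.
    frames = [
        ((0.0, 1.0, 0.0), (0.0, 0.0, 0.0), (1.0, 0.0, 0.0)),
        ((0.0, 1.0, 0.0), (0.0, 0.0, 0.0), (1.0, 0.5, 0.0)),
        ((0.0, 1.0, 0.0), (0.0, 0.0, 0.0), (1.0, 1.0, 0.0)),
        ((0.0, 1.0, 0.0), (0.0, 0.0, 0.0), (0.5, 1.0, 0.0)),
    ]
    angles = [joint_angle(s, e, w) for s, e, w in frames]
    smoothed = moving_average(angles, k=2)
    print(window_stats(smoothed, length=2, step=1))
```

Each tuple produced by `window_stats` would correspond to one window's statistical descriptor for a single angle channel. A complete feature set would repeat this over every joint position and angle before classification.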
Pages: 216-224
Page count: 9