A bag of words approach to subject specific 3D human pose interaction classification with random decision forests

Cited by: 8
Authors
Deng, Jingjing [1 ]
Xie, Xianghua [1 ]
Daubney, Ben [1 ]
Affiliations
[1] Swansea Univ, Dept Comp Sci, Swansea, W Glam, Wales
Keywords
Human interaction; Action recognition; Human pose; Random forests; Bag of words; MOTION CAPTURE; RECOGNITION
DOI
10.1016/j.gmod.2013.10.006
Chinese Library Classification (CLC) number
TP31 [Computer software]
Discipline classification codes
081202; 0835
Abstract
In this work, we investigate whether conversational interactions can be distinguished by observing human motion alone, in particular subject specific gestures in 3D. We use Kinect sensors to obtain 3D displacement and velocity measurements, followed by wavelet decomposition to extract low level temporal features. These features are then generalized to form a visual vocabulary, which is further generalized to a set of topics from the temporal distributions of the visual vocabulary. A subject specific supervised learning approach based on Random Forests is used to classify the testing sequences into seven different conversational scenarios. The conversational scenarios considered in this work differ from one another only subtly. Unlike typical action or event recognition, each interaction in our case contains many instances of primitive motions and actions, many of which are shared among different conversational scenarios. That is, the interactions we are concerned with are not micro or instant events, such as hugging and high-fives, but rather interactions over a period of time that consist of rather similar individual motions, micro actions and interactions. We believe this is among the first works devoted to subject specific conversational interaction classification using 3D pose features, and that it shows this task is indeed possible. © 2013 Elsevier Inc. All rights reserved.
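The first two stages named in the abstract (wavelet decomposition of a motion signal, then quantization against a visual vocabulary) can be illustrated with a minimal sketch. This is not the authors' implementation: the Haar wavelet, the window length, the hand-picked two-word vocabulary and all function names are assumptions made for illustration, and the topic modelling and Random Forest stages are omitted. In practice the vocabulary would be learned (for example by clustering) and the resulting histograms would be passed to a classifier.

```python
import math
from collections import Counter


def haar_step(signal):
    """One level of the Haar transform: returns (approximation, detail)."""
    s = 1.0 / math.sqrt(2.0)
    pairs = range(0, len(signal) - 1, 2)
    approx = [(signal[i] + signal[i + 1]) * s for i in pairs]
    detail = [(signal[i] - signal[i + 1]) * s for i in pairs]
    return approx, detail


def haar_features(signal, levels):
    """Detail coefficients of each level, followed by the final approximation."""
    feats = []
    approx = list(signal)
    for _ in range(levels):
        approx, detail = haar_step(approx)
        feats.extend(detail)
    feats.extend(approx)
    return feats


def nearest_word(feature, vocabulary):
    """Index of the codeword closest to `feature` (squared Euclidean distance)."""
    def dist(word):
        return sum((a - b) ** 2 for a, b in zip(feature, word))
    return min(range(len(vocabulary)), key=lambda k: dist(vocabulary[k]))


def bag_of_words(windows, vocabulary, levels=2):
    """Normalized histogram of codeword occurrences over a sequence of windows."""
    counts = Counter(
        nearest_word(haar_features(w, levels), vocabulary) for w in windows
    )
    return [counts[k] / len(windows) for k in range(len(vocabulary))]


# Hypothetical toy data: four windows of one joint's displacement signal.
windows = [[1, 1, 1, 1], [1, 1, 1, 1], [0, 1, 2, 3], [2, 2, 2, 2]]
# Hypothetical vocabulary: a "static" codeword and a "ramp" codeword.
vocabulary = [[0.0, 0.0, 0.0, 2.0], [-0.7, -0.7, -2.0, 3.0]]
histogram = bag_of_words(windows, vocabulary)
```

Each sequence is thus reduced to a fixed-length histogram regardless of its duration, which is what makes a standard supervised classifier applicable to interactions that unfold over time.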
Pages: 162-171
Number of pages: 10