Accurate 3D action recognition using learning on the Grassmann manifold

被引:149
|
作者
Slama, Rim [1 ,2 ]
Wannous, Hazem [1 ,2 ]
Daoudi, Mohamed [2 ,3 ]
Srivastava, Anuj [4 ]
机构
[1] Univ Lille 1, F-59655 Villeneuve Dascq, France
[2] CNRS, UMR 8022, LIFL Lab, Villeneuve Dascq, France
[3] Inst Mines Telecom Telecom Lille, Villeneuve Dascq, France
[4] Florida State Univ, Dept Stat, Tallahassee, FL 32306 USA
基金
美国国家科学基金会;
关键词
Human action recognition; Grassmann manifold; Observational latency; Depth images; Skeleton; Classification; SPARSE REPRESENTATION; VIDEO; ALGORITHMS;
D O I
10.1016/j.patcog.2014.08.011
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we address the problem of modeling and analyzing human motion by focusing on 3D body skeletons. Particularly, our intent is to represent skeletal motion in a geometric and efficient way, leading to an accurate action-recognition system. Here an action is represented by a dynamical system whose observability matrix is characterized as an element of a Grassmann manifold. To formulate our learning algorithm, we propose two distinct ideas: (1) in the first one we perform classification using a Truncated Wrapped Gaussian model, one for each class in its own tangent space. (2) In the second one we propose a novel learning algorithm that uses a vector representation formed by concatenating local coordinates in tangent spaces associated with different classes and training a linear SVM. We evaluate our approaches on three public 3D action datasets: MSR-action 3D, UT-kinect and UCF-kinect datasets; these datasets represent different kinds of challenges and together help provide an exhaustive evaluation. The results show that our approaches either match or exceed state-of-the-art performance reaching 91.21% on MSR-action 3D, 97.91% on UCF-kinect, and 88.5% on UT-kinect. Finally, we evaluate the latency, i.e. the ability to recognize an action before its termination, of our approach and demonstrate improvements relative to other published approaches. (C)2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:556 / 567
页数:12
相关论文
共 50 条
  • [31] Aggressive action recognition using 3D CNN architectures
    Saveliev, Anton
    Uzdiaev, Mikhail
    Dmitrii, Malov
    12TH INTERNATIONAL CONFERENCE ON THE DEVELOPMENTS IN ESYSTEMS ENGINEERING (DESE 2019), 2019, : 890 - 895
  • [32] 3D Human Action Recognition Using Model Segmentation
    Yoon, Sang Min
    Kuijper, Arjan
    IMAGE ANALYSIS AND RECOGNITION, PT I, PROCEEDINGS, 2010, 6111 : 189 - +
  • [33] Localization and recognition of human action in 3D using transformers
    Jiankai Sun
    Linjiang Huang
    Hongsong Wang
    Chuanyang Zheng
    Jianing Qiu
    Md Tauhidul Islam
    Enze Xie
    Bolei Zhou
    Lei Xing
    Arjun Chandrasekaran
    Michael J. Black
    Communications Engineering, 3 (1):
  • [34] Human Action Recognition Using 3D Reconstruction Data
    Papadopoulos, Georgios Th
    Daras, Petros
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2018, 28 (08) : 1807 - 1823
  • [35] Fast and Accurate 3D Face Recognition
    Spreeuwers, Luuk
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2011, 93 (03) : 389 - 414
  • [36] 3D TRAJECTORIES FOR ACTION RECOGNITION
    Koperski, Michal
    Bilinski, Piotr
    Bremond, Francois
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 4176 - 4180
  • [37] Accurate and Real-time Human Action Recognition Based on 3D Skeleton
    Chen, Hongzhao
    Wang, Guijin
    He, Li
    2013 INTERNATIONAL CONFERENCE ON OPTICAL INSTRUMENTS AND TECHNOLOGY: OPTOELECTRONIC IMAGING AND PROCESSING TECHNOLOGY, 2013, 9045
  • [38] Automated Conversion of 2D to 3D Image using Manifold Learning
    Chahal, Nidhi
    Pippal, Meghna
    Chaudhury, Santanu
    2015 FIFTH NATIONAL CONFERENCE ON COMPUTER VISION, PATTERN RECOGNITION, IMAGE PROCESSING AND GRAPHICS (NCVPRIPG), 2015,
  • [39] Projection Metric Learning on Grassmann Manifold with Application to Video based Face Recognition
    Huang, Zhiwu
    Wang, Ruiping
    Shan, Shiguang
    Chen, Xilin
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 140 - 149
  • [40] Learning hierarchical 3D kernel descriptors for RGB-D action recognition
    Kong, Yu
    Satarboroujeni, Behnam
    Fu, Yun
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2016, 144 : 14 - 23