Accurate 3D action recognition using learning on the Grassmann manifold

Cited by: 149
Authors
Slama, Rim [1, 2]
Wannous, Hazem [1, 2]
Daoudi, Mohamed [2, 3]
Srivastava, Anuj [4]
Affiliations
[1] Univ Lille 1, F-59655 Villeneuve d'Ascq, France
[2] CNRS, UMR 8022, LIFL Lab, Villeneuve d'Ascq, France
[3] Inst Mines Telecom, Telecom Lille, Villeneuve d'Ascq, France
[4] Florida State Univ, Dept Stat, Tallahassee, FL 32306 USA
Funding
National Science Foundation (USA)
Keywords
Human action recognition; Grassmann manifold; Observational latency; Depth images; Skeleton; Classification; Sparse representation; Video; Algorithms
DOI
10.1016/j.patcog.2014.08.011
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In this paper we address the problem of modeling and analyzing human motion by focusing on 3D body skeletons. In particular, we aim to represent skeletal motion in a geometric and efficient way, leading to an accurate action-recognition system. An action is represented by a dynamical system whose observability matrix is characterized as an element of a Grassmann manifold. To formulate our learning algorithm, we propose two distinct ideas: (1) in the first, we perform classification using a Truncated Wrapped Gaussian model, one for each class in its own tangent space; (2) in the second, we propose a novel learning algorithm that uses a vector representation formed by concatenating local coordinates in the tangent spaces associated with different classes, and trains a linear SVM on it. We evaluate our approaches on three public 3D action datasets: MSR-action 3D, UT-kinect and UCF-kinect. These datasets pose different kinds of challenges and together provide an exhaustive evaluation. The results show that our approaches either match or exceed state-of-the-art performance, reaching 91.21% on MSR-action 3D, 97.91% on UCF-kinect, and 88.5% on UT-kinect. Finally, we evaluate the latency of our approach, i.e. its ability to recognize an action before the action terminates, and demonstrate improvements relative to other published approaches. © 2014 Elsevier Ltd. All rights reserved.
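The geometric pipeline described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. The abstract does not say how the dynamical system is estimated, so the closed-form SVD fit in `fit_lds` is an assumption, and the function names, the truncation length `m`, the state dimension `d`, and the random input data are all hypothetical. The reference points passed to `tangent_features` stand in for per-class reference points, which are chosen arbitrarily here.

```python
import numpy as np

def fit_lds(features, d):
    """Assumed closed-form fit of x_{t+1} = A x_t, y_t = C x_t.
    features: (p, T) array, one column of joint coordinates per frame."""
    U, s, Vt = np.linalg.svd(features, full_matrices=False)
    C = U[:, :d]                              # observation matrix (p, d)
    Z = np.diag(s[:d]) @ Vt[:d, :]            # hidden states (d, T)
    A = Z[:, 1:] @ np.linalg.pinv(Z[:, :-1])  # least-squares transition (d, d)
    return A, C

def observability_subspace(A, C, m):
    """Stack [C; CA; ...; CA^(m-1)] and orthonormalize its columns,
    giving a basis for a point on the Grassmann manifold."""
    blocks = [C]
    for _ in range(m - 1):
        blocks.append(blocks[-1] @ A)
    Q, _ = np.linalg.qr(np.vstack(blocks))    # reduced QR: (m*p, d)
    return Q

def grassmann_log(X, Y):
    """Tangent vector at span(X) pointing toward span(Y).
    Assumes X.T @ Y is invertible (no principal angle equals pi/2)."""
    XtY = X.T @ Y
    M = (Y - X @ XtY) @ np.linalg.inv(XtY)
    U, s, Vt = np.linalg.svd(M, full_matrices=False)
    return U @ np.diag(np.arctan(s)) @ Vt

def geodesic_distance(X, Y):
    """2-norm of the principal angles between span(X) and span(Y)."""
    s = np.linalg.svd(X.T @ Y, compute_uv=False)
    return np.linalg.norm(np.arccos(np.clip(s, -1.0, 1.0)))

def tangent_features(refs, Y):
    """Concatenate the local coordinates of Y in the tangent spaces
    at several reference points (one per class in the abstract)."""
    return np.concatenate([grassmann_log(R, Y).ravel() for R in refs])

# Hypothetical data: two sequences of 20 joints x 3 coordinates, 40 frames.
rng = np.random.default_rng(0)
seq_a = rng.standard_normal((60, 40))
seq_b = rng.standard_normal((60, 40))
X = observability_subspace(*fit_lds(seq_a, d=5), m=4)
Y = observability_subspace(*fit_lds(seq_b, d=5), m=4)
delta = grassmann_log(X, Y)           # local coordinates of Y at X
feat = tangent_features([X, Y], Y)    # vector that a linear SVM could consume
```

The Frobenius norm of `delta` equals `geodesic_distance(X, Y)`, which is why tangent-space coordinates preserve distances to the reference point. A vector such as `feat` is the kind of input a linear classifier would be trained on in the second approach.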
Pages: 556-567
Number of pages: 12