Deep 3D Flow Features for Human Action Recognition

被引:0
|
作者
Psaltis, Athanasios [1 ]
Papadopoulos, Georgios Th [1 ]
Daras, Petros [1 ]
机构
[1] Ctr Res & Technol, Iraklion, Greece
基金
欧盟地平线“2020”;
关键词
Action recognition; 3D flow; Deep Learning;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The present work investigates the use of 3D flow information for performing Deep Learning (DL)-based human action recognition. Generally, 3D flow fields include rich and fine-grained information, regarding the motion dynamics of the observed human actions. However, despite the great potentials present, 3D flow has not been widely used, mainly due to challenges related to the efficient modeling of the flow information and the addressing of the respective computational complexity issues. In this paper, different techniques are investigated for incorporating 3D flow information in DL action recognition schemes. In particular, a novel sequence modeling approach is introduced, which combines the advantageous characteristics for spatial correlation estimation of Convolutional Neural Networks (CNNs) with the increased temporal modeling capabilities of Long Short Term Memory (LSTM) models. Additionally, an extended CNN-based deep flow model is proposed that extracts features from both the spatial and temporal domains, by applying 3D convolutions; hence, modeling the action dynamics within consecutive frames. Moreover, for compact and efficient 3D motion feature extraction, the combined use of CNNs with a `flow colorization' approach is adopted. The proposed methods significantly outperform similar DL and hand-crafted 3D flow approaches, and compare favorably with most skeleton-based techniques in the currently most challenging public dataset, namely the NTU RGB-D.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] 3D GLOH Features for Human Action Recognition
    Abdulmunem, Ashwan
    Lai, Yu-Kun
    Sun, Xianfang
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 805 - 810
  • [2] 3D Features for human action recognition with semi-supervised learning
    Sahoo, Suraj Prakash
    Srinivasu, Ulli
    Ari, Samit
    IET IMAGE PROCESSING, 2019, 13 (06) : 983 - 990
  • [3] View Invariant Human Action Recognition Using 3D Geometric Features
    Zhao, Qingsong
    Sun, Shijie
    Ji, Xiaopeng
    Wang, Lei
    Cheng, Jun
    INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2019, PT IV, 2019, 11743 : 564 - 575
  • [4] Hollywood 3D: What are the Best 3D Features for Action Recognition?
    Simon Hadfield
    Karel Lebeda
    Richard Bowden
    International Journal of Computer Vision, 2017, 121 : 95 - 110
  • [5] Hollywood 3D: What are the Best 3D Features for Action Recognition?
    Hadfield, Simon
    Lebeda, Karel
    Bowden, Richard
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2017, 121 (01) : 95 - 110
  • [6] 3D CNN for Human Action Recognition
    Boualia, Sameh Neili
    Ben Amara, Najoua Essoukri
    2021 18TH INTERNATIONAL MULTI-CONFERENCE ON SYSTEMS, SIGNALS & DEVICES (SSD), 2021, : 276 - 282
  • [7] 3D Pooling on Local Space-time Features for Human Action Recognition
    Hadibarhaghtalab, Najme
    Azimifar, Zohreh
    2013 8TH IRANIAN CONFERENCE ON MACHINE VISION & IMAGE PROCESSING (MVIP 2013), 2013, : 266 - 269
  • [8] Fusing Spatiotemporal Features and Joints for 3D Action Recognition
    Zhu, Yu
    Chen, Wenbin
    Guo, Guodong
    2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2013, : 486 - 491
  • [9] Recognition of Human Continuous Action with 3D CNN
    Yu, Gang
    Li, Ting
    COMPUTER VISION SYSTEMS, ICVS 2017, 2017, 10528 : 314 - 322
  • [10] HUMAN ACTION RECOGNITION IN 3D MOTION SEQUENCES
    Kelgeorgiadis, Konstantinos
    Nikolaidis, Nikos
    2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, : 2205 - 2209