Extracting hierarchical spatial and temporal features for human action recognition

Cited by: 10
Authors
Zhang, Keting [1]
Zhang, Liqing [1]
Affiliations
[1] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Key Lab Shanghai Educ Commiss Intelligent Interac, Shanghai, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Hierarchical feature extraction; Dual-channel model; Subspace network; Spatial and temporal representation; Action recognition; PARALLEL FRAMEWORK; HEVC;
DOI
10.1007/s11042-017-5179-7
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
Human action recognition is a challenging computer vision task, and many efforts have been made to improve performance. Most previous work has concentrated on hand-crafted features or on spatial-temporal features learned from multiple contiguous frames. In this paper, we present a dual-channel model that decouples spatial and temporal feature extraction. More specifically, we propose to capture complementary static form information from a single frame and dynamic motion information from multi-frame differences in two separate channels. In both channels we use two stacked classical subspace networks to learn hierarchical representations, which are subsequently fused for action recognition. Our model is trained and evaluated on three typical benchmarks: the KTH, UCF and Hollywood2 datasets. The experimental results show that our approach achieves performance comparable to state-of-the-art methods. In addition, feature analysis and control experiments are carried out to demonstrate the effectiveness of the proposed approach for feature extraction and thereby for action recognition.
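The dual-channel idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`split_channels`, `pca_features`, `fuse`), the choice of the middle frame as the static input, the use of consecutive-frame differences, plain PCA as a stand-in for a "classical subspace network", and concatenation as the fusion step are all assumptions made for illustration.

```python
import numpy as np


def split_channels(clip):
    """Split a grayscale clip of shape (T, H, W) into two channel inputs.

    Assumption: the spatial channel sees the middle frame and the temporal
    channel sees differences between consecutive frames.
    """
    clip = np.asarray(clip, dtype=np.float64)
    if clip.ndim != 3 or clip.shape[0] < 2:
        raise ValueError("expected a clip of shape (T, H, W) with T >= 2")
    static_frame = clip[clip.shape[0] // 2]   # (H, W) static form input
    frame_diffs = np.diff(clip, axis=0)       # (T-1, H, W) motion input
    return static_frame, frame_diffs


def pca_features(samples, n_components):
    """Project samples of shape (N, D) onto their top principal directions.

    Assumption: plain PCA stands in for one subspace-learning layer.
    """
    centered = samples - samples.mean(axis=0)
    # Rows of vt are principal directions, ordered by singular value.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return centered @ vt[:n_components].T


def fuse(spatial_feat, temporal_feat):
    """Fuse the two channels by flattening and concatenating (an assumption)."""
    return np.concatenate([np.ravel(spatial_feat), np.ravel(temporal_feat)])


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    clip = rng.random((5, 8, 8))              # synthetic 5-frame clip
    static_frame, frame_diffs = split_channels(clip)
    motion = pca_features(frame_diffs.reshape(len(frame_diffs), -1), 2)
    fused = fuse(static_frame, motion)
    print(static_frame.shape, frame_diffs.shape, motion.shape, fused.shape)
```

In a fuller pipeline the fused vector would be passed to a classifier; the abstract does not specify which one, so that step is left out here.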
Pages: 16053-16068
Page count: 16
Related papers
50 items in total
  • [41] Human Action Recognition in Video by Fusion of Structural and Spatio-temporal Features
    Borzeshi, Ehsan Zare
    Concha, Oscar Perez
    Piccardi, Massimo
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, 2012, 7626 : 474 - 482
  • [42] Human Action Recognition Using Temporal Hierarchical Pyramid of Depth Motion Map and KECA
    El Madany, Nour El Din
    He, Yifeng
    Guan, Ling
    2015 IEEE 17TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2015,
  • [43] Hierarchical Temporal and Spatial Memory for Gait Pattern Recognition
    Shen, Jianghao
    Loew, Murray
    2016 IEEE APPLIED IMAGERY PATTERN RECOGNITION WORKSHOP (AIPR), 2016,
  • [44] Sparse Coding on Local Spatial-Temporal Volumes for Human Action Recognition
    Zhu, Yan
    Zhao, Xu
    Fu, Yun
    Liu, Yuncai
    COMPUTER VISION - ACCV 2010, PT II, 2011, 6493 : 660 - +
  • [45] Human action recognition using an image-based temporal and spatial representation
    Silva, Vinicius
    Soares, Filomena
    Esteves, Joao Sena
    Vercelli, Gianni
    2020 12TH INTERNATIONAL CONGRESS ON ULTRA MODERN TELECOMMUNICATIONS AND CONTROL SYSTEMS AND WORKSHOPS (ICUMT 2020), 2020, : 41 - 46
  • [46] Action Recognition Using Mined Hierarchical Compound Features
    Gilbert, Andrew
    Illingworth, John
    Bowden, Richard
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (05) : 883 - 897
  • [47] Extracting Discriminative Parts with Flexible Number from Low-Rank Features for Human Action Recognition
    Shijian Huang
    Junyong Ye
    Tongqing Wang
    Li Jiang
    Yang Li
    Xuegang Wu
    Arabian Journal for Science and Engineering, 2016, 41 : 2987 - 3001
  • [48] Extracting Discriminative Parts with Flexible Number from Low-Rank Features for Human Action Recognition
    Huang, Shijian
    Ye, Junyong
    Wang, Tongqing
    Jiang, Li
    Li, Yang
    Wu, Xuegang
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2016, 41 (08) : 2987 - 3001
  • [49] A Hierarchical Learning Approach for Human Action Recognition
    Lemieux, Nicolas
    Noumeir, Rita
    SENSORS, 2020, 20 (17) : 1 - 16
  • [50] A novel hierarchical framework for human action recognition
    Chen, Hongzhao
    Wang, Guijin
    Xue, Jing-Hao
    He, Li
    PATTERN RECOGNITION, 2016, 55 : 148 - 159