Action recognition via structured codebook construction

被引:11
|
作者
Zhou, Wen [1 ]
Wang, Chunheng [1 ]
Xiao, Baihua [1 ]
Zhang, Zhong [1 ]
机构
[1] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Action recognition; Bag-of-words models; Structured codebook; Sparse coding; Contextual information;
D O I
10.1016/j.image.2014.01.012
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Bag-of-words models have been widely used to obtain the global representation for action recognition. However, these models ignored the structure information, such as the spatial and temporal contextual information for action representation. In this paper, we propose a novel structured codebook construction method to encode spatial and temporal contextual information among local features for video representation. Given a set of training videos, our method first extracts local motion and appearance features. Next, we encode the spatial and temporal contextual information among local features by constructing correlation matrices for local spatio-temporal features. Then, we discover the common patterns of movements to construct the structured codebook. After that, actions can be represented by a set of sparse coefficients with respect to the structured codebook. Finally, a simple linear SVM classifier is applied to predict the action class based on the action representation. Our method has two main advantages compared to traditional methods. First, our method automatically discovers the mid-level common patterns of movements that capture rich spatial and temporal contextual information. Second, our method is robust to unwanted background local features mainly because most unwanted background local features cannot be sparsely represented by the common patterns and they are treated as residual errors that are not encoded into the action representation. We evaluate the proposed method on two popular benchmarks: KTH action dataset and UCF sports dataset Experimental results demonstrate the advantages of our structured codebook construction. (C) 2014 Elsevier B.V. All rights reserved.
引用
收藏
页码:546 / 555
页数:10
相关论文
共 50 条
  • [21] Identifying the mechanisms underpinning recognition of structured sequences of action
    Williams, A. Mark
    North, Jamie S.
    Hope, Edward R.
    QUARTERLY JOURNAL OF EXPERIMENTAL PSYCHOLOGY, 2012, 65 (10): : 1975 - 1992
  • [22] Identifying the mechanisms underpinning recognition of structured sequences of action
    North, Jamie S.
    Hope, Ed R.
    Williams, A. M.
    JOURNAL OF SPORT & EXERCISE PSYCHOLOGY, 2012, 34 : S115 - S115
  • [23] Action recognition of construction workers under occlusion
    Li, Ziqi
    Li, Dongsheng
    JOURNAL OF BUILDING ENGINEERING, 2022, 45
  • [24] Object Localisation via Action Recognition
    Darby, John
    Li, Baihua
    Cunningham, Ryan
    Costen, Nicholas
    2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 817 - 820
  • [25] A Novel Scheme for the Construction of the SCMA Codebook
    Lei, Tuofeng
    Ni, Shuyan
    Cheng, Naiping
    Chen, Shimiao
    Song, Xin
    IEEE ACCESS, 2022, 10 : 100987 - 100998
  • [26] Background modeling and subtraction by codebook construction
    Kim, K
    Chalidabhongse, TH
    Harwood, D
    Davis, L
    ICIP: 2004 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1- 5, 2004, : 3061 - 3064
  • [27] Structured query construction via knowledge graph embedding
    Wang, Ruijie
    Wang, Meng
    Liu, Jun
    Cochez, Michael
    Decker, Stefan
    KNOWLEDGE AND INFORMATION SYSTEMS, 2020, 62 (05) : 1819 - 1846
  • [28] Structured query construction via knowledge graph embedding
    Ruijie Wang
    Meng Wang
    Jun Liu
    Michael Cochez
    Stefan Decker
    Knowledge and Information Systems, 2020, 62 : 1819 - 1846
  • [29] Structured Time Series Analysis for Human Action Segmentation and Recognition
    Gong, Dian
    Medioni, Gerard
    Zhao, Xuemei
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (07) : 1414 - 1427
  • [30] Structured Fisher Vector encoding method for Human Action Recognition
    Sekma, Manel
    Mejdoub, Mahmoud
    Ben Amar, Chokri
    2015 15TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS (ISDA), 2015, : 642 - 647