Action recognition via structured codebook construction

Cited by: 11

Authors:

Zhou, Wen [1]

Wang, Chunheng [1]

Xiao, Baihua [1]

Zhang, Zhong [1]

Affiliations:

[1] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China

Source:

SIGNAL PROCESSING-IMAGE COMMUNICATION | 2014 / Vol. 29 / Issue 04

Funding:

National Natural Science Foundation of China;

Keywords:

Action recognition; Bag-of-words models; Structured codebook; Sparse coding; Contextual information;

DOI:

10.1016/j.image.2014.01.012

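Chinese Library Classification:

TM [Electrical engineering]; TN [Electronic technology, communication technology];

Discipline classification codes: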

0808 ; 0809 ;

Abstract:

Bag-of-words models have been widely used to obtain a global representation for action recognition. However, these models ignore structural information, such as the spatial and temporal context of local features, when representing actions. In this paper, we propose a novel structured codebook construction method that encodes spatial and temporal contextual information among local features for video representation. Given a set of training videos, our method first extracts local motion and appearance features. Next, we encode the spatial and temporal contextual information among local features by constructing correlation matrices for local spatio-temporal features. Then, we discover the common patterns of movements to construct the structured codebook. After that, actions can be represented by a set of sparse coefficients with respect to the structured codebook. Finally, a simple linear SVM classifier is applied to predict the action class based on the action representation. Our method has two main advantages over traditional methods. First, it automatically discovers mid-level common patterns of movements that capture rich spatial and temporal contextual information. Second, it is robust to unwanted background local features, mainly because most of these features cannot be sparsely represented by the common patterns; they are treated as residual errors and are not encoded into the action representation. We evaluate the proposed method on two popular benchmarks: the KTH action dataset and the UCF sports dataset. Experimental results demonstrate the advantages of our structured codebook construction. (C) 2014 Elsevier B.V. All rights reserved.
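The sparse-representation step named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the structured codebook and the correlation matrices are not reproduced here, the codebook below is random, and the solver (ISTA), the max pooling, and all names (sparse_code, video_representation, lam) are assumptions chosen only for illustration. The sketch codes a set of local descriptors against a codebook under an L1 penalty and pools the coefficients into one vector per video; the linear SVM step is omitted.

```python
import numpy as np

def soft_threshold(x, t):
    # Proximal operator of the L1 norm: shrink every entry toward zero by t.
    return np.sign(x) * np.maximum(np.abs(x) - t, 0.0)

def sparse_code(X, D, lam=0.1, n_iter=200):
    """ISTA for min_A 0.5 * ||X - A D||_F^2 + lam * ||A||_1.

    X: (n, d) local descriptors, one per row.
    D: (k, d) codebook, one atom per row (hypothetical stand-in for a
       structured codebook; here it is just a generic dictionary).
    Returns A: (n, k) sparse coefficients.
    """
    # Largest singular value of D D^T bounds the gradient's Lipschitz constant.
    L = np.linalg.norm(D @ D.T, 2)
    A = np.zeros((X.shape[0], D.shape[0]))
    for _ in range(n_iter):
        grad = (A @ D - X) @ D.T          # gradient of the quadratic term
        A = soft_threshold(A - grad / L, lam / L)
    return A

def video_representation(X, D, lam=0.1):
    # Max-pool absolute coefficients over all descriptors of one video
    # (an assumed pooling choice, not taken from the abstract).
    A = sparse_code(X, D, lam)
    return np.abs(A).max(axis=0)

# Toy data: 20 descriptors of dimension 16, codebook of 8 unit-norm atoms.
rng = np.random.default_rng(0)
D = rng.standard_normal((8, 16))
D /= np.linalg.norm(D, axis=1, keepdims=True)
X = rng.standard_normal((20, 16))

rep = video_representation(X, D)   # one 8-dimensional vector for the "video"
```

In this sketch, descriptors that the atoms cannot reconstruct well leave a large residual X - A D that never enters the pooled vector, which loosely mirrors the abstract's remark that background features are treated as residual errors. The resulting vectors could then be passed to any linear classifier.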


Pages: 546-555

Page count: 10
