Action recognition via structured codebook construction

被引：11

作者：

Zhou, Wen ^{[1
]}

Wang, Chunheng ^{[1
]}

Xiao, Baihua ^{[1
]}

Zhang, Zhong ^{[1
]}

机构：

[1] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China

来源：

SIGNAL PROCESSING-IMAGE COMMUNICATION | 2014年 / 29卷 / 04期

基金：

中国国家自然科学基金;

关键词：

Action recognition; Bag-of-words models; Structured codebook; Sparse coding; Contextual information;

D O I：

10.1016/j.image.2014.01.012

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Bag-of-words models have been widely used to obtain the global representation for action recognition. However, these models ignored the structure information, such as the spatial and temporal contextual information for action representation. In this paper, we propose a novel structured codebook construction method to encode spatial and temporal contextual information among local features for video representation. Given a set of training videos, our method first extracts local motion and appearance features. Next, we encode the spatial and temporal contextual information among local features by constructing correlation matrices for local spatio-temporal features. Then, we discover the common patterns of movements to construct the structured codebook. After that, actions can be represented by a set of sparse coefficients with respect to the structured codebook. Finally, a simple linear SVM classifier is applied to predict the action class based on the action representation. Our method has two main advantages compared to traditional methods. First, our method automatically discovers the mid-level common patterns of movements that capture rich spatial and temporal contextual information. Second, our method is robust to unwanted background local features mainly because most unwanted background local features cannot be sparsely represented by the common patterns and they are treated as residual errors that are not encoded into the action representation. We evaluate the proposed method on two popular benchmarks: KTH action dataset and UCF sports dataset Experimental results demonstrate the advantages of our structured codebook construction. (C) 2014 Elsevier B.V. All rights reserved.

引用

页码：546 / 555

页数：10

共 50 条

[41] Human Action Recognition via Depth Maps Body Parts of Action
Farooq, Adnan
Farooq, Faisal
Anh Vu Le
KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2018, 12 (05): : 2327 - 2347
[42] CODEBOOK ENHANCEMENT OF VLAD REPRESENTATION FOR VISUAL RECOGNITION
Wang, Zhe
Wang, Yali
Wang, Limin
Qiao, Yu
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 1258 - 1262
[43] Tree-structured product-codebook vector quantization
Poggi, G
Ragozini, ARP
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2001, 16 (05) : 421 - 430
[44] Feature Similarity and Frequency-Based Weighted Visual Words Codebook Learning Scheme for Human Action Recognition
Nazir, Saima
Yousaf, Muhammad Haroon
Velastin, Sergio A.
IMAGE AND VIDEO TECHNOLOGY (PSIVT 2017), 2018, 10749 : 326 - 336
[45] Similarity Graph Convolutional Construction Network for Interactive Action Recognition
Sun, Xiangyu
Liu, Qiong
Yang, You
MULTIMEDIA MODELING (MMM 2020), PT II, 2020, 11962 : 291 - 303
[46] A Structured Codebook with Various Codeword Configurations for Downlink MIMO Systems
Kwon, Hyunil
Shin, Myeongcheol
Lee, Chungyong
IEICE TRANSACTIONS ON COMMUNICATIONS, 2010, E93B (11) : 3193 - 3196
[47] A Feature Encoding Based on Low Space Complexity Codebook Called Fuzzy Codebook for Image Recognition
Shinomiya, Yuki
Hoshino, Yukinobu
INTERNATIONAL JOURNAL OF FUZZY SYSTEMS, 2019, 21 (01) : 274 - 280
[48] Flexible goal recognition via graph construction and analysis
Yin, MH
Gu, WX
Lu, YH
FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, PT 2, PROCEEDINGS, 2005, 3614 : 1118 - 1127
[49] A Feature Encoding Based on Low Space Complexity Codebook Called Fuzzy Codebook for Image Recognition
Yuki Shinomiya
Yukinobu Hoshino
International Journal of Fuzzy Systems, 2019, 21 : 274 - 280
[50] 3-DIMENSIONAL SHAPE CONSTRUCTION AND RECOGNITION BY FUSING INTENSITY AND STRUCTURED LIGHTING
WANG, YF
CHENG, DI
PATTERN RECOGNITION, 1992, 25 (12) : 1411 - 1425

← 1 2 3 4 5 →