Action recognition via structured codebook construction

被引：11

作者：

Zhou, Wen ^{[1
]}

Wang, Chunheng ^{[1
]}

Xiao, Baihua ^{[1
]}

Zhang, Zhong ^{[1
]}

机构：

[1] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China

来源：

SIGNAL PROCESSING-IMAGE COMMUNICATION | 2014年 / 29卷 / 04期

基金：

中国国家自然科学基金;

关键词：

Action recognition; Bag-of-words models; Structured codebook; Sparse coding; Contextual information;

D O I：

10.1016/j.image.2014.01.012

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Bag-of-words models have been widely used to obtain the global representation for action recognition. However, these models ignored the structure information, such as the spatial and temporal contextual information for action representation. In this paper, we propose a novel structured codebook construction method to encode spatial and temporal contextual information among local features for video representation. Given a set of training videos, our method first extracts local motion and appearance features. Next, we encode the spatial and temporal contextual information among local features by constructing correlation matrices for local spatio-temporal features. Then, we discover the common patterns of movements to construct the structured codebook. After that, actions can be represented by a set of sparse coefficients with respect to the structured codebook. Finally, a simple linear SVM classifier is applied to predict the action class based on the action representation. Our method has two main advantages compared to traditional methods. First, our method automatically discovers the mid-level common patterns of movements that capture rich spatial and temporal contextual information. Second, our method is robust to unwanted background local features mainly because most unwanted background local features cannot be sparsely represented by the common patterns and they are treated as residual errors that are not encoded into the action representation. We evaluate the proposed method on two popular benchmarks: KTH action dataset and UCF sports dataset Experimental results demonstrate the advantages of our structured codebook construction. (C) 2014 Elsevier B.V. All rights reserved.

引用

页码：546 / 555

页数：10

共 50 条

[31] Zero-Shot Recognition via Structured Prediction
Zhang, Ziming
Saligrama, Venkatesh
COMPUTER VISION - ECCV 2016, PT VII, 2016, 9911 : 533 - 548
[32] Robust and Practical Face Recognition via Structured Sparsity
Jia, Kui
Chan, Tsung-Han
Ma, Yi
COMPUTER VISION - ECCV 2012, PT IV, 2012, 7575 : 331 - 344
[33] Action-Gons: Action Recognition with a Discriminative Dictionary of Structured Elements with Varying Granularity
Wang, Yuwang
Wang, Baoyuan
Yu, Yizhou
Dai, Qionghai
Tu, Zhuowen
COMPUTER VISION - ACCV 2014, PT V, 2015, 9007 : 259 - 274
[34] IMPROVING CODEBOOK-BASED WRITER RECOGNITION
Jehanzeb, Muhammed
Bin Sulong, Ghazali
Siddiqi, Imran
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2013, 27 (06)
[35] Characteristic Kernels on Structured Domains Excel in Robotics and Human Action Recognition
Danafar, Somayeh
Gretton, Arthur
Schmidhuber, Juergen
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT I: EUROPEAN CONFERENCE, ECML PKDD 2010, 2010, 6321 : 264 - 279
[36] Latent semantic learning with structured sparse representation for human action recognition
Lu, Zhiwu
Peng, Yuxin
PATTERN RECOGNITION, 2013, 46 (07) : 1799 - 1809
[37] The use of merging approach in the construction of initial codebook
Zhang, G
Ma, JF
Li, RP
ICEMI'99: FOURTH INTERNATIONAL CONFERENCE ON ELECTRONIC MEASUREMENT & INSTRUMENTS, VOLS 1 AND 2, CONFERENCE PROCEEDINGS, 1999, : 1043 - 1047
[38] A generic codebook based approach for gait recognition
Muhammad Hassan Khan
Muhammad Shahid Farid
Marcin Grzegorzek
Multimedia Tools and Applications, 2019, 78 : 35689 - 35712
[39] A generic codebook based approach for gait recognition
Khan, Muhammad Hassan
Farid, Muhammad Shahid
Grzegorzek, Marcin
MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (24) : 35689 - 35712
[40] Speaker recognition with a MLP classifier and LPCC codebook
Rodriguez-Porcheron, D
Faúndez-Zanuy, M
ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 1005 - 1008

← 1 2 3 4 5 →