Action recognition via structured codebook construction

Cited by: 11
Authors
Zhou, Wen [1]
Wang, Chunheng [1]
Xiao, Baihua [1]
Zhang, Zhong [1]
Affiliations
[1] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Action recognition; Bag-of-words models; Structured codebook; Sparse coding; Contextual information;
DOI
10.1016/j.image.2014.01.012
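Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract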
Bag-of-words models have been widely used to obtain a global representation for action recognition. However, these models ignore structural information, such as the spatial and temporal context of an action. In this paper, we propose a novel structured codebook construction method that encodes spatial and temporal contextual information among local features for video representation. Given a set of training videos, our method first extracts local motion and appearance features. Next, we encode the spatial and temporal contextual information among local features by constructing correlation matrices for local spatio-temporal features. Then, we discover the common patterns of movements to construct the structured codebook. After that, actions can be represented by a set of sparse coefficients with respect to the structured codebook. Finally, a simple linear SVM classifier is applied to predict the action class from this representation. Our method has two main advantages over traditional methods. First, it automatically discovers mid-level common patterns of movements that capture rich spatial and temporal contextual information. Second, it is robust to unwanted background local features, mainly because most of them cannot be sparsely represented by the common patterns; they are treated as residual errors and are not encoded into the action representation. We evaluate the proposed method on two popular benchmarks: the KTH action dataset and the UCF Sports dataset. Experimental results demonstrate the advantages of our structured codebook construction. © 2014 Elsevier B.V. All rights reserved.
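The sparse-representation step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the solver (ISTA for an L1-regularized least-squares problem), the max-pooling of coefficients into a video-level vector, the names `sparse_code` and `video_representation`, the parameter `lam`, and the random toy data are all assumptions made for illustration. The codebook `D` here is random, whereas the paper learns it from correlation matrices of local features.

```python
import numpy as np

def soft_threshold(v, t):
    # Proximal operator of the L1 norm: shrink every entry toward zero by t.
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)

def sparse_code(X, D, lam=0.1, n_iter=200):
    """Approximately minimize 0.5*||X - A D||_F^2 + lam*||A||_1 over A with ISTA.

    X: (n, d) local descriptors, D: (k, d) codebook atoms (rows).
    Returns A: (n, k) sparse coefficients.
    """
    L = np.linalg.norm(D @ D.T, 2)  # Lipschitz constant of the smooth term's gradient
    A = np.zeros((X.shape[0], D.shape[0]))
    for _ in range(n_iter):
        grad = (A @ D - X) @ D.T
        A = soft_threshold(A - grad / L, lam / L)
    return A

def video_representation(X, D, lam=0.1):
    # Assumed pooling: max absolute coefficient per atom over all local features.
    # Whatever the atoms cannot explain stays in the residual X - A D
    # and does not enter the pooled vector.
    A = sparse_code(X, D, lam)
    return np.abs(A).max(axis=0)

# Toy data (hypothetical): 8 unit-norm atoms in 16 dimensions, 30 descriptors
# generated as sparse combinations of atoms plus a little noise.
rng = np.random.default_rng(0)
D = rng.standard_normal((8, 16))
D /= np.linalg.norm(D, axis=1, keepdims=True)
mask = rng.random((30, 8)) < 0.2
coeffs = rng.standard_normal((30, 8)) * mask
X = coeffs @ D + 0.01 * rng.standard_normal((30, 16))

rep = video_representation(X, D)
print(rep.shape)
```

Under these assumptions, one pooled vector per video would then be passed to a linear SVM, as the abstract describes for the final classification step.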
Pages: 546-555
Number of pages: 10