Action recognition via structured codebook construction

被引:11
|
作者
Zhou, Wen [1 ]
Wang, Chunheng [1 ]
Xiao, Baihua [1 ]
Zhang, Zhong [1 ]
机构
[1] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Action recognition; Bag-of-words models; Structured codebook; Sparse coding; Contextual information;
D O I
10.1016/j.image.2014.01.012
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Bag-of-words models have been widely used to obtain the global representation for action recognition. However, these models ignored the structure information, such as the spatial and temporal contextual information for action representation. In this paper, we propose a novel structured codebook construction method to encode spatial and temporal contextual information among local features for video representation. Given a set of training videos, our method first extracts local motion and appearance features. Next, we encode the spatial and temporal contextual information among local features by constructing correlation matrices for local spatio-temporal features. Then, we discover the common patterns of movements to construct the structured codebook. After that, actions can be represented by a set of sparse coefficients with respect to the structured codebook. Finally, a simple linear SVM classifier is applied to predict the action class based on the action representation. Our method has two main advantages compared to traditional methods. First, our method automatically discovers the mid-level common patterns of movements that capture rich spatial and temporal contextual information. Second, our method is robust to unwanted background local features mainly because most unwanted background local features cannot be sparsely represented by the common patterns and they are treated as residual errors that are not encoded into the action representation. We evaluate the proposed method on two popular benchmarks: KTH action dataset and UCF sports dataset Experimental results demonstrate the advantages of our structured codebook construction. (C) 2014 Elsevier B.V. All rights reserved.
引用
收藏
页码:546 / 555
页数:10
相关论文
共 50 条
  • [1] Human action recognition: a construction of codebook by discriminative features selection approach
    Siddiqui, Samra
    Khan, Muhammad Attique
    Bashir, Khalid
    Sharif, Muhammad
    Azam, Faisal
    Javed, Muhammad Younus
    INTERNATIONAL JOURNAL OF APPLIED PATTERN RECOGNITION, 2018, 5 (03) : 206 - 228
  • [2] COMPACT VISUAL CODEBOOK FOR ACTION RECOGNITION
    Wei, Qingdi
    Zhang, Xiaoqin
    Kong, Yu
    Hu, Weiming
    Ling, Haibin
    2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, : 3805 - 3808
  • [3] Adaptive learning codebook for action recognition
    Kong, Yu
    Zhang, Xiaoqin
    Hu, Weiming
    Jia, Yunde
    PATTERN RECOGNITION LETTERS, 2011, 32 (08) : 1178 - 1186
  • [4] CATEGORY SENSITIVE CODEBOOK CONSTRUCTION FOR OBJECT CATEGORY RECOGNITION
    Zhang, Chunjie
    Liu, Jing
    Ouyang, Yi
    Tian, Qi
    Lu, Hanqing
    Ma, Songde
    2009 16TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-6, 2009, : 329 - +
  • [5] Cross-view Action Recognition via Dual-Codebook and Hierarchical Transfer Framework
    Zhang, Chengkun
    Zheng, Huicheng
    Lai, Jianhuang
    COMPUTER VISION - ACCV 2014, PT V, 2015, 9007 : 579 - 592
  • [6] Visual codebook construction for class-specific recognition
    Gao, Jun
    Sang, Nong
    Gao, Changxin
    Tang, Qiling
    Sang, Jun
    OPTICAL ENGINEERING, 2009, 48 (07)
  • [7] Learning Discriminative Visual Codebook for Human Action Recognition
    Lei, Qing
    Li, Shao-zi
    Zhang, Hong-bo
    JOURNAL OF COMPUTERS, 2013, 8 (12) : 3093 - 3102
  • [8] Class-specific codebook construction for biologically inspired recognition
    Gao, Jun
    Gao, Changxin
    Sang, Nong
    Tang, Qiling
    2009 IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE, VOLS 1-3, 2009, : 565 - 568
  • [9] Study on the Generation Model of Weighted Visual Codebook for Action Recognition
    Lin, Xianming
    Li, Shaozi
    Cao, Donglin
    Zhuang, Weiyuan
    MEASUREMENT TECHNOLOGY AND ENGINEERING RESEARCHES IN INDUSTRY, PTS 1-3, 2013, 333-335 : 944 - 947
  • [10] Structured Learning for Action Recognition in Videos
    Long, Yinghan
    Srinivasan, Gopalakrishnan
    Panda, Priyadarshini
    Roy, Kaushik
    IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2019, 9 (03) : 475 - 484