Action recognition via structured codebook construction

被引:11
|
作者
Zhou, Wen [1 ]
Wang, Chunheng [1 ]
Xiao, Baihua [1 ]
Zhang, Zhong [1 ]
机构
[1] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Action recognition; Bag-of-words models; Structured codebook; Sparse coding; Contextual information;
D O I
10.1016/j.image.2014.01.012
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Bag-of-words models have been widely used to obtain the global representation for action recognition. However, these models ignored the structure information, such as the spatial and temporal contextual information for action representation. In this paper, we propose a novel structured codebook construction method to encode spatial and temporal contextual information among local features for video representation. Given a set of training videos, our method first extracts local motion and appearance features. Next, we encode the spatial and temporal contextual information among local features by constructing correlation matrices for local spatio-temporal features. Then, we discover the common patterns of movements to construct the structured codebook. After that, actions can be represented by a set of sparse coefficients with respect to the structured codebook. Finally, a simple linear SVM classifier is applied to predict the action class based on the action representation. Our method has two main advantages compared to traditional methods. First, our method automatically discovers the mid-level common patterns of movements that capture rich spatial and temporal contextual information. Second, our method is robust to unwanted background local features mainly because most unwanted background local features cannot be sparsely represented by the common patterns and they are treated as residual errors that are not encoded into the action representation. We evaluate the proposed method on two popular benchmarks: KTH action dataset and UCF sports dataset Experimental results demonstrate the advantages of our structured codebook construction. (C) 2014 Elsevier B.V. All rights reserved.
引用
收藏
页码:546 / 555
页数:10
相关论文
共 50 条
  • [41] Human Action Recognition via Depth Maps Body Parts of Action
    Farooq, Adnan
    Farooq, Faisal
    Anh Vu Le
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2018, 12 (05): : 2327 - 2347
  • [42] CODEBOOK ENHANCEMENT OF VLAD REPRESENTATION FOR VISUAL RECOGNITION
    Wang, Zhe
    Wang, Yali
    Wang, Limin
    Qiao, Yu
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 1258 - 1262
  • [43] Tree-structured product-codebook vector quantization
    Poggi, G
    Ragozini, ARP
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2001, 16 (05) : 421 - 430
  • [44] Feature Similarity and Frequency-Based Weighted Visual Words Codebook Learning Scheme for Human Action Recognition
    Nazir, Saima
    Yousaf, Muhammad Haroon
    Velastin, Sergio A.
    IMAGE AND VIDEO TECHNOLOGY (PSIVT 2017), 2018, 10749 : 326 - 336
  • [45] Similarity Graph Convolutional Construction Network for Interactive Action Recognition
    Sun, Xiangyu
    Liu, Qiong
    Yang, You
    MULTIMEDIA MODELING (MMM 2020), PT II, 2020, 11962 : 291 - 303
  • [46] A Structured Codebook with Various Codeword Configurations for Downlink MIMO Systems
    Kwon, Hyunil
    Shin, Myeongcheol
    Lee, Chungyong
    IEICE TRANSACTIONS ON COMMUNICATIONS, 2010, E93B (11) : 3193 - 3196
  • [47] A Feature Encoding Based on Low Space Complexity Codebook Called Fuzzy Codebook for Image Recognition
    Shinomiya, Yuki
    Hoshino, Yukinobu
    INTERNATIONAL JOURNAL OF FUZZY SYSTEMS, 2019, 21 (01) : 274 - 280
  • [48] Flexible goal recognition via graph construction and analysis
    Yin, MH
    Gu, WX
    Lu, YH
    FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, PT 2, PROCEEDINGS, 2005, 3614 : 1118 - 1127
  • [49] A Feature Encoding Based on Low Space Complexity Codebook Called Fuzzy Codebook for Image Recognition
    Yuki Shinomiya
    Yukinobu Hoshino
    International Journal of Fuzzy Systems, 2019, 21 : 274 - 280
  • [50] 3-DIMENSIONAL SHAPE CONSTRUCTION AND RECOGNITION BY FUSING INTENSITY AND STRUCTURED LIGHTING
    WANG, YF
    CHENG, DI
    PATTERN RECOGNITION, 1992, 25 (12) : 1411 - 1425