Human action recognition using Local Spatio-Temporal Discriminant Embedding

Cited by: 0

Authors:

Jia, Kui [1]

Yeung, Dit-Yan [2]

Affiliations:

[1] CAS CUHK, Shenzhen Inst Adv Integrat Technol, Shenzhen, Peoples R China

[2] Hong Kong Univ Sci & Technol, Kowloon, Hong Kong, Peoples R China

Source:

2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12 | 2008

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Human action video sequences can be considered as nonlinear dynamic shape manifolds in the space of image frames. In this paper, we address learning and classifying human actions on embedded low-dimensional manifolds. We propose a novel manifold embedding method, called Local Spatio-Temporal Discriminant Embedding (LSTDE). The discriminating capabilities of the proposed method are two-fold: (1) for local spatial discrimination, LSTDE projects data points (silhouette-based image frames of human action sequences) in a local neighborhood into the embedding space where data points of the same action class are close while those of different classes are far apart; (2) in such a local neighborhood, each data point has an associated short video segment, which forms a local temporal subspace on the embedded manifold. LSTDE finds an optimal embedding which maximizes the principal angles between those temporal subspaces associated with data points of different classes. Benefiting from the joint spatio-temporal discriminant embedding, our method is potentially more powerful for classifying human actions with similar space-time shapes, and is able to perform recognition on a frame-by-frame or short video segment basis. Experimental results demonstrate that our method can accurately recognize human actions, and can improve the recognition performance over some representative manifold embedding methods, especially on highly confusing human action types.
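The abstract relies on principal angles between local temporal subspaces. As a minimal sketch of that quantity only (not the LSTDE optimization itself, which the abstract does not specify), the angles between two subspaces can be computed from the singular values of the product of their orthonormal bases. The function name `principal_angles` and the toy subspaces are hypothetical, and the sketch assumes each input matrix has full column rank.

```python
import numpy as np


def principal_angles(A, B):
    """Return the principal angles (radians, ascending) between span(A) and span(B).

    A and B are (d, k) arrays whose columns span each subspace.
    Hypothetical helper for illustration; assumes full column rank.
    """
    # Orthonormal bases for the two column spaces.
    Qa, _ = np.linalg.qr(A)
    Qb, _ = np.linalg.qr(B)
    # Singular values of Qa^T Qb are the cosines of the principal angles.
    cosines = np.linalg.svd(Qa.T @ Qb, compute_uv=False)
    # Clip to guard against round-off pushing values outside [0, 1].
    return np.arccos(np.clip(cosines, 0.0, 1.0))


# Toy example in R^3: the xy-plane versus a plane tilted 45 degrees about the x-axis.
xy_plane = np.array([[1.0, 0.0], [0.0, 1.0], [0.0, 0.0]])
tilted = np.array([[1.0, 0.0], [0.0, 1.0], [0.0, 1.0]])
angles = principal_angles(xy_plane, tilted)
```

In this toy case the two planes share the x-axis, so the smallest angle is approximately zero and the largest is approximately pi/4. Larger principal angles mean the subspaces are easier to tell apart, which is the property the abstract says the embedding maximizes across classes.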

Pages: 3040 / +

Page count: 2

