OFPI: Optical Flow Pose Image for Action Recognition

被引：2

作者：

Chen, Dong ^{[1
,2
]}

Zhang, Tao ^{[2
]}

Zhou, Peng ^{[1
]}

Yan, Chenyang ^{[3
]}

Li, Chuanqi ^{[1
,2
]}

机构：

[1] Guangxi Normal Univ, Coll Comp Sci & Engn, Guilin 541004, Peoples R China

[2] Nanning Normal Univ, Coll Phys & Elect Engn, Nanning 530001, Peoples R China

[3] Kanazawa Univ, Div Elect Engn & Comp Sci, Kanazawa 9201192, Japan

来源：

MATHEMATICS | 2023年 / 11卷 / 06期

关键词：

action recognition; optical flow pose image; skeletal data; transformer;

D O I：

10.3390/math11061451

中图分类号：

O1 [数学];

学科分类号：

0701 ; 070101 ;

摘要：

Most approaches to action recognition based on pseudo-images involve encoding skeletal data into RGB-like image representations. This approach cannot fully exploit the kinematic features and structural information of human poses, and convolutional neural network (CNN) models that process pseudo-images lack a global field of view and cannot completely extract action features from pseudo-images. In this paper, we propose a novel pose-based action representation method called Optical Flow Pose Image (OFPI) in order to fully capitalize on the spatial and temporal information of skeletal data. Specifically, in the proposed method, an advanced pose estimator collects skeletal data before locating the target person and then extracts skeletal data utilizing a human tracking algorithm. The OFPI representation is obtained by aggregating these skeletal data over time. To test the superiority of OFPI and investigate the significance of the model having a global field of view, we trained a simple CNN model and a transformer-based model, respectively. Both models achieved superior outcomes. Because of the global field of view, especially in the transformer-based model, the OFPI-based representation achieved 98.3% and 94.2% accuracy on the KTH and JHMDB datasets, respectively. Compared with other advanced pose representation methods and multi-stream methods, OFPI achieved state-of-the-art performance on the JHMDB dataset, indicating the utility and potential of this algorithm for skeleton-based action recognition research.

引用

页数：23

共 50 条

[21] Egocentric articulated pose tracking for action recognition
Yonemoto, Haruka
Murasaki, Kazuhiko
Osawa, Tatsuya
Sudo, Kyoko
Shimamura, Jun
Taniguchi, Yukinobu
2015 14TH IAPR INTERNATIONAL CONFERENCE ON MACHINE VISION APPLICATIONS (MVA), 2015, : 98 - 101
[22] Learning pose dictionary for human action recognition
Cai, Jia-xin
Tang, Xin
Feng, Guo-can
2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 381 - 386
[23] DeepPear: Deep Pose Estimation and Action Recognition
Jhuang, You-Ying
Tsai, Wen-Jiin
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 7119 - 7125
[24] Semantic action recognition by learning a pose lexicon
Zhou, Lijuan
Li, Wanqing
Ogunbona, Philip
Zhang, Zhengyou
PATTERN RECOGNITION, 2017, 72 : 548 - 562
[25] PoTion: Pose MoTion Representation for Action Recognition
Choutas, Vasileios
Weinzaepfel, Philippe
Revaud, Jerome
Schmid, Cordelia
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7024 - 7033
[26] Mutually Incoherent Pose Bases for Action Recognition
Qian, Yinzhong
Chen, Wenbin
Shen, I-fan
2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 823 - 828
[27] Latent Pose Estimator for Continuous Action Recognition
Ning, Huazhong
Xu, Wei
Gong, Yihong
Huang, Thomas
COMPUTER VISION - ECCV 2008, PT II, PROCEEDINGS, 2008, 5303 : 419 - +
[28] Discriminative Pose Analysis for Human Action Recognition
Zhao, Xiaofeng
Huang, Yao
Yang, Jianyu
Liu, Chunping
2020 IEEE 6TH WORLD FORUM ON INTERNET OF THINGS (WF-IOT), 2020,
[29] Human action recognition by leaning pose dictionary
Cai, Jiaxin
Feng, Guocan
Tang, Xin
Luo, Zhihong
Cai, Jiaxin, 1600, Chinese Optical Society (34):
[30] An approach to pose-based action recognition
Wang, Chunyu
Wang, Yizhou
Yuille, Alan L.
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 915 - 922

← 1 2 3 4 5 →