OFPI: Optical Flow Pose Image for Action Recognition

被引:2
|
作者
Chen, Dong [1 ,2 ]
Zhang, Tao [2 ]
Zhou, Peng [1 ]
Yan, Chenyang [3 ]
Li, Chuanqi [1 ,2 ]
机构
[1] Guangxi Normal Univ, Coll Comp Sci & Engn, Guilin 541004, Peoples R China
[2] Nanning Normal Univ, Coll Phys & Elect Engn, Nanning 530001, Peoples R China
[3] Kanazawa Univ, Div Elect Engn & Comp Sci, Kanazawa 9201192, Japan
关键词
action recognition; optical flow pose image; skeletal data; transformer;
D O I
10.3390/math11061451
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Most approaches to action recognition based on pseudo-images involve encoding skeletal data into RGB-like image representations. This approach cannot fully exploit the kinematic features and structural information of human poses, and convolutional neural network (CNN) models that process pseudo-images lack a global field of view and cannot completely extract action features from pseudo-images. In this paper, we propose a novel pose-based action representation method called Optical Flow Pose Image (OFPI) in order to fully capitalize on the spatial and temporal information of skeletal data. Specifically, in the proposed method, an advanced pose estimator collects skeletal data before locating the target person and then extracts skeletal data utilizing a human tracking algorithm. The OFPI representation is obtained by aggregating these skeletal data over time. To test the superiority of OFPI and investigate the significance of the model having a global field of view, we trained a simple CNN model and a transformer-based model, respectively. Both models achieved superior outcomes. Because of the global field of view, especially in the transformer-based model, the OFPI-based representation achieved 98.3% and 94.2% accuracy on the KTH and JHMDB datasets, respectively. Compared with other advanced pose representation methods and multi-stream methods, OFPI achieved state-of-the-art performance on the JHMDB dataset, indicating the utility and potential of this algorithm for skeleton-based action recognition research.
引用
收藏
页数:23
相关论文
共 50 条
  • [21] Egocentric articulated pose tracking for action recognition
    Yonemoto, Haruka
    Murasaki, Kazuhiko
    Osawa, Tatsuya
    Sudo, Kyoko
    Shimamura, Jun
    Taniguchi, Yukinobu
    2015 14TH IAPR INTERNATIONAL CONFERENCE ON MACHINE VISION APPLICATIONS (MVA), 2015, : 98 - 101
  • [22] Learning pose dictionary for human action recognition
    Cai, Jia-xin
    Tang, Xin
    Feng, Guo-can
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 381 - 386
  • [23] DeepPear: Deep Pose Estimation and Action Recognition
    Jhuang, You-Ying
    Tsai, Wen-Jiin
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 7119 - 7125
  • [24] Semantic action recognition by learning a pose lexicon
    Zhou, Lijuan
    Li, Wanqing
    Ogunbona, Philip
    Zhang, Zhengyou
    PATTERN RECOGNITION, 2017, 72 : 548 - 562
  • [25] PoTion: Pose MoTion Representation for Action Recognition
    Choutas, Vasileios
    Weinzaepfel, Philippe
    Revaud, Jerome
    Schmid, Cordelia
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7024 - 7033
  • [26] Mutually Incoherent Pose Bases for Action Recognition
    Qian, Yinzhong
    Chen, Wenbin
    Shen, I-fan
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 823 - 828
  • [27] Latent Pose Estimator for Continuous Action Recognition
    Ning, Huazhong
    Xu, Wei
    Gong, Yihong
    Huang, Thomas
    COMPUTER VISION - ECCV 2008, PT II, PROCEEDINGS, 2008, 5303 : 419 - +
  • [28] Discriminative Pose Analysis for Human Action Recognition
    Zhao, Xiaofeng
    Huang, Yao
    Yang, Jianyu
    Liu, Chunping
    2020 IEEE 6TH WORLD FORUM ON INTERNET OF THINGS (WF-IOT), 2020,
  • [29] Human action recognition by leaning pose dictionary
    Cai, Jiaxin
    Feng, Guocan
    Tang, Xin
    Luo, Zhihong
    Cai, Jiaxin, 1600, Chinese Optical Society (34):
  • [30] An approach to pose-based action recognition
    Wang, Chunyu
    Wang, Yizhou
    Yuille, Alan L.
    2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 915 - 922