Research Progress in Skeleton-Based Human Action Recognition

被引:0
|
作者
Liu B. [1 ,2 ]
Zhou S. [3 ]
Dong J. [1 ]
Xie M. [3 ]
Zhou S. [3 ]
Zheng T. [1 ]
Zhang S. [5 ]
Ye X. [6 ]
Wang X. [1 ]
机构
[1] School of Computer Science and Technology, Zhejiang Gongshang University, Hangzhou
[2] Key Laboratory of Public Security Informatization Application Based on Big Data Architecture, Ministry of Public Security, Hangzhou
[3] School of Information and Electronic Engineering, Zhejiang Gongshang University, Hangzhou
[4] Department of Computer and Information Security, Zhejiang Police College, Hangzhou
[5] College of Computer Science and Technology, Zhejiang University, Hangzhou
[6] Institute of Big Data and Information Technology, Wenzhou University, Wenzhou
关键词
action recognition; deep learning; graph convolutional network; skeleton feature extraction;
D O I
10.3724/SP.J.1089.2023.19640
中图分类号
学科分类号
摘要
In recent years, with the development of deep learning technology, many novel skeleton-based human action recognition algorithms have been proposed, which has greatly promoted the development of this field. This paper aims to give a comprehensive and detailed summary of the main datasets and algorithms in the skeleton-based human action recognition field. Firstly, the main skeleton-related datasets such as NTU, Kinetics-Skeleton, and SYSU 3DHOI are reviewed. Secondly, the skeleton-based human action recognition algorithms are summarized into three categories, i.e., supervised learning-based, semi-supervised learning-based, and unsupervised learning-based, the main algorithms of each category are further introduced and compared. Finally, challenges that the field is currently facing, i.e., over-reliance on big data, large computing power, and large models, are concluded, and three future development directions are proposed to alleviate the above challenges: high-precision skeleton dataset construction, fine-grained skeleton-based action recognition, and skeleton-based action recognition with data-efficient learning. © 2023 Institute of Computing Technology. All rights reserved.
引用
收藏
页码:1299 / 1322
页数:23
相关论文
共 132 条
  • [71] Muhammad K, Mustaqeem, Ullah A, Et al., Human action recognition using attention based LSTM network with dilated CNN features, Future Generation Computer Systems, 125, pp. 820-830, (2021)
  • [72] He J Y, Wu X, Cheng Z Q, Et al., DB-LSTM: densely-connected bi-directional LSTM for human action recognition, Neurocomputing, 444, pp. 319-331, (2021)
  • [73] Du Y, Fu Y, Wang L., Skeleton based action recognition with convolutional neural network, Proceedings of the 3th IAPR Asian Conference on Pattern Recognition, pp. 579-583, (2015)
  • [74] Kim T S, Reiter A., Interpretable 3D human action analysis with temporal convolutional networks, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp. 1623-1631, (2017)
  • [75] Li C, Zhong Q Y, Xie D, Et al., Skeleton-based action recognition with convolutional neural networks, Proceedings of the IEEE International Conference on Multimedia & Expo Workshops, pp. 597-600, (2017)
  • [76] Li C, Zhong Q Y, Xie D, Et al., Co-occurrence feature learning from skeleton data for action recognition and detection with hierarchical aggregation
  • [77] Wang P C, Li Z Y, Hou Y H, Et al., Action recognition based on joint trajectory maps using convolu tional neural networks, Proceedings of the 24th ACM International Conference on Multimedia, pp. 102-106, (2016)
  • [78] Ke Q H, Bennamoun M, An S J, Et al., A new representation of skeleton sequences for 3D action recognition, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4570-4579, (2017)
  • [79] Caetano C, Bremond F, Schwartz W R., Skeleton image representation for 3D action recognition based on tree structure and reference joints, Proceedings of the 32nd SIBGRAPI Conference on Graphics, Patterns and Images, pp. 16-23, (2019)
  • [80] Le T M, Inoue N, Shinoda K., A fine-to-coarse convolutional neural network for 3D human action recognition