2D Human Skeleton Action Recognition Based on Depth Estimation

被引:1
|
作者
Wang, Lei [1 ,2 ]
Yang, Shanmin [3 ]
Zhang, Jianwei [1 ]
Gu, Song [2 ]
机构
[1] Sichuan Univ, Sichuan, Peoples R China
[2] Chengdu Aeronaut Polytech, Chengdu, Peoples R China
[3] Chengdu Univ Informat Technol, Chengdu, Peoples R China
关键词
action recognition; depth estimation; muti-tasks learning; graph structure; video surveillance; NETWORK;
D O I
10.1587/transinf.2023EDP7223
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Human action recognition (HAR) exhibits limited accuracy in video surveillance due to the 2D information captured with monocular cameras. To address the problem, a depth estimation-based human skeleton action recognition method (SARDE) is proposed in this study, with the aim of transforming 2D human action data into 3D format to dig hidden action clues in the 2D data. SARDE comprises two tasks, i.e., human skeleton action recognition and monocular depth estimation. The two tasks are integrated in a multi-task manner in end-to-end training to comprehensively utilize the correlation between action recognition and depth estimation by sharing parameters to learn the depth features effectively for human action recognition. In this study, graph-structured networks with inception blocks and skip connections are investigated for depth estimation. The experimental results verify the effectiveness and superiority of the proposed method in skeleton action recognition that the method reaches state-of-the-art on the datasets.
引用
收藏
页码:869 / 877
页数:9
相关论文
共 50 条
  • [41] Action Tree Convolutional Networks: Skeleton-Based Human Action Recognition
    Liu, Wenjie
    Zhang, Ziyi
    Han, Bing
    Zhu, Chenhui
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT III, 2018, 11166 : 783 - 792
  • [42] Research for Human Action Recognition based on Depth Information
    Liu, Jing
    Wang, Lei
    Yang, Xubo
    2013 3RD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), 2013, : 1281 - 1284
  • [43] 2D Log-Gabor Wavelet Based Action Recognition
    Li, Ning
    Xu, De
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2009, E92D (11): : 2275 - 2278
  • [44] A Method of Depth Image based Human Action Recognition
    Li, Pei
    Cheng, Wanli
    MATERIALS SCIENCE, ENERGY TECHNOLOGY, AND POWER ENGINEERING I, 2017, 1839
  • [45] Depth-Based vs. Color-Based Pose Estimation in Human Action Recognition
    Malawski, Filip
    Jankowski, Bartosz
    ADVANCES IN VISUAL COMPUTING, ISVC 2022, PT I, 2022, 13598 : 336 - 346
  • [46] Hands, Objects, Action! Egocentric 2D Hand-Based Action Recognition
    Mucha, Wiktor
    Kampel, Martin
    COMPUTER VISION SYSTEMS, ICVS 2023, 2023, 14253 : 31 - 40
  • [47] MODEL OF HUMAN ACTIONS RECOGNITION BASED ON 2D KERNEL
    Radulescu, Bogdan Alexandru
    Radulescu, Victorita
    PROCEEDINGS OF THE ASME 2021 30TH CONFERENCE ON INFORMATION STORAGE AND PROCESSING SYSTEMS (ISPS2021), 2021,
  • [48] SOANets: Encoder-decoder based Skeleton Orientation Alignment Network for White Cane User Recognition from 2D Human Skeleton Sequence
    Nishida, Naoki
    Kawanishi, Yasutomo
    Deguchi, Daisuke
    Ide, Ichiro
    Murase, Hiroshi
    Piao, Jun
    PROCEEDINGS OF THE 15TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS, VOL 5: VISAPP, 2020, : 435 - 443
  • [49] Depth and Skeleton Associated Action Recognition without Online Accessible RGB-D Cameras
    Lin, Yen-Yu
    Hua, Ju-Hsuan
    Tang, Nick C.
    Chen, Min-Hung
    Liao, Hong-Yuan Mark
    2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 2617 - 2624
  • [50] 2D/3D Pose Estimation and Action Recognition using Multitask Deep Learning
    Luvizon, Diogo C.
    Picard, David
    Tabia, Hedi
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 5137 - 5146