2D Human Skeleton Action Recognition Based on Depth Estimation

被引:1
|
作者
Wang, Lei [1 ,2 ]
Yang, Shanmin [3 ]
Zhang, Jianwei [1 ]
Gu, Song [2 ]
机构
[1] Sichuan Univ, Sichuan, Peoples R China
[2] Chengdu Aeronaut Polytech, Chengdu, Peoples R China
[3] Chengdu Univ Informat Technol, Chengdu, Peoples R China
关键词
action recognition; depth estimation; muti-tasks learning; graph structure; video surveillance; NETWORK;
D O I
10.1587/transinf.2023EDP7223
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Human action recognition (HAR) exhibits limited accuracy in video surveillance due to the 2D information captured with monocular cameras. To address the problem, a depth estimation-based human skeleton action recognition method (SARDE) is proposed in this study, with the aim of transforming 2D human action data into 3D format to dig hidden action clues in the 2D data. SARDE comprises two tasks, i.e., human skeleton action recognition and monocular depth estimation. The two tasks are integrated in a multi-task manner in end-to-end training to comprehensively utilize the correlation between action recognition and depth estimation by sharing parameters to learn the depth features effectively for human action recognition. In this study, graph-structured networks with inception blocks and skip connections are investigated for depth estimation. The experimental results verify the effectiveness and superiority of the proposed method in skeleton action recognition that the method reaches state-of-the-art on the datasets.
引用
收藏
页码:869 / 877
页数:9
相关论文
共 50 条
  • [1] Human Action Recognition Based on 2D Poses and Skeleton Joints
    Belluzzo, Bruno
    Marana, Aparecido Nilceu
    INTELLIGENT SYSTEMS, PT II, 2022, 13654 : 71 - 83
  • [2] 2D human skeleton action recognition with spatial constraints
    Wang, Lei
    Zhang, Jianwei
    Yang, Wenbing
    Gu, Song
    Yang, Shanmin
    IET COMPUTER VISION, 2024, 18 (07) : 968 - 981
  • [3] Action Recognition Algorithm based on 2D Human Pose Estimation Method
    Yu, Chongkai
    Chen, Wenjie
    Li, Ye
    Chen, Chen
    2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 7366 - 7370
  • [4] A 3D graph convolutional networks model for 2D skeleton-based human action recognition
    Weng, Libo
    Lou, Weidong
    Shen, Xin
    Gao, Fei
    IET IMAGE PROCESSING, 2023, 17 (03) : 773 - 783
  • [5] Adaptive 2D skeleton deformation based on view agnostic network for action recognition
    Dong, Qianyun
    Zhong, Xin
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 102
  • [6] 2D Action Recognition Serves 3D Human Pose Estimation
    Gall, Juergen
    Yao, Angela
    Van Gool, Luc
    COMPUTER VISION-ECCV 2010, PT III, 2010, 6313 : 425 - 438
  • [7] Understanding the Gap between 2D and 3D Skeleton-Based Action Recognition
    Elias, Petr
    Sedmidubsky, Jan
    Zezula, Pavel
    2019 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM 2019), 2019, : 192 - 195
  • [8] Action Recognition based on a mixture of RGB and Depth based skeleton
    Das, Srijan
    Koperski, Michal
    Bremond, Francois
    Francesca, Gianpiero
    2017 14TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS), 2017,
  • [9] HUMAN ACTION RECOGNITION USING ASSOCIATED DEPTH AND SKELETON INFORMATION
    Tang, Nick C.
    Lin, Yen-Yu
    Hua, Ju-Hsuan
    Weng, Ming-Fang
    Liao, Hong-Yuan Mark
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [10] Human Action Recognition Using Associated Depth and Skeleton Information
    Li, Keyu
    Liu, Zhigang
    Liang, Liqin
    Song, Yanan
    2016 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2016, : 418 - 422