2D Human Skeleton Action Recognition Based on Depth Estimation

被引:1
|
作者
Wang, Lei [1 ,2 ]
Yang, Shanmin [3 ]
Zhang, Jianwei [1 ]
Gu, Song [2 ]
机构
[1] Sichuan Univ, Sichuan, Peoples R China
[2] Chengdu Aeronaut Polytech, Chengdu, Peoples R China
[3] Chengdu Univ Informat Technol, Chengdu, Peoples R China
关键词
action recognition; depth estimation; muti-tasks learning; graph structure; video surveillance; NETWORK;
D O I
10.1587/transinf.2023EDP7223
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Human action recognition (HAR) exhibits limited accuracy in video surveillance due to the 2D information captured with monocular cameras. To address the problem, a depth estimation-based human skeleton action recognition method (SARDE) is proposed in this study, with the aim of transforming 2D human action data into 3D format to dig hidden action clues in the 2D data. SARDE comprises two tasks, i.e., human skeleton action recognition and monocular depth estimation. The two tasks are integrated in a multi-task manner in end-to-end training to comprehensively utilize the correlation between action recognition and depth estimation by sharing parameters to learn the depth features effectively for human action recognition. In this study, graph-structured networks with inception blocks and skip connections are investigated for depth estimation. The experimental results verify the effectiveness and superiority of the proposed method in skeleton action recognition that the method reaches state-of-the-art on the datasets.
引用
收藏
页码:869 / 877
页数:9
相关论文
共 50 条
  • [21] Multi-Feature Fusion Real-Time Action Recognition Based on 2D to 3D Skeleton
    Ren Guoyin
    Lu Xiaoqi
    Li Yuhao
    LASER & OPTOELECTRONICS PROGRESS, 2021, 58 (24)
  • [22] Human Action Recognition System based on Skeleton Data
    Cho, Tin Zar Wint
    Win, May Thu
    Win, Aung
    2018 IEEE INTERNATIONAL CONFERENCE ON AGENTS (ICA), 2018, : 93 - 98
  • [23] Combining 2D and 3D deep models for action recognition with depth information
    Ali Seydi Keçeli
    Aydın Kaya
    Ahmet Burak Can
    Signal, Image and Video Processing, 2018, 12 : 1197 - 1205
  • [24] Combining 2D and 3D deep models for action recognition with depth information
    Keceli, Ali Seydi
    Kaya, Aydin
    Can, Ahmet Burak
    SIGNAL IMAGE AND VIDEO PROCESSING, 2018, 12 (06) : 1197 - 1205
  • [25] 2D to 3D Human Skeleton Estimation Based on the Brown Camera Distortion Model and Constrained Optimization
    Ma, Lan
    Huo, Hua
    ELECTRONICS, 2025, 14 (05):
  • [26] Gender classification on 2D human skeleton
    Barra, Paola
    Bisogni, Carmen
    Nappi, Michele
    Freire-Obregon, David
    Castrillon-Santana, Modesto
    2019 3RD INTERNATIONAL CONFERENCE ON BIO-ENGINEERING FOR SMART TECHNOLOGIES (BIOSMART), 2019,
  • [27] Deep learning-based multi-view 3D-human action recognition using skeleton and depth data
    Sampat Kumar Ghosh
    Rashmi M
    Biju R Mohan
    Ram Mohana Reddy Guddeti
    Multimedia Tools and Applications, 2023, 82 : 19829 - 19851
  • [28] Deep learning-based multi-view 3D-human action recognition using skeleton and depth data
    Ghosh, Sampat Kumar
    Rashmi, M.
    Mohan, Biju R.
    Guddeti, Ram Mohana Reddy
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (13) : 19829 - 19851
  • [29] Action Capsules: Human skeleton action recognition
    Bavil, Ali Farajzadeh
    Damirchi, Hamed
    Taghirad, Hamid D.
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 233
  • [30] Human activity recognition using 2D skeleton data and supervised machine learning
    Ghazal, Sumaira
    Khan, Umar S.
    Saleem, Muhammad Mubasher
    Rashid, Nasir
    Iqbal, Javaid
    IET IMAGE PROCESSING, 2019, 13 (13) : 2572 - 2578