3D Human Pose Estimation from RGB plus D Images with Convolutional Neural Networks

被引:3
|
作者
Cai, Yiheng [1 ]
Wang, Xueyan [1 ]
Kong, Xinran [1 ]
机构
[1] Beijing Univ Technol, Dept Informat, PingLeyuan 100, Beijing, Peoples R China
关键词
Human Pose Estimation; Deep Learning; RGB plus D Images;
D O I
10.1145/3278198.3278225
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this paper, we explore 3D human pose estimation on the RGB+D images. While many researchers try to directly predict 3D pose from single RGB image, we propose a simple framework that could predict 3D pose predictions with the RGB image and depth image. Our approach is based on two aspects. On the one hand, we predicted accurate 2D joint locations from RGB image by applying the stacked hourglass networks based on the improved residual architecture. On the other hand, in view of obtained 2D joint locations, we could estimate 3D pose with the depth after calculating depth image patches. In general, compared with the state-of-the-art approaches, our model achieves signification improvement on benchmark dataset.
引用
收藏
页码:64 / 69
页数:6
相关论文
共 50 条
  • [41] Modulated Graph Convolutional Network for 3D Human Pose Estimation
    Zou, Zhiming
    Tang, Wei
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 11457 - 11467
  • [42] RGB-D Salient Object Detection via 3D Convolutional Neural Networks
    Chen, Qian
    Liu, Ze
    Zhang, Yi
    Fu, Keren
    Zhao, Qijun
    Du, Hongwei
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 1063 - 1071
  • [43] Flexible Graph Convolutional Network for 3D Human Pose Estimation
    Shahjahan, Abu Taib Mohammed
    Hamza, A. Ben
    arXiv,
  • [44] Latent Distribution-Based 3D Hand Pose Estimation From Monocular RGB Images
    Li, Moran
    Wang, Jialong
    Sang, Nong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (12) : 4883 - 4894
  • [45] 3D Hand Pose Detection in Egocentric RGB-D Images
    Rogez, Gregory
    Khademi, Maryam
    Supancic, J. S., III
    Montiel, J. M. M.
    Ramanan, Deva
    COMPUTER VISION - ECCV 2014 WORKSHOPS, PT I, 2015, 8925 : 356 - 371
  • [46] Semantic Graph Convolutional Networks for 3D Human Pose Regression
    Zhao, Long
    Peng, Xi
    Tian, Yu
    Kapadia, Mubbasir
    Metaxas, Dimitris N.
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3420 - 3430
  • [47] Improving Semantic Segmentation of 3D Medical Images on 3D Convolutional Neural Networks
    Marquez Herrera, Alejandra
    Cuadros-Vargas, Alex J.
    Pedrini, Helio
    2019 XLV LATIN AMERICAN COMPUTING CONFERENCE (CLEI 2019), 2019,
  • [48] Error Accuracy Estimation of 3D Reconstruction and 3D Camera Pose from RGB-D Data
    Ortiz-Fernandez, Luis E.
    Silva, Bruno M. F.
    Goncalves, Luiz M. G.
    2022 35TH SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI 2022), 2022, : 67 - 72
  • [49] 3D Human Pose Estimation With Generative Adversarial Networks
    Xia, Hailun
    Xiao, Meng
    IEEE ACCESS, 2020, 8 : 206198 - 206206
  • [50] Learning to Estimate 3D Hand Pose from Single RGB Images
    Zimmermann, Christian
    Brox, Thomas
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 4913 - 4921