Improving 3D Human Pose Estimation via 3D Part Affinity Fields

被引:9
|
作者
Liu, Ding [1 ]
Zhao, Zixu [1 ]
Wang, Xinchao [2 ]
Hu, Yuxiao [3 ]
Zhang, Lei [4 ]
Huang, Thomas S. [1 ]
机构
[1] Univ Illinois, Urbana, IL 61801 USA
[2] Stevens Inst Technol, Hoboken, NJ 07030 USA
[3] Huawei Technol Inc USA, Santa Clara, CA USA
[4] Microsoft, Bellevue, WA USA
来源
2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV) | 2019年
关键词
REPRESENTATION;
D O I
10.1109/WACV.2019.00112
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
3D human pose estimation from monocular images has become a heated area in computer vision recently. For years, most deep neural network based practices have adopted either an end-to-end approach, or a two-stage approach. An end-to-end network typically estimates 3D human poses directly from 2D input images, but it suffers from the shortage of 3D human pose data. It is also obscure to know if the inaccuracy stems from limited visual understanding or 2D-to-3D mapping. Whereas a two-stage directly lifts those 2D keypoint outputs to the 3D space, after utilizing an existing network for 2D keypoint detections. However, they tend to ignore some useful contextual hints from the 2D raw image pixels. In this paper, we introduce a two-stage architecture that can eliminate the main disadvantages of both these approaches. During the first stage we use an existing stateof- the-art detector to estimate 2D poses. To add more contextual information to help lifting 2D poses to 3D poses, we propose 3D Part Affinity Fields (3D-PAFs). We use 3D-PAFs to infer 3D limb vectors, and combine them with 2D poses to regress the 3D coordinates. We trained and tested our proposed framework on Human3.6M, the most popular 3D human pose benchmark dataset. Our approach achieves the state-of-the-art performance, which proves that with right selections of contextual information, a simple regression model can be very powerful in estimating 3D poses.
引用
收藏
页码:1004 / 1013
页数:10
相关论文
共 50 条
  • [31] Fast online human pose estimation via 3D voxel data
    Sagawa, Yuichi
    Shimosaka, Masamichi
    Mori, Taketoshi
    Sato, Tomomasa
    2007 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-9, 2007, : 1040 - 1046
  • [32] Robust 3D Human Pose Estimation via Dual Dictionaries Learning
    Ji, Hao
    Su, Fei
    2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 3370 - 3373
  • [33] 3D Human Pose Estimation via Explicit Compositional Depth Maps
    Wu, Haiping
    Xiao, Bin
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12378 - 12385
  • [34] 3D Pose Estimation and 3D Model Retrieval for Objects in the Wild
    Grabner, Alexander
    Roth, Peter M.
    Lepetit, Vincent
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 3022 - 3031
  • [35] Generic 3D Representation via Pose Estimation and Matching
    Zamir, Amir R.
    Wekel, Tilman
    Agrawal, Pulkit
    Wei, Colin
    Malik, Jitendra
    Savarese, Silvio
    COMPUTER VISION - ECCV 2016, PT III, 2016, 9907 : 535 - 553
  • [36] DRPose3D: Depth Ranking in 3D Human Pose Estimation
    Wang, Min
    Chen, Xipeng
    Liu, Wentao
    Qian, Chen
    Lin, Liang
    Ma, Lizhuang
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 978 - 984
  • [37] A Bayesian Part-based Approach to 3D Human Pose and Camera Estimation
    Brau, Ernesto
    Jiang, Hao
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 1762 - 1767
  • [38] Monocular 3D Pose Estimation via Pose Grammar and Data Augmentation
    Xu, Yuanlu
    Wang, Wenguan
    Liu, Tengyu
    Liu, Xiaobai
    Xie, Jianwen
    Zhu, Song-Chun
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (10) : 6327 - 6344
  • [39] Application of 3D Human Pose Estimation for Behavioral Reproduction
    Dare, Kodjine
    Ben Abdessalem, Hamdi
    Frasson, Claude
    INTELLIGENT TUTORING SYSTEMS, ITS 2022, 2022, 13284 : 190 - 196
  • [40] Towards Viewpoint Invariant 3D Human Pose Estimation
    Haque, Albert
    Peng, Boya
    Luo, Zelun
    Alahi, Alexandre
    Yeung, Serena
    Li Fei-Fei
    COMPUTER VISION - ECCV 2016, PT I, 2016, 9905 : 160 - 177