Improving 3D Human Pose Estimation via 3D Part Affinity Fields

被引：9

作者：

Liu, Ding ^{[1
]}

Zhao, Zixu ^{[1
]}

Wang, Xinchao ^{[2
]}

Hu, Yuxiao ^{[3
]}

Zhang, Lei ^{[4
]}

Huang, Thomas S. ^{[1
]}

机构：

[1] Univ Illinois, Urbana, IL 61801 USA

[2] Stevens Inst Technol, Hoboken, NJ 07030 USA

[3] Huawei Technol Inc USA, Santa Clara, CA USA

[4] Microsoft, Bellevue, WA USA

来源：

2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV) | 2019年

关键词：

REPRESENTATION;

D O I：

10.1109/WACV.2019.00112

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

3D human pose estimation from monocular images has become a heated area in computer vision recently. For years, most deep neural network based practices have adopted either an end-to-end approach, or a two-stage approach. An end-to-end network typically estimates 3D human poses directly from 2D input images, but it suffers from the shortage of 3D human pose data. It is also obscure to know if the inaccuracy stems from limited visual understanding or 2D-to-3D mapping. Whereas a two-stage directly lifts those 2D keypoint outputs to the 3D space, after utilizing an existing network for 2D keypoint detections. However, they tend to ignore some useful contextual hints from the 2D raw image pixels. In this paper, we introduce a two-stage architecture that can eliminate the main disadvantages of both these approaches. During the first stage we use an existing stateof- the-art detector to estimate 2D poses. To add more contextual information to help lifting 2D poses to 3D poses, we propose 3D Part Affinity Fields (3D-PAFs). We use 3D-PAFs to infer 3D limb vectors, and combine them with 2D poses to regress the 3D coordinates. We trained and tested our proposed framework on Human3.6M, the most popular 3D human pose benchmark dataset. Our approach achieves the state-of-the-art performance, which proves that with right selections of contextual information, a simple regression model can be very powerful in estimating 3D poses.

引用

页码：1004 / 1013

页数：10

共 50 条

[31] Fast online human pose estimation via 3D voxel data
Sagawa, Yuichi
Shimosaka, Masamichi
Mori, Taketoshi
Sato, Tomomasa
2007 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-9, 2007, : 1040 - 1046
[32] Robust 3D Human Pose Estimation via Dual Dictionaries Learning
Ji, Hao
Su, Fei
2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 3370 - 3373
[33] 3D Human Pose Estimation via Explicit Compositional Depth Maps
Wu, Haiping
Xiao, Bin
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12378 - 12385
[34] 3D Pose Estimation and 3D Model Retrieval for Objects in the Wild
Grabner, Alexander
Roth, Peter M.
Lepetit, Vincent
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 3022 - 3031
[35] Generic 3D Representation via Pose Estimation and Matching
Zamir, Amir R.
Wekel, Tilman
Agrawal, Pulkit
Wei, Colin
Malik, Jitendra
Savarese, Silvio
COMPUTER VISION - ECCV 2016, PT III, 2016, 9907 : 535 - 553
[36] DRPose3D: Depth Ranking in 3D Human Pose Estimation
Wang, Min
Chen, Xipeng
Liu, Wentao
Qian, Chen
Lin, Liang
Ma, Lizhuang
PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 978 - 984
[37] A Bayesian Part-based Approach to 3D Human Pose and Camera Estimation
Brau, Ernesto
Jiang, Hao
2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 1762 - 1767
[38] Monocular 3D Pose Estimation via Pose Grammar and Data Augmentation
Xu, Yuanlu
Wang, Wenguan
Liu, Tengyu
Liu, Xiaobai
Xie, Jianwen
Zhu, Song-Chun
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (10) : 6327 - 6344
[39] Application of 3D Human Pose Estimation for Behavioral Reproduction
Dare, Kodjine
Ben Abdessalem, Hamdi
Frasson, Claude
INTELLIGENT TUTORING SYSTEMS, ITS 2022, 2022, 13284 : 190 - 196
[40] Towards Viewpoint Invariant 3D Human Pose Estimation
Haque, Albert
Peng, Boya
Luo, Zelun
Alahi, Alexandre
Yeung, Serena
Li Fei-Fei
COMPUTER VISION - ECCV 2016, PT I, 2016, 9905 : 160 - 177

← 1 2 3 4 5 →