Efficient Human Pose Estimation from Single Depth Images

被引:278
|
作者
Shotton, Jamie [1 ,2 ]
Girshick, Ross [3 ]
Fitzgibbon, Andrew [2 ]
Sharp, Toby [1 ,2 ]
Cook, Mat [2 ]
Finocchio, Mark [4 ]
Moore, Richard
Kohli, Pushmeet [2 ]
Criminisi, Antonio [2 ]
Kipman, Alex [4 ]
Blake, Andrew [2 ]
机构
[1] Microsoft Res, Machine Learning & Percept Grp, Cambridge CB3 0FB, England
[2] Microsoft Res, Cambridge CB3 0FB, England
[3] Univ Calif Berkeley, EERES COENG Engn Res, Berkeley, CA 94720 USA
[4] Microsoft Corp, Redmond, WA 98052 USA
关键词
Computer vision; machine learning; pixel classification; depth cues; range data; games; RECOGNITION;
D O I
10.1109/TPAMI.2012.241
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We describe two new approaches to human pose estimation. Both can quickly and accurately predict the 3D positions of body joints from a single depth image without using any temporal information. The key to both approaches is the use of a large, realistic, and highly varied synthetic set of training images. This allows us to learn models that are largely invariant to factors such as pose, body shape, field-of-view cropping, and clothing. Our first approach employs an intermediate body parts representation, designed so that an accurate per-pixel classification of the parts will localize the joints of the body. The second approach instead directly regresses the positions of body joints. By using simple depth pixel comparison features and parallelizable decision forests, both approaches can run super-real time on consumer hardware. Our evaluation investigates many aspects of our methods, and compares the approaches to each other and to the state of the art. Results on silhouettes suggest broader applicability to other imaging modalities.
引用
收藏
页码:2821 / 2840
页数:20
相关论文
共 50 条
  • [1] Efficient Hand Pose Estimation from a Single Depth Image
    Xu, Chi
    Cheng, Li
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 3456 - 3462
  • [2] Unsupervised Human Pose Estimation on Depth Images
    Blanc-Beyne, Thibault
    Carlier, Axel
    Mouysset, Sandrine
    Charvillat, Vincent
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: APPLIED DATA SCIENCE TRACK, ECML PKDD 2020, PT IV, 2021, 12460 : 358 - 373
  • [3] Accurate and Efficient 3D Human Pose Estimation Algorithm using Single Depth Images for Pose Analysis in Golf
    Park, Soonchan
    Chang, Ju Yong
    Jeong, Hyuk
    Lee, Jae-Ho
    Park, Ji-Young
    2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, : 105 - 113
  • [4] Human Pose and Shape Estimation From Single Polarization Images
    Zou, Shihao
    Zuo, Xinxin
    Wang, Sen
    Qian, Yiming
    Guo, Chuan
    Cheng, Li
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 3560 - 3572
  • [5] Nonlinear body pose estimation from depth images
    Grest, D
    Woetzel, J
    Koch, R
    PATTERN RECOGNITION, PROCEEDINGS, 2005, 3663 : 285 - 292
  • [6] ANN for human pose estimation in low resolution depth images
    Szczuko, Piotr
    2017 SIGNAL PROCESSING: ALGORITHMS, ARCHITECTURES, ARRANGEMENTS, AND APPLICATIONS (SPA 2017), 2017, : 354 - 359
  • [7] Face-from-Depth for Head Pose Estimation on Depth Images
    Borghi, Guido
    Fabbri, Matteo
    Vezzani, Roberto
    Calderara, Simone
    Cucchiara, Rita
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (03) : 596 - 609
  • [8] 3D Convolutional Neural Networks for Efficient and Robust Hand Pose Estimation from Single Depth Images
    Ge, Liuhao
    Liang, Hui
    Yuan, Junsong
    Thalmann, Daniel
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 5679 - 5688
  • [9] A Semantic Occlusion Model for Human Pose Estimation from a Single Depth Image
    Rafi, Umer
    Gall, Juergen
    Leibe, Bastian
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2015,
  • [10] Investigating Depth Domain Adaptation for Efficient Human Pose Estimation
    Martinez-Gonzalez, Angel
    Villamizar, Michael
    Canevet, Olivier
    Odobez, Jean-Marc
    COMPUTER VISION - ECCV 2018 WORKSHOPS, PT II, 2019, 11130 : 346 - 363