Efficient Human Pose Estimation from Single Depth Images

被引:278
|
作者
Shotton, Jamie [1 ,2 ]
Girshick, Ross [3 ]
Fitzgibbon, Andrew [2 ]
Sharp, Toby [1 ,2 ]
Cook, Mat [2 ]
Finocchio, Mark [4 ]
Moore, Richard
Kohli, Pushmeet [2 ]
Criminisi, Antonio [2 ]
Kipman, Alex [4 ]
Blake, Andrew [2 ]
机构
[1] Microsoft Res, Machine Learning & Percept Grp, Cambridge CB3 0FB, England
[2] Microsoft Res, Cambridge CB3 0FB, England
[3] Univ Calif Berkeley, EERES COENG Engn Res, Berkeley, CA 94720 USA
[4] Microsoft Corp, Redmond, WA 98052 USA
关键词
Computer vision; machine learning; pixel classification; depth cues; range data; games; RECOGNITION;
D O I
10.1109/TPAMI.2012.241
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We describe two new approaches to human pose estimation. Both can quickly and accurately predict the 3D positions of body joints from a single depth image without using any temporal information. The key to both approaches is the use of a large, realistic, and highly varied synthetic set of training images. This allows us to learn models that are largely invariant to factors such as pose, body shape, field-of-view cropping, and clothing. Our first approach employs an intermediate body parts representation, designed so that an accurate per-pixel classification of the parts will localize the joints of the body. The second approach instead directly regresses the positions of body joints. By using simple depth pixel comparison features and parallelizable decision forests, both approaches can run super-real time on consumer hardware. Our evaluation investigates many aspects of our methods, and compares the approaches to each other and to the state of the art. Results on silhouettes suggest broader applicability to other imaging modalities.
引用
收藏
页码:2821 / 2840
页数:20
相关论文
共 50 条
  • [21] UniPose: Unified Human Pose Estimation in Single Images and Videos
    Artacho, Bruno
    Savakis, Andreas
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 7033 - 7042
  • [22] 3D Hand Pose Estimation from Single Depth Images with Label Distribution Learning
    Xu, Yuanfei
    Wang, Xupeng
    2020 IEEE INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (ICESS), 2020,
  • [23] Top-Down Human Pose Estimation with Depth Images and Domain Adaptation
    Rodrigues, Nelson
    Torres, Helena
    Oliveira, Bruno
    Borges, Joao
    Queiros, Sandro
    Mendes, Jose
    Fonseca, Jaime
    Coelho, Victor
    Brito, Jose Henrique
    PROCEEDINGS OF THE 14TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 5, 2019, : 281 - 288
  • [24] 3D human pose estimation from range images with depth difference and geodesic distance
    Zhang, Wenhui
    Kong, Dehui
    Wang, Shaofan
    Wang, Zhiyong
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 59 : 272 - 282
  • [25] Constrained optimization for human pose estimation from depth sequences
    Zhu, Youding
    Fujimura, Kikuo
    COMPUTER VISION - ACCV 2007, PT I, PROCEEDINGS, 2007, 4843 : 408 - +
  • [26] Controlled human pose estimation from depth image streams
    Zhu, Youding
    Dariush, Behzad
    Fujimura, Kikuo
    2008 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, VOLS 1-3, 2008, : 1544 - +
  • [27] EFFICIENT DEPTH ESTIMATION FROM SINGLE IMAGE
    Zhou, Wei
    Dai, Yuchao
    He, Renjie
    2014 IEEE CHINA SUMMIT & INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (CHINASIP), 2014, : 296 - 300
  • [28] Monocular Human Depth Estimation Via Pose Estimation
    Jun, Jinyoung
    Lee, Jae-Han
    Lee, Chul
    Kim, Chang-Su
    IEEE ACCESS, 2021, 9 : 151444 - 151457
  • [29] Human Pose Estimation in Stereo Images
    Lallemand, Joe
    Szczot, Magdalena
    Ilic, Slobodan
    ARTICULATED MOTION AND DEFORMABLE OBJECTS, AMDO 2014, 2014, 8563 : 10 - 19
  • [30] REAL-TIME UPPER BODY POSE ESTIMATION FROM DEPTH IMAGES
    Tsai, Ming-Han
    Chen, Kuan-Hua
    Lin, I-Chen
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 2234 - 2238