A dual-source approach for 3D human pose estimation from single images

被引:22
|
作者
Iqbal, Umar [1 ]
Doering, Andreas [1 ]
Yasin, Hashim [2 ]
Kruger, Bjorn [3 ]
Weber, Andreas [4 ]
Gall, Juergen [1 ]
机构
[1] Univ Bonn, Comp Vis Grp, Bonn, Germany
[2] Natl Univ Comp & Emerging Sci, Islamabad, Pakistan
[3] Gokhale Method Inst, Stanford, CA USA
[4] Univ Bonn, Virtual Real Grp, Simulat, Multimedia, Bonn, Germany
关键词
3D human pose estimation; Motion capture; 3D reconstruction; Articulated pose estimation;
D O I
10.1016/j.cviu.2018.03.007
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work we address the challenging problem of 3D human pose estimation from single images. Recent approaches learn deep neural networks to regress 3D pose directly from images. One major challenge for such methods, however, is the collection of large amounts of training data. Particularly, collecting a large number of unconstrained images that are annotated with accurate 3D poses is impractical. We therefore propose to use two independent training sources. The first source consists of accurate 3D motion capture data, and the second source consists of unconstrained images with annotated 2D poses. To incorporate both sources, we propose a dual-source approach that combines 2D pose estimation with efficient 3D pose retrieval. To this end, we first convert the motion capture data into a normalized 2D pose space, and separately learn a 2D pose estimation model from the image data. During inference, we estimate the 2D pose and efficiently retrieve the nearest 3D poses. We then jointly estimate a mapping from the 3D pose space to the image and reconstruct the 3D pose. We provide a comprehensive evaluation of the proposed method and experimentally demonstrate the effectiveness of our approach, even when the skeleton structures of the two sources differ substantially.
引用
收藏
页码:37 / 49
页数:13
相关论文
共 50 条
  • [1] A Dual-Source Approach for 3D Pose Estimation from a Single Image
    Yasin, Hashim
    Iqbal, Umar
    Kruger, Bjorn
    Weber, Andreas
    Gall, Juergen
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 4948 - 4956
  • [3] Robust 3D Human Pose Estimation from Single Images or Video Sequences
    Wang, Chunyu
    Wang, Yizhou
    Lin, Zhouchen
    Yuille, Alan L.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (05) : 1227 - 1241
  • [4] Multiple human 3D pose estimation from multiview images
    Ershadi-Nasab, Sara
    Noury, Erfan
    Kasaei, Shohreh
    Sanaei, Esmaeil
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (12) : 15573 - 15601
  • [5] Multiple human 3D pose estimation from multiview images
    Sara Ershadi-Nasab
    Erfan Noury
    Shohreh Kasaei
    Esmaeil Sanaei
    Multimedia Tools and Applications, 2018, 77 : 15573 - 15601
  • [6] Bayesian capsule networks for 3D human pose estimation from single 2D images
    Ramirez, Ivan
    Cuesta-Infante, Alfredo
    Schiavi, Emanuele
    Jose Pantrigo, Juan
    NEUROCOMPUTING, 2020, 379 (379) : 64 - 73
  • [7] Multimodal 3D Human Pose Estimation from a Single Image
    Spurlock, Scott
    Souvenir, Richard
    2019 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2019), 2019, : 663 - 670
  • [8] Real-time 3D Pose Estimation from Single Depth Images
    Schnuerer, Thomas
    Fuchs, Stefan
    Eisenbach, Markus
    Gross, Horst-Michael
    PROCEEDINGS OF THE 14TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 5, 2019, : 716 - 724
  • [9] 3D Object Pose Estimation from Binarized Images
    Kagami, Shingo
    Morita, Masaru
    Hashimoto, Koichi
    2012 PROCEEDINGS OF SICE ANNUAL CONFERENCE (SICE), 2012, : 759 - 761
  • [10] Single Image 3D Human Pose Estimation from Noisy Observations
    Simo-Serra, E.
    Ramisa, A.
    Alenya, G.
    Torras, C.
    Moreno-Noguer, F.
    2012 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2012, : 2673 - 2680