Combining Generative and Discriminative Models in a Framework for Articulated Pose Estimation

被引:0
|
作者
RÓMer Rosales
Stan Sclaroff
机构
[1] Massachusetts Institute of Technology,Computer Science and Artificial Intelligence Laboratory
[2] Boston University,Image and Video Computing Group, Dept. of Computer Science
关键词
human body pose; hand pose; nonrigid and articulated pose estimation; statistical inference; generative and discriminative models; mixture models; expectation maximization algorithm;
D O I
暂无
中图分类号
学科分类号
摘要
We develop a method for the estimation of articulated pose, such as that of the human body or the human hand, from a single (monocular) image. Pose estimation is formulated as a statistical inference problem, where the goal is to find a posterior probability distribution over poses as well as a maximum a posteriori (MAP) estimate. The method combines two modeling approaches, one discriminative and the other generative. The discriminative model consists of a set of mapping functions that are constructed automatically from a labeled training set of body poses and their respective image features. The discriminative formulation allows for modeling ambiguous, one-to-many mappings (through the use of multi-modal distributions) that may yield multiple valid articulated pose hypotheses from a single image. The generative model is defined in terms of a computer graphics rendering of poses. While the generative model offers an accurate way to relate observed (image features) and hidden (body pose) random variables, it is difficult to use it directly in pose estimation, since inference is computationally intractable. In contrast, inference with the discriminative model is tractable, but considerably less accurate for the problem of interest. A combined discriminative/generative formulation is derived that leverages the complimentary strengths of both models in a principled framework for articulated pose inference. Two efficient MAP pose estimation algorithms are derived from this formulation; the first is deterministic and the second non-deterministic. Performance of the framework is quantitatively evaluated in estimating articulated pose of both the human hand and human body.
引用
收藏
页码:251 / 276
页数:25
相关论文
共 50 条
  • [1] Combining generative and discriminative models in a framework for articulated pose estimation
    Rosales, Romer
    Sclaroff, Stan
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2006, 67 (03) : 251 - 276
  • [2] Combining Discriminative and Generative Methods for 3D Deformable Surface and Articulated Pose Reconstruction
    Salzmann, Mathieu
    Urtasun, Raquel
    2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 647 - 654
  • [3] Articulated Pose Estimation using Discriminative Armlet Classifiers
    Gkioxari, Georgia
    Arbelaez, Pablo
    Bourdev, Lubomir
    Malik, Jitendra
    2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 3342 - 3349
  • [4] Cascaded Models for Articulated Pose Estimation
    Sapp, Benjamin
    Toshev, Alexander
    Taskar, Ben
    COMPUTER VISION-ECCV 2010, PT II, 2010, 6312 : 406 - +
  • [5] Combining Generative and Discriminative Models for Hybrid Inference
    Satorras, Victor Garcia
    Akata, Zeynep
    Welling, Max
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [6] Anomaly Detection Combining Discriminative and Generative Models
    Higa, Kyota
    Sato, Hideaki
    Shiraishi, Soma
    Kikuchi, Katsumi
    Iwamoto, Kota
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGING SYSTEMS & TECHNIQUES (IST 2019), 2019,
  • [7] Using Richer Models for Articulated Pose Estimation of Footballers
    Kazemi, Vahid
    Sullivan, Josephine
    PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2012, 2012,
  • [8] Articulated Pose Estimation with Parts Connectivity using Discriminative Local Oriented Contours
    Ukita, Norimichi
    2012 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2012, : 3154 - 3161
  • [9] Combining Discriminative and Model Based Approaches for Hand Pose Estimation
    Krejov, Philip
    Gilbert, Andrew
    Bowden, Richard
    2015 11TH IEEE INTERNATIONAL CONFERENCE AND WORKSHOPS ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG), VOL. 1, 2015,
  • [10] People detection and articulated pose estimation framework for crowded scenes
    Alyammahi, Sohailah
    Bhaskar, Harish
    Ruta, Dymitr
    Al-Mualla, Mohammed
    KNOWLEDGE-BASED SYSTEMS, 2017, 131 : 83 - 104