Combining Generative and Discriminative Models in a Framework for Articulated Pose Estimation

被引：0

作者：

RÓMer Rosales

Stan Sclaroff

机构：

[1] Massachusetts Institute of Technology,Computer Science and Artificial Intelligence Laboratory

[2] Boston University,Image and Video Computing Group, Dept. of Computer Science

来源：

International Journal of Computer Vision | 2006年 / 67卷

关键词：

human body pose; hand pose; nonrigid and articulated pose estimation; statistical inference; generative and discriminative models; mixture models; expectation maximization algorithm;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

We develop a method for the estimation of articulated pose, such as that of the human body or the human hand, from a single (monocular) image. Pose estimation is formulated as a statistical inference problem, where the goal is to find a posterior probability distribution over poses as well as a maximum a posteriori (MAP) estimate. The method combines two modeling approaches, one discriminative and the other generative. The discriminative model consists of a set of mapping functions that are constructed automatically from a labeled training set of body poses and their respective image features. The discriminative formulation allows for modeling ambiguous, one-to-many mappings (through the use of multi-modal distributions) that may yield multiple valid articulated pose hypotheses from a single image. The generative model is defined in terms of a computer graphics rendering of poses. While the generative model offers an accurate way to relate observed (image features) and hidden (body pose) random variables, it is difficult to use it directly in pose estimation, since inference is computationally intractable. In contrast, inference with the discriminative model is tractable, but considerably less accurate for the problem of interest. A combined discriminative/generative formulation is derived that leverages the complimentary strengths of both models in a principled framework for articulated pose inference. Two efficient MAP pose estimation algorithms are derived from this formulation; the first is deterministic and the second non-deterministic. Performance of the framework is quantitatively evaluated in estimating articulated pose of both the human hand and human body.

引用

页码：251 / 276

页数：25

共 50 条

[1] Combining generative and discriminative models in a framework for articulated pose estimation
Rosales, Romer
Sclaroff, Stan
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2006, 67 (03) : 251 - 276
[2] Combining Discriminative and Generative Methods for 3D Deformable Surface and Articulated Pose Reconstruction
Salzmann, Mathieu
Urtasun, Raquel
2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 647 - 654
[3] Articulated Pose Estimation using Discriminative Armlet Classifiers
Gkioxari, Georgia
Arbelaez, Pablo
Bourdev, Lubomir
Malik, Jitendra
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 3342 - 3349
[4] Cascaded Models for Articulated Pose Estimation
Sapp, Benjamin
Toshev, Alexander
Taskar, Ben
COMPUTER VISION-ECCV 2010, PT II, 2010, 6312 : 406 - +
[5] Combining Generative and Discriminative Models for Hybrid Inference
Satorras, Victor Garcia
Akata, Zeynep
Welling, Max
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
[6] Anomaly Detection Combining Discriminative and Generative Models
Higa, Kyota
Sato, Hideaki
Shiraishi, Soma
Kikuchi, Katsumi
Iwamoto, Kota
2019 IEEE INTERNATIONAL CONFERENCE ON IMAGING SYSTEMS & TECHNIQUES (IST 2019), 2019,
[7] Using Richer Models for Articulated Pose Estimation of Footballers
Kazemi, Vahid
Sullivan, Josephine
PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2012, 2012,
[8] Articulated Pose Estimation with Parts Connectivity using Discriminative Local Oriented Contours
Ukita, Norimichi
2012 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2012, : 3154 - 3161
[9] Combining Discriminative and Model Based Approaches for Hand Pose Estimation
Krejov, Philip
Gilbert, Andrew
Bowden, Richard
2015 11TH IEEE INTERNATIONAL CONFERENCE AND WORKSHOPS ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG), VOL. 1, 2015,
[10] People detection and articulated pose estimation framework for crowded scenes
Alyammahi, Sohailah
Bhaskar, Harish
Ruta, Dymitr
Al-Mualla, Mohammed
KNOWLEDGE-BASED SYSTEMS, 2017, 131 : 83 - 104

← 1 2 3 4 5 →