Joint Object Recognition and Pose Estimation using a Nonlinear View-Invariant Latent Generative Model

Cited by: 0

Authors:

Bakry, Amr [1]

Elgaaly, Tarek [1]

Elhoseiny, Mohamed [1]

Elgammal, Ahmed [1]

Affiliation:

[1] Rutgers State Univ, Dept Comp Sci, New Brunswick, NJ 08901 USA

Source:

2016 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2016) | 2016

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Object recognition and pose estimation are two fundamental problems in computer vision. Recognizing objects and their poses (viewpoints) is a critical component of many vision and robotic systems. Multiple viewpoints of an object lie on an intrinsic low-dimensional manifold in the input space (i.e., the descriptor space). Different objects captured from the same set of viewpoints have manifolds with a common topology. In this paper, we exploit this common topology between object manifolds by learning a low-dimensional latent space that maps non-linearly between a common unified manifold and the object manifold in the input space. The latent space is computed using a supervised embedding approach and is used to jointly infer the category and pose of objects. We empirically validate our model using multiple inference approaches and test it on multiple challenging datasets. We compare our results with the state of the art and report improved category recognition and pose estimation accuracy.


Pages: 9

50 items in total

[41] Object recognition and pose estimation using appearance manifolds
Zhong-Hua Hao
Shi-Wei Ma
Advances in Manufacturing, 2013, 1 (03) : 258 - 264
[42] Object recognition and pose estimation using appearance manifolds
Hao, Zhong-Hua
Ma, Shi-Wei
ADVANCES IN MANUFACTURING, 2013, 1 (03) : 258 - 264
[43] Toward view-invariant representations of object structure learned using object constancy cues in natural movies
Colombe, JB
AIPR 2004: 33RD APPLIED IMAGERY PATTERN RECOGNITION WORKSHOP, PROCEEDINGS: EMERGING TECHNOLOGIES AND APPLICATIONS FOR IMAGERY PATTERN RECOGNITION, 2005, : 86 - 91
[44] View-invariant object category learning, recognition, and search: How spatial and object attention are coordinated using surface-based attentional shrouds
Fazl, Arash
Grossberg, Stephen
Mingolla, Ennio
COGNITIVE PSYCHOLOGY, 2009, 58 (01) : 1 - 48
[45] View-invariant object recognition ability develops after discrimination, not mere exposure, at several viewing angles
Yamashita, Wakayo
Wang, Gang
Tanaka, Keiji
EUROPEAN JOURNAL OF NEUROSCIENCE, 2010, 31 (02) : 327 - 335
[46] A model based approach for pose estimation and rotation invariant object matching
Unsalan, Cem
PATTERN RECOGNITION LETTERS, 2007, 28 (01) : 49 - 57
[47] Towards View-Invariant Intersection Recognition from Videos using Deep Network Ensembles
Kumar, Abhijeet
Gupta, Gunshi
Sharma, Avinash
Krishna, K. Madhava
2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2018, : 1053 - 1060
[48] View-invariant gait recognition system using a gait energy image decomposition method
Verlekar, Tanmay T.
Correia, Paulo L.
Soares, Luis D.
IET BIOMETRICS, 2017, 6 (04) : 299 - 306
[49] View-Invariant 3D Human Body Pose Reconstruction using a Monocular Video Camera
Ke, Shian-Ru
Hwang, Jenq-Neng
Lan, Kung-Ming
Wang, Shen-Zheng
2011 FIFTH ACM/IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED SMART CAMERAS (ICDSC), 2011,
[50] A Hierarchical Approach for Joint Multi-view Object Pose Estimation and Categorization
Ozay, Mete
Walas, Krzysztof
Leonardis, Ales
2014 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2014, : 5480 - 5487
