Joint Object Recognition and Pose Estimation using a Nonlinear View-Invariant Latent Generative Model

Authors
Bakry, Amr [1]
Elgaaly, Tarek [1]
Elhoseiny, Mohamed [1]
Elgammal, Ahmed [1]
Affiliations
[1] Rutgers State Univ, Dept Comp Sci, New Brunswick, NJ 08901 USA
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Object recognition and pose estimation are two fundamental problems in computer vision. Recognizing objects and their poses/viewpoints is a critical component of many vision and robotic systems. Multiple viewpoints of an object lie on an intrinsic low-dimensional manifold in the input space (i.e., descriptor space). Different objects captured from the same set of viewpoints have manifolds with a common topology. In this paper, we exploit this common topology between object manifolds by learning a low-dimensional latent space that maps nonlinearly between a common unified manifold and the object manifold in the input space. The latent space is computed using a supervised embedding approach and is used to jointly infer the category and pose of objects. We empirically validate our model using multiple inference approaches on multiple challenging datasets. We compare our results with the state of the art and report improved category recognition and pose estimation accuracy.
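The idea of a shared view manifold can be illustrated with a toy sketch. This is not the authors' model: it assumes the common manifold is a unit circle (one viewing angle), swaps the learned nonlinear mapping for a fixed Fourier basis, and uses synthetic three-dimensional descriptors. All names here (circle_features, fit_object_map, infer, toy_descriptor) are hypothetical and chosen only for illustration.

```python
import math


def circle_features(theta, harmonics=2):
    """Nonlinear features of a point on the unit circle (the assumed shared view manifold)."""
    feats = [1.0]
    for k in range(1, harmonics + 1):
        feats += [math.cos(k * theta), math.sin(k * theta)]
    return feats


def fit_object_map(descriptors, harmonics=2):
    """Fit a per-object map from circle features to descriptor space.

    Assumes the descriptors come from N uniformly spaced views with
    N > 2 * harmonics, so the Fourier features are orthogonal and a simple
    projection gives the least-squares coefficients.
    """
    n = len(descriptors)
    dim = len(descriptors[0])
    feats = [circle_features(2 * math.pi * i / n, harmonics) for i in range(n)]
    coeffs = []
    for d in range(dim):
        row = []
        for j in range(len(feats[0])):
            scale = 1.0 / n if j == 0 else 2.0 / n
            row.append(scale * sum(feats[i][j] * descriptors[i][d] for i in range(n)))
        coeffs.append(row)
    return coeffs


def synthesize(coeffs, theta, harmonics=2):
    """Map a pose on the shared circle to a predicted descriptor for one object."""
    f = circle_features(theta, harmonics)
    return [sum(c * x for c, x in zip(row, f)) for row in coeffs]


def infer(query, object_maps, steps=360, harmonics=2):
    """Jointly pick the category and pose whose predicted descriptor is closest to the query."""
    best = None
    for label, coeffs in object_maps.items():
        for s in range(steps):
            theta = 2 * math.pi * s / steps
            pred = synthesize(coeffs, theta, harmonics)
            err = sum((p - q) ** 2 for p, q in zip(pred, query))
            if best is None or err < best[0]:
                best = (err, label, theta)
    return best[1], math.degrees(best[2])


def toy_descriptor(label, theta):
    """Synthetic descriptor of a hypothetical object seen from angle theta."""
    if label == "cup":
        return [math.cos(theta), math.sin(theta), 0.5 * math.cos(2 * theta)]
    return [2.0 * math.cos(theta), 0.5 * math.sin(theta), 1.0 + 0.3 * math.sin(2 * theta)]


# Train one map per object from 12 uniformly spaced views, then query an unseen view.
views = [2 * math.pi * i / 12 for i in range(12)]
object_maps = {
    label: fit_object_map([toy_descriptor(label, t) for t in views])
    for label in ("cup", "car")
}
label, pose = infer(toy_descriptor("car", math.radians(75)), object_maps)
print(label, round(pose, 1))
```

Both objects share the same circle parameterization and differ only in their mapping coefficients, which is the sense in which the manifolds have a common topology. The exhaustive search over categories and angles stands in for the inference approaches mentioned in the abstract, which this record does not describe.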
Pages: 9