Fusing visual and range imaging for object class recognition

被引:0
|
作者
Bar-Hillel, Aharon [1 ]
Hanukaev, Dmitri [2 ]
Levi, Dan [1 ]
机构
[1] Gen Motors Adv Tech Ctr, Hamada 7, Herzliyya, Israel
[2] Hebrew Univ Jerusalem, Ctr Neural comp, IL-91904 Jerusalem, Israel
来源
2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) | 2011年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Category level object recognition has improved significantly in the last few years, but machine performance remains unsatisfactory for most real-world applications. We believe this gap may be bridged using additional depth information obtained from range imaging, which was recently used to overcome similar problems in body shape interpretation. This paper presents a system which successfully fuses visual and range imaging for object category classification. We explore fusion at multiple levels: using depth as an attention mechanism, high-level fusion at the classifier level and low-level fusion of local descriptors, and show that each mechanism makes a unique contribution to performance. For low-level fusion we present a new algorithm for training of local descriptors, the Generalized Image Feature Transform (GIFT), which generalizes current representations such as SIFT and spatial pyramids and allows for the creation of new representations based on multiple channels of information. We show that our system improves state-of-the-art visual-only and depth-only methods on a diverse dataset of every-day objects.
引用
收藏
页码:65 / 72
页数:8
相关论文
共 50 条
  • [1] Fusing Visual Saliency for Material Recognition
    Qi, Lin
    Xu, Ying
    Shang, Xiaowei
    Dong, Junyu
    PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, : 2046 - 2049
  • [2] Visual object recognition through one-class learning
    Wang, QH
    Lopes, LS
    Tax, DMJ
    IMAGE ANALYSIS AND RECOGNITION, PT 1, PROCEEDINGS, 2004, 3211 : 463 - 470
  • [3] Fusing binaural sonar information for object recognition
    Kue, R
    MF '96 - 1996 IEEE/SICE/RSJ INTERNATIONAL CONFERENCE ON MULTISENSOR FUSION AND INTEGRATION FOR INTELLIGENT SYSTEMS, 1996, : 727 - 735
  • [4] An Object Recognition Method Based on Bag-of-Visual-Words and Fusing Multi-feature
    Qi Xueting
    Chen Tianhuang
    Wang Hongxia
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON LOGISTICS, ENGINEERING, MANAGEMENT AND COMPUTER SCIENCE, 2014, 101 : 957 - 961
  • [5] Electromagnetic Imaging Boosted Visual Object Recognition Under Difficult Visual Conditions
    Tan, Min
    Jin, Tao
    Ye, Danhui
    Xu, Kuiwen
    Gu, Xiaoling
    Yu, Jun
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [6] Visual object recognition
    Logothetis, NK
    Sheinberg, DL
    ANNUAL REVIEW OF NEUROSCIENCE, 1996, 19 : 577 - 621
  • [7] Class Representative Visual Words for Category-Level Object Recognition
    Lopez Sastre, Roberto Javier
    Tuytelaars, Tinne
    Maldonado Bascon, Saturnino
    PATTERN RECOGNITION AND IMAGE ANALYSIS, PROCEEDINGS, 2009, 5524 : 184 - +
  • [8] Fusing Bottom-up and Top-down Pathways in Neural Networks for Visual Object Recognition
    Zheng, Yuhua
    Meng, Yan
    Jin, Yaochu
    2010 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS IJCNN 2010, 2010,
  • [9] Fusing stereoscopic depth and region cues for object recognition
    Reno, AL
    Booth, DM
    PROCEEDINGS OF THE FIFTH JOINT CONFERENCE ON INFORMATION SCIENCES, VOLS 1 AND 2, 2000, : A115 - A118
  • [10] Fusing Object Information and Inertial Data for Activity Recognition
    Diete, Alexander
    Stuckenschmidt, Heiner
    SENSORS, 2019, 19 (19)