Fusing visual and range imaging for object class recognition

被引：0

作者：

Bar-Hillel, Aharon ^{[1
]}

Hanukaev, Dmitri ^{[2
]}

Levi, Dan ^{[1
]}

机构：

[1] Gen Motors Adv Tech Ctr, Hamada 7, Herzliyya, Israel

[2] Hebrew Univ Jerusalem, Ctr Neural comp, IL-91904 Jerusalem, Israel

来源：

2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) | 2011年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Category level object recognition has improved significantly in the last few years, but machine performance remains unsatisfactory for most real-world applications. We believe this gap may be bridged using additional depth information obtained from range imaging, which was recently used to overcome similar problems in body shape interpretation. This paper presents a system which successfully fuses visual and range imaging for object category classification. We explore fusion at multiple levels: using depth as an attention mechanism, high-level fusion at the classifier level and low-level fusion of local descriptors, and show that each mechanism makes a unique contribution to performance. For low-level fusion we present a new algorithm for training of local descriptors, the Generalized Image Feature Transform (GIFT), which generalizes current representations such as SIFT and spatial pyramids and allows for the creation of new representations based on multiple channels of information. We show that our system improves state-of-the-art visual-only and depth-only methods on a diverse dataset of every-day objects.

引用

页码：65 / 72

页数：8

共 50 条

[1] Fusing Visual Saliency for Material Recognition
Qi, Lin
Xu, Ying
Shang, Xiaowei
Dong, Junyu
PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, : 2046 - 2049
[2] Visual object recognition through one-class learning
Wang, QH
Lopes, LS
Tax, DMJ
IMAGE ANALYSIS AND RECOGNITION, PT 1, PROCEEDINGS, 2004, 3211 : 463 - 470
[3] Fusing binaural sonar information for object recognition
Kue, R
MF '96 - 1996 IEEE/SICE/RSJ INTERNATIONAL CONFERENCE ON MULTISENSOR FUSION AND INTEGRATION FOR INTELLIGENT SYSTEMS, 1996, : 727 - 735
[4] An Object Recognition Method Based on Bag-of-Visual-Words and Fusing Multi-feature
Qi Xueting
Chen Tianhuang
Wang Hongxia
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON LOGISTICS, ENGINEERING, MANAGEMENT AND COMPUTER SCIENCE, 2014, 101 : 957 - 961
[5] Electromagnetic Imaging Boosted Visual Object Recognition Under Difficult Visual Conditions
Tan, Min
Jin, Tao
Ye, Danhui
Xu, Kuiwen
Gu, Xiaoling
Yu, Jun
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
[6] Visual object recognition
Logothetis, NK
Sheinberg, DL
ANNUAL REVIEW OF NEUROSCIENCE, 1996, 19 : 577 - 621
[7] Class Representative Visual Words for Category-Level Object Recognition
Lopez Sastre, Roberto Javier
Tuytelaars, Tinne
Maldonado Bascon, Saturnino
PATTERN RECOGNITION AND IMAGE ANALYSIS, PROCEEDINGS, 2009, 5524 : 184 - +
[8] Fusing Bottom-up and Top-down Pathways in Neural Networks for Visual Object Recognition
Zheng, Yuhua
Meng, Yan
Jin, Yaochu
2010 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS IJCNN 2010, 2010,
[9] Fusing stereoscopic depth and region cues for object recognition
Reno, AL
Booth, DM
PROCEEDINGS OF THE FIFTH JOINT CONFERENCE ON INFORMATION SCIENCES, VOLS 1 AND 2, 2000, : A115 - A118
[10] Fusing Object Information and Inertial Data for Activity Recognition
Diete, Alexander
Stuckenschmidt, Heiner
SENSORS, 2019, 19 (19)

← 1 2 3 4 5 →