Recognizing RGB Images by Learning from RGB-D Data

被引:31
|
作者
Chen, Lin [1 ]
Li, Wen [2 ]
Xu, Dong [2 ]
机构
[1] Agcy Sci Technol & Res, Inst Infocomm Res, Singapore, Singapore
[2] Nanyang Technol Univ, Sch Comp Engn, Singapore, Singapore
关键词
D O I
10.1109/CVPR.2014.184
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we propose a new framework for recognizing RGB images captured by the conventional cameras by leveraging a set of labeled RGB-D data, in which the depth features can be additionally extracted from the depth images. We formulate this task as a new unsupervised domain adaptation (UDA) problem, in which we aim to take advantage of the additional depth features in the source domain and also cope with the data distribution mismatch between the source and target domains. To effectively utilize the additional depth features, we seek two optimal projection matrices to map the samples from both domains into a common space by preserving as much as possible the correlations between the visual features and depth features. To effectively employ the training samples from the source domain for learning the target classifier, we reduce the data distribution mismatch by minimizing the Maximum Mean Discrepancy (MMD) criterion, which compares the data distributions for each type of feature in the common space. Based on the above two motivations, we propose a new SVM based objective function to simultaneously learn the two projection matrices and the optimal target classifier in order to well separate the source samples from different classes when using each type of feature in the common space. An efficient alternating optimization algorithm is developed to solve our new objective function. Comprehensive experiments for object recognition and gender recognition demonstrate the effectiveness of our proposed approach for recognizing RGB images by learning from RGB-D data.
引用
收藏
页码:1418 / 1425
页数:8
相关论文
共 50 条
  • [21] Dynamic Objects Recognizing and Masking for RGB-D SLAM
    Li, Xiangcheng
    Wu, Huaiyu
    Chen, Zhihuan
    2021 4TH INTERNATIONAL CONFERENCE ON INTELLIGENT AUTONOMOUS SYSTEMS (ICOIAS 2021), 2021, : 169 - 174
  • [22] An end-to-end learning framework for visual camera relocalization using RGB and RGB-D images
    Zhang, Kai
    Meng, Xiaolin
    Wang, Qing
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2024, 35 (09)
  • [23] RGB-D IBR: Rendering Indoor Scenes Using Sparse RGB-D Images with Local Alignments
    Jeong, Yeongyu
    Kim, Haejoon
    Seo, Hyewon
    Cordier, Frederic
    Lee, Seungyong
    PROCEEDINGS I3D 2016: 20TH ACM SIGGRAPH SYMPOSIUM ON INTERACTIVE 3D GRAPHICS AND GAMES, 2016, : 205 - 206
  • [24] RGB×D: Learning depth-weighted RGB patches for RGB-D indoor semantic segmentation
    Cao, Jinming
    Leng, Hanchao
    Cohen-Or, Daniel
    Lischinski, Dani
    Chen, Ying
    Tu, Changhe
    Li, Yangyan
    Neurocomputing, 2021, 462 : 568 - 580
  • [25] Visual Camera Re-Localization From RGB and RGB-D Images Using DSAC
    Brachmann, Eric
    Rother, Carsten
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (09) : 5847 - 5865
  • [26] People Detection in RGB-D Data
    Spinello, Luciano
    Arras, Kai O.
    2011 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, 2011, : 3838 - 3843
  • [27] Robust Localization Using RGB-D Images
    Oh, Yoonseon
    Oh, Songhwai
    2014 14TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2014), 2014, : 1023 - 1026
  • [28] REFLECTION REMOVAL USING RGB-D IMAGES
    Shibata, Toshihiro
    Akai, Yuji
    Matsuoka, Ryo
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 1862 - 1866
  • [29] Segmentation of Shipping Bags in RGB-D Images
    Vasileva, Elena
    Ivanovski, Zoran
    2022 IEEE 5TH INTERNATIONAL CONFERENCE ON IMAGE PROCESSING APPLICATIONS AND SYSTEMS, IPAS, 2022,
  • [30] Structured Images for RGB-D Action Recognition
    Wang, Pichao
    Wang, Shuang
    Gao, Zhimin
    Hou, Yonghong
    Li, Wanqing
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, : 1005 - 1014