Recognizing RGB Images by Learning from RGB-D Data

被引:31
|
作者
Chen, Lin [1 ]
Li, Wen [2 ]
Xu, Dong [2 ]
机构
[1] Agcy Sci Technol & Res, Inst Infocomm Res, Singapore, Singapore
[2] Nanyang Technol Univ, Sch Comp Engn, Singapore, Singapore
关键词
D O I
10.1109/CVPR.2014.184
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we propose a new framework for recognizing RGB images captured by the conventional cameras by leveraging a set of labeled RGB-D data, in which the depth features can be additionally extracted from the depth images. We formulate this task as a new unsupervised domain adaptation (UDA) problem, in which we aim to take advantage of the additional depth features in the source domain and also cope with the data distribution mismatch between the source and target domains. To effectively utilize the additional depth features, we seek two optimal projection matrices to map the samples from both domains into a common space by preserving as much as possible the correlations between the visual features and depth features. To effectively employ the training samples from the source domain for learning the target classifier, we reduce the data distribution mismatch by minimizing the Maximum Mean Discrepancy (MMD) criterion, which compares the data distributions for each type of feature in the common space. Based on the above two motivations, we propose a new SVM based objective function to simultaneously learn the two projection matrices and the optimal target classifier in order to well separate the source samples from different classes when using each type of feature in the common space. An efficient alternating optimization algorithm is developed to solve our new objective function. Comprehensive experiments for object recognition and gender recognition demonstrate the effectiveness of our proposed approach for recognizing RGB images by learning from RGB-D data.
引用
收藏
页码:1418 / 1425
页数:8
相关论文
共 50 条
  • [31] Efficient Image Segmentation of RGB-D Images
    Fouad, Islam I.
    Rady, Sherine
    Mostafa, G. M. Mostafa
    2017 12TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND SYSTEMS (ICCES), 2017, : 353 - 358
  • [32] Kinematic Structures Estimation on the RGB-D Images
    Staszak, Rafal
    Molska, Milena
    Mlodzikowski, Kamil
    Ataman, Justyna
    Belter, Dominik
    2020 25TH IEEE INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES AND FACTORY AUTOMATION (ETFA), 2020, : 675 - 681
  • [33] Fast Edge Detection in RGB-D Images
    Casarrubias-Vargas, Heriberto
    Petrilli-Barcelo, Alberto
    Bayro-Corrochano, Eduardo
    PROGRESS IN PATTERN RECOGNITION IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2014, 2014, 8827 : 868 - 875
  • [34] Learning of perceptual grouping for object segmentation on RGB-D data
    Richtsfeld, Andreas
    Moerwald, Thomas
    Prankl, Johann
    Zillich, Michael
    Vincze, Markus
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2014, 25 (01) : 64 - 73
  • [35] Understanding Everyday Hands in Action from RGB-D Images
    Rogez, Gregory
    Supancic, James S., III
    Ramanan, Deva
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 3889 - 3897
  • [36] Matterport3D: Learning from RGB-D Data in Indoor Environments
    Chang, Angel
    Dai, Angela
    Funkhouser, Thomas
    Halber, Maciej
    Niessner, Matthias
    Savva, Manolis
    Song, Shuran
    Zeng, Andy
    Zhang, Yinda
    PROCEEDINGS 2017 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2017, : 667 - 676
  • [37] LEARNING-BASED HUMAN DETECTION APPLIED TO RGB-D IMAGES
    Santoso, Patrisia Sherryl
    Hang, Hsueh-Ming
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 3365 - 3369
  • [38] Fine-Grained Categorization From RGB-D Images
    Tan, Yanhao
    Rahman, Mohammad Muntasir
    Yan, Yanfu
    Xue, Jian
    Shao, Ling
    Lu, Ke
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 917 - 928
  • [39] Difference-in-level Detection from RGB-D Images
    Nonaka, Yusuke
    Uchiyama, Hideaki
    Saito, Hideo
    Yachida, Shoji
    Iwamoto, Kota
    ADVANCES IN VISUAL COMPUTING, ISVC 2022, PT II, 2022, 13599 : 393 - 406
  • [40] Guest Editorial: Feature Learning from RGB-D Data for Multimedia Applications
    Baochang Zhang
    Jungong Han
    Ling Shao
    Multimedia Tools and Applications, 2017, 76 : 4243 - 4248