Application of Transfer Learning in RGB-D Object Recognition

被引:0
|
作者
Kumar, Abhishek [1 ]
Shrivatsav, S. Nithin [2 ]
Subrahmanyam, G. R. K. S. [3 ]
Mishra, Deepak [3 ]
机构
[1] Vellore Inst Technol, Sch Elect Engn, Vellore 632014, Tamil Nadu, India
[2] Natl Inst Technol, Dept EEE, Tiruchirappalli 620015, Tamil Nadu, India
[3] Dept Avion IIST, Thiruvananthapuram 695547, Kerala, India
关键词
Computer Vision; Deep Learning; Convolution Neural Network; Transfer Learning;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this work, we apply Transfer Learning for a Multimodal Deep learning network for fast and robust object recognition using RGB-D dataset. The ability for a network to train quickly and recognize objects robustly is very important in the field of Robotics. The Multimodal deep learning network avoids time-consuming hand-crafted features and makes use of a RGB-D architecture for robust object recognition. Our architecture has two important features. First, it makes use of both RGB and Depth information of an image to recognize it. To achieve this, our architecture has two CNN processing streams, one for RGB modality and the other for the depth modality. This enables the network to achieve higher accuracy than normal single stream RGB network. We encoded the depth image into colour image before passing it into the CNN stream. The other important feature is the speed of training and improving the accuracy further. To achieve this, we made use of Transfer learning. Firstly we trained a CNN network with 10 classes of different objects and then we transfer the parameters to RGB and depth CNN network. This enables the network to train faster and also achieve higher accuracy for a given number of epochs.
引用
收藏
页码:580 / 584
页数:5
相关论文
共 50 条
  • [1] Deep sensorimotor learning for RGB-D object recognition
    Thermos, Spyridon
    Papadopoulos, Georgios Th.
    Daras, Petros
    Potamianos, Gerasimos
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2020, 190
  • [2] Learning Coupled Classifiers with RGB images for RGB-D object recognition
    Li, Xiao
    Fang, Min
    Zhang, Ju-Jie
    Wu, Jinqiao
    PATTERN RECOGNITION, 2017, 61 : 433 - 446
  • [3] Learning Hierarchical Representation with Sparsity for RGB-D Object Recognition
    Yu, Kuan-Ting
    Tseng, Shih-Huan
    Fu, Li-Chen
    2012 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2012, : 3011 - 3016
  • [4] Multimodal Deep Learning for Robust RGB-D Object Recognition
    Eitel, Andreas
    Springenberg, Jost Tobias
    Spinello, Luciano
    Riedmiller, Martin
    Burgard, Wolfram
    2015 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2015, : 681 - 687
  • [5] Discriminative Feature Learning for Efficient RGB-D Object Recognition
    Asif, Umar
    Bennamoun, Mohammed
    Sohel, Ferdous
    2015 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2015, : 272 - 279
  • [6] Semi-Supervised Learning for RGB-D Object Recognition
    Cheng, Yanhua
    Zhao, Xin
    Huang, Kaiqi
    Tan, Tieniu
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 2377 - 2382
  • [7] Subset based deep learning for RGB-D object recognition
    Bai, Jing
    Wu, Yan
    Zhang, Junming
    Chen, Fuqiang
    NEUROCOMPUTING, 2015, 165 : 280 - 292
  • [8] RGB-D Object Modelling for Object Recognition and Tracking
    Prankl, Johann
    Aldoma, Aitor
    Svejda, Alexander
    Vincze, Markus
    2015 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2015, : 96 - 103
  • [9] Latent Tensor Transfer Learning for RGB-D Action Recognition
    Jia, Chengcheng
    Kong, Yu
    Ding, Zhengming
    Fu, Yun
    PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14), 2014, : 87 - 96
  • [10] Combining Features For RGB-D object Recognition
    Khan, Wasif
    Phaisangittisagul, Ekachai
    Ali, Luqman
    Gansawat, Duangrat
    Kumazawa, Itsuo
    2017 INTERNATIONAL ELECTRICAL ENGINEERING CONGRESS (IEECON), 2017,