Aligning 3D Models to RGB-D Images of Cluttered Scenes

Authors
Gupta, Saurabh [1]
Arbelaez, Pablo [2]
Girshick, Ross [3]
Malik, Jitendra [1]
Affiliations
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
[2] Univ Los Andes, Bogota, Colombia
[3] Microsoft Res, Redmond, WA USA
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The goal of this work is to represent objects in an RGB-D scene with corresponding 3D models from a library. We approach this problem by first detecting and segmenting object instances in the scene and then using a convolutional neural network (CNN) to predict the pose of the object. This CNN is trained using pixel surface normals in images containing renderings of synthetic objects. When tested on real data, our method outperforms alternative algorithms trained on real data. We then use this coarse pose estimate along with the inferred pixel support to align a small number of prototypical models to the data, and place into the scene the model that fits best. We observe a 48% relative improvement in performance at the task of 3D detection over the current state-of-the-art [34], while being an order of magnitude faster.
Pages: 4731-4740
Page count: 10