Aligning 3D Models to RGB-D Images of Cluttered Scenes

被引:0
|
作者
Gupta, Saurabh [1 ]
Arbelaez, Pablo [2 ]
Girshick, Ross [3 ]
Malik, Jitendra [1 ]
机构
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
[2] Univ Los Andes, Bogota, Colombia
[3] Microsoft Res, Redmond, WA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The goal of this work is to represent objects in an RGB-D scene with corresponding 3D models from a library. We approach this problem by first detecting and segmenting object instances in the scene and then using a convolutional neural network (CNN) to predict the pose of the object. This CNN is trained using pixel surface normals in images containing renderings of synthetic objects. When tested on real data, our method outperforms alternative algorithms trained on real data. We then use this coarse pose estimate along with the inferred pixel support to align a small number of prototypical models to the data, and place into the scene the model that fits best. We observe a 48% relative improvement in performance at the task of 3D detection over the current state-of-the-art [34], while being an order of magnitude faster.
引用
收藏
页码:4731 / 4740
页数:10
相关论文
共 50 条
  • [1] Dense 3D Semantic Mapping of Indoor Scenes from RGB-D Images
    Hermans, Alexander
    Floros, Georgios
    Leibe, Bastian
    2014 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2014, : 2631 - 2638
  • [2] Consistent 3D Models from Unorganized RGB-D Images
    Tascon Vidarte, Jose David
    Loaiza Correa, Humberto
    2014 IEEE 27TH CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE), 2014,
  • [3] InpaintFusion: Incremental RGB-D Inpainting for 3D Scenes
    Mori, Shohei
    Erat, Okan
    Broll, Wolfgang
    Saito, Hideo
    Schmalstieg, Dieter
    Kalkofen, Denis
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2020, 26 (10) : 2994 - 3007
  • [4] Reconstructing 3D Contour Models of General Scenes from RGB-D Sequences
    Wang, Weiran
    Di, Huijun
    Song, Lingxiao
    MULTIMEDIA MODELING, MMM 2022, PT II, 2022, 13142 : 158 - 170
  • [5] 3D Texture Recognition for RGB-D Images
    Zhong, Guoqiang
    Mao, Xin
    Shi, Yaxin
    Dong, Junyu
    COMPUTER ANALYSIS OF IMAGES AND PATTERNS, CAIP 2015, PT II, 2015, 9257 : 518 - 528
  • [6] An Improved High Precision 3D Semantic Mapping of Indoor Scenes from RGB-D Images
    Xin, Jing
    Du, Kenan
    Feng, Jiale
    Shan, Mao
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2023, 137 (03): : 2621 - 2640
  • [7] Basic 3D Solid Recognition in RGB-D Images
    Kornuta, Tomasz
    Stefanczyk, Maciej
    Kasprzak, Wlodzimierz
    RECENT ADVANCES IN AUTOMATION, ROBOTICS AND MEASURING TECHNIQUES, 2014, 267 : 421 - 430
  • [8] Direct Object Reconstruction on RGB-D Images in Cluttered Environment
    Zielinski, Mikolaj
    Kulecki, Bartlomiej
    Belter, Dominik
    2023 EUROPEAN CONFERENCE ON MOBILE ROBOTS, ECMR, 2023, : 244 - 250
  • [9] 3D Reconstruction of Indoor Scenes Using RGB-D Monocular Vision
    Liu, Sanmao
    Zhu, Wenqiu
    Zhang, Canqing
    Sun, Wenjing
    2016 INTERNATIONAL CONFERENCE ON ROBOTS & INTELLIGENT SYSTEM (ICRIS), 2016, : 1 - 7
  • [10] 3D Reconstruction of Indoor Scenes using a Single RGB-D Image
    Bokaris, Panagiotis-Alexandros
    Muselet, Damien
    Tremeau, Alain
    PROCEEDINGS OF THE 12TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISIGRAPP 2017), VOL 6, 2017, : 394 - 401