Joint stereo 3D object detection and implicit surface reconstruction

Cited by: 1
Authors
Li, Shichao [1]
Huang, Xijie [1]
Liu, Zechun [2]
Cheng, Kwang-Ting [1]
Affiliations
[1] HKUST, Dept Comp Sci & Engn, Hong Kong 999077, Peoples R China
[2] Meta Real Labs, Pittsburgh, PA 15222 USA
Source
SCIENTIFIC REPORTS | 2024 / Vol. 14 / Issue 01
DOI
10.1038/s41598-024-64677-2
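Chinese Library Classification (CLC) code
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]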
Discipline classification code
07; 0710; 09
Abstract
We present S-3D-RCNN, a new learning-based framework that recovers accurate object orientation in SO(3) and simultaneously predicts implicit rigid shapes from stereo RGB images. For orientation estimation, in contrast to previous studies that map local appearance to observation angles, we propose a progressive approach that extracts meaningful Intermediate Geometrical Representations (IGRs). This approach features a deep model that transforms perceived intensities from one or two views into object part coordinates, enabling direct egocentric object orientation estimation in the camera coordinate system. To describe objects more finely inside their 3D bounding boxes, we investigate implicit shape estimation from stereo images. We model visible object surfaces with a point-based representation, augmenting the IGRs to explicitly address the unseen surface hallucination problem. Extensive experiments validate the effectiveness of the proposed IGRs, and S-3D-RCNN achieves superior 3D scene understanding performance. We also design new metrics on the KITTI benchmark to evaluate implicit shape estimation.
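To make the idea of estimating orientation from part coordinates concrete, the sketch below recovers a rotation in SO(3) from 3D part coordinates by rigid alignment (the Kabsch algorithm). This is an illustrative assumption, not the method of S-3D-RCNN: the abstract does not say how part coordinates are converted to an orientation. The names `kabsch_rotation`, `yaw_matrix`, `parts`, `r_true`, and `r_est`, the box dimensions, and the choice of box corners as "parts" are all hypothetical.

```python
import numpy as np


def kabsch_rotation(canonical, observed):
    """Return R in SO(3) that best maps centered canonical points onto observed points.

    Both inputs are (N, 3) arrays with row i of one corresponding to row i of the other.
    """
    canonical = np.asarray(canonical, dtype=float)
    observed = np.asarray(observed, dtype=float)
    # Remove translation by centering both point sets.
    p = canonical - canonical.mean(axis=0)
    q = observed - observed.mean(axis=0)
    # 3x3 cross-covariance between the two centered sets.
    h = p.T @ q
    u, _, vt = np.linalg.svd(h)
    # Flip the last axis if needed so that det(R) = +1 (a rotation, not a reflection).
    d = 1.0 if np.linalg.det(vt.T @ u.T) >= 0 else -1.0
    return vt.T @ np.diag([1.0, 1.0, d]) @ u.T


def yaw_matrix(theta):
    """Rotation by theta radians about the y-axis (hypothetical helper)."""
    c, s = np.cos(theta), np.sin(theta)
    return np.array([[c, 0.0, s], [0.0, 1.0, 0.0], [-s, 0.0, c]])


# Hypothetical object parts: the 8 corners of a 4.0 x 1.5 x 1.8 box in the object frame.
half = np.array([2.0, 0.75, 0.9])
signs = np.array([[sx, sy, sz] for sx in (-1, 1) for sy in (-1, 1) for sz in (-1, 1)])
parts = signs * half

# Simulate the same parts expressed in the camera frame: rotate, then translate.
r_true = yaw_matrix(np.deg2rad(30.0))
translation = np.array([1.0, 1.6, 20.0])
parts_in_camera = parts @ r_true.T + translation

r_est = kabsch_rotation(parts, parts_in_camera)
print(np.round(r_est, 3))
```

With noise-free correspondences as above, the estimated rotation matches the simulated one up to floating-point error; with predicted (noisy) part coordinates, the same routine gives a least-squares fit.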
Pages: 19