Joint stereo 3D object detection and implicit surface reconstruction

被引:1
|
作者
Li, Shichao [1 ]
Huang, Xijie [1 ]
Liu, Zechun [2 ]
Cheng, Kwang-Ting [1 ]
机构
[1] HKUST, Dept Comp Sci & Engn, Hong Kong 999077, Peoples R China
[2] Meta Real Labs, Pittsburgh, PA 15222 USA
来源
SCIENTIFIC REPORTS | 2024年 / 14卷 / 01期
关键词
D O I
10.1038/s41598-024-64677-2
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
We present a new learning-based framework S-3D-RCNN that can recover accurate object orientation in SO(3) and simultaneously predict implicit rigid shapes from stereo RGB images. For orientation estimation, in contrast to previous studies that map local appearance to observation angles, we propose a progressive approach by extracting meaningful Intermediate Geometrical Representations (IGRs). This approach features a deep model that transforms perceived intensities from one or two views to object part coordinates to achieve direct egocentric object orientation estimation in the camera coordinate system. To further achieve finer description inside 3D bounding boxes, we investigate the implicit shape estimation problem from stereo images. We model visible object surfaces by designing a point-based representation, augmenting IGRs to explicitly address the unseen surface hallucination problem. Extensive experiments validate the effectiveness of the proposed IGRs, and S-3D-RCNN achieves superior 3D scene understanding performance. We also designed new metrics on the KITTI benchmark for our evaluation of implicit shape estimation.
引用
收藏
页数:19
相关论文
共 50 条
  • [31] Stereo reconstruction of 3D curves
    Sbert, C
    Solé, AF
    15TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 1, PROCEEDINGS: COMPUTER VISION AND IMAGE ANALYSIS, 2000, : 912 - 915
  • [32] 3D point filtering algorithm for 3d object detection based on stereo image processing
    Kim J.-M.
    Park J.-M.
    Lee J.-W.
    Journal of Institute of Control, Robotics and Systems, 2021, 27 (09): : 676 - 684
  • [33] 3DGSR: Implicit Surface Reconstruction with 3D Gaussian Splatting
    Lyu, Xiaoyang
    Sun, Yang-Tian
    Huang, Yi-Hua
    Wu, Xiuzhe
    Yang, Ziyi
    Chen, Yilun
    Pang, Jiangmiao
    Qi, Xiaojuan
    ACM TRANSACTIONS ON GRAPHICS, 2024, 43 (06):
  • [34] Implicit surface reconstruction from noisy 3D scattered data
    Wang, Lihui
    Yuan, Baozong
    Tang, Xiaofang
    2006 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-4, 2006, : 1543 - +
  • [35] ESGN: Efficient Stereo Geometry Network for Fast 3D Object Detection
    Gao, Aqi
    Pang, Yanwei
    Nie, Jing
    Shao, Zhuang
    Cao, Jiale
    Guo, Yishun
    Li, Xuelong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (04) : 2000 - 2009
  • [36] Corn pose estimation using 3D object detection and stereo images
    Gao, Yuliang
    Li, Zhen
    Hong, Qingqing
    Li, Bin
    Zhang, Lifeng
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2025, 231
  • [37] Confidence Guided Stereo 3D Object Detection with Split Depth Estimation
    Li, Chengyao
    Ku, Jason
    Waslander, Steven L.
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 5776 - 5783
  • [38] Pseudo-Stereo for Monocular 3D Object Detection in Autonomous Driving
    Chen, Yi-Nan
    Dai, Hang
    Ding, Yong
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 877 - 887
  • [39] Stereo CenterNet-based 3D object detection for autonomous driving
    Shi, Yuguang
    Guo, Yu
    Mi, Zhenqiang
    Li, Xinjie
    NEUROCOMPUTING, 2022, 471 : 219 - 229
  • [40] A 3D Circular Object Detection Method Based on Binocular Stereo Vision
    Chen, Zhaoxue
    Li, Mengzhuo
    Yu, Haizhong
    2017 10TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI), 2017,