Sliding Shapes for 3D Object Detection in Depth Images

被引:0
|
作者
Song, Shuran [1 ]
Xiao, Jianxiong [1 ]
机构
[1] Princeton Univ, Princeton, NJ 08544 USA
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The depth information of RGB-D sensors has greatly simplified some common challenges in computer vision and enabled breakthroughs for several tasks. In this paper, we propose to use depth maps for object detection and design a 3D detector to overcome the major difficulties for recognition, namely the variations of texture, illumination, shape, viewpoint, clutter, occlusion, self-occlusion and sensor noises. We take a collection of 3D CAD models and render each CAD model from hundreds of viewpoints to obtain synthetic depth maps. For each depth rendering, we extract features from the 3D point cloud and train an Exemplar-SVM classifier. During testing and hard-negative mining, we slide a 3D detection window in 3D space. Experiment results show that our 3D detector significantly outperforms the state-of-the-art algorithms for both RGB and RGBD images, and achieves about x1.7 improvement on average precision compared to DPM and R-CNN. All source code and data are available online.
引用
收藏
页码:634 / 651
页数:18
相关论文
共 50 条
  • [41] Confidence Guided Stereo 3D Object Detection with Split Depth Estimation
    Li, Chengyao
    Ku, Jason
    Waslander, Steven L.
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 5776 - 5783
  • [42] CoBEV: Elevating Roadside 3D Object Detection With Depth and Height Complementarity
    Shi, Hao
    Pang, Chengshan
    Zhang, Jiaming
    Yang, Kailun
    Wu, Yuhao
    Ni, Huajian
    Lin, Yining
    Stiefelhagen, Rainer
    Wang, Kaiwei
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 5424 - 5439
  • [43] Depth-discriminative Metric Learning for Monocular 3D Object Detection
    Choi, Wonhyeok
    Shin, Mingyu
    Im, Sunghoon
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [44] Depth dynamic center difference convolutions for monocular 3D object detection
    Wu, Xinyu
    Ma, Dongliang
    Qu, Xin
    Jiang, Xin
    Zeng, Dan
    NEUROCOMPUTING, 2023, 520 : 73 - 81
  • [45] Task-Aware Monocular Depth Estimation for 3D Object Detection
    Wang, Xinlong
    Yin, Wei
    Kong, Tao
    Jiang, Yuning
    Li, Lei
    Shen, Chunhua
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12257 - 12264
  • [46] MonoDETR: Depth-guided Transformer for Monocular 3D Object Detection
    Zhang, Renrui
    Qiu, Han
    Wang, Tai
    Guo, Ziyu
    Cui, Ziteng
    Qiao, Yu
    Li, Hongsheng
    Gao, Peng
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 9121 - 9132
  • [47] ON 3D RECONSTRUCTION OF ROTATING-OBJECT SHAPES
    ANDROSOV, AM
    VYGON, VG
    MALIKOV, SN
    TIKHONOV, EF
    OPTIKA I SPEKTROSKOPIYA, 1989, 66 (04): : 914 - 916
  • [48] 2D and 3D object detection algorithms from images: A Survey
    Chen, Wei
    Li, Yan
    Tian, Zijian
    Zhang, Fan
    ARRAY, 2023, 19
  • [49] Expandable YOLO: 3D Object Detection from RGB-D Images
    Takahashi, Masahiro
    Ji, Yonghoon
    Umeda, Kazunori
    Moro, Alessandro
    2020 21ST INTERNATIONAL CONFERENCE ON RESEARCH AND EDUCATION IN MECHATRONICS (REM), 2020,
  • [50] DID-M3D: Decoupling Instance Depth for Monocular 3D Object Detection
    Peng, Liang
    Wu, Xiaopei
    Yang, Zheng
    Liu, Haifeng
    Cai, Deng
    COMPUTER VISION - ECCV 2022, PT I, 2022, 13661 : 71 - 88