Sliding Shapes for 3D Object Detection in Depth Images

被引:0
|
作者
Song, Shuran [1 ]
Xiao, Jianxiong [1 ]
机构
[1] Princeton Univ, Princeton, NJ 08544 USA
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The depth information of RGB-D sensors has greatly simplified some common challenges in computer vision and enabled breakthroughs for several tasks. In this paper, we propose to use depth maps for object detection and design a 3D detector to overcome the major difficulties for recognition, namely the variations of texture, illumination, shape, viewpoint, clutter, occlusion, self-occlusion and sensor noises. We take a collection of 3D CAD models and render each CAD model from hundreds of viewpoints to obtain synthetic depth maps. For each depth rendering, we extract features from the 3D point cloud and train an Exemplar-SVM classifier. During testing and hard-negative mining, we slide a 3D detection window in 3D space. Experiment results show that our 3D detector significantly outperforms the state-of-the-art algorithms for both RGB and RGBD images, and achieves about x1.7 improvement on average precision compared to DPM and R-CNN. All source code and data are available online.
引用
收藏
页码:634 / 651
页数:18
相关论文
共 50 条
  • [31] Boosting Monocular 3D Object Detection With Object-Centric Auxiliary Depth Supervision
    Kim, Youngseok
    Kim, Sanmin
    Sim, Sangmin
    Choi, Jun Won
    Kum, Dongsuk
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (02) : 1801 - 1813
  • [32] 3D Object Depth Extraction by use of Elemental Images and Its Computationally Reconstructed Plane Images
    Li, Gen
    Hwang, Dong-Choon
    Jin, Fushou
    Kim, Eun-Soo
    PROCEEDINGS OF 2008 INTERNATIONAL PRE-OLYMPIC CONGRESS ON COMPUTER SCIENCE, VOL II: INFORMATION SCIENCE AND ENGINEERING, 2008, : 193 - 198
  • [33] Design of Class in Unknown Object Segmentation Focusing on 3D Object Detection in Depth Image
    Amemiya, Tatsuya
    Tasaki, Tsuyoshi
    2021 IEEE/SICE INTERNATIONAL SYMPOSIUM ON SYSTEM INTEGRATION (SII), 2021, : 706 - 707
  • [34] Estimation of depth and 3D motion parameter of moving object with multiple stereo images
    Yi, JW
    Oh, JH
    IMAGE AND VISION COMPUTING, 1996, 14 (07) : 501 - 516
  • [35] MDS-Net: Multi-Scale Depth Stratification 3D Object Detection from Monocular Images
    Xie, Zhouzhen
    Song, Yuying
    Wu, Jingxuan
    Li, Zecheng
    Song, Chunyi
    Xu, Zhiwei
    SENSORS, 2022, 22 (16)
  • [36] Learning Depth-Guided Convolutions for Monocular 3D Object Detection
    Ng, Mingyu
    Huo, Yuqi
    Yi, Hongwei
    Wang, Zhe
    Shi, Jianping
    Lu, Zhiwu
    Luo, Ping
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 4306 - 4315
  • [37] Monocular 3D object detection with thermodynamic loss and decoupled instance depth
    Liu, Gang
    Xie, Xiaoxiao
    Yu, Qingchen
    CONNECTION SCIENCE, 2024, 36 (01)
  • [38] MonoDTR: Monocular 3D Object Detection with Depth-Aware Transformer
    Huang, Kuan-Chih
    Wu, Tsung-Han
    Su, Hung-Ting
    Hsu, Winston H.
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 4002 - 4011
  • [39] MonoDFNet: Monocular 3D Object Detection with Depth Fusion and Adaptive Optimization
    Gao, Yuhan
    Wang, Peng
    Li, Xiaoyan
    Sun, Mengyu
    Di, Ruohai
    Li, Liangliang
    Hong, Wei
    SENSORS, 2025, 25 (03)
  • [40] Exploiting Ground Depth Estimation for Mobile Monocular 3D Object Detection
    Zhou, Yunsong
    Liu, Quan
    Zhu, Hongzi
    Li, Yunzhe
    Chang, Shan
    Guo, Minyi
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, 47 (04) : 3079 - 3093