ImVoteNet: Boosting 3D Object Detection in Point Clouds with Image Votes

Cited by: 177
Authors
Qi, Charles R.
Chen, Xinlei [1]
Litany, Or [1, 2]
Guibas, Leonidas J. [1, 2]
Affiliations
[1] Facebook AI, Menlo Park, CA 94025 USA
[2] Stanford University, Stanford, CA 94305 USA
Keywords
HOUGH TRANSFORM; DATABASE;
DOI
10.1109/CVPR42600.2020.00446
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
3D object detection has seen quick progress thanks to advances in deep learning on point clouds. A few recent works have even shown state-of-the-art performance with just point clouds input (e.g. VOTENET). However, point cloud data have inherent limitations. They are sparse, lack color information and often suffer from sensor noise. Images, on the other hand, have high resolution and rich texture. Thus they can complement the 3D geometry provided by point clouds. Yet how to effectively use image information to assist point cloud based detection is still an open question. In this work, we build on top of VOTENET and propose a 3D detection architecture called IMVOTENET specialized for RGB-D scenes. IMVOTENET is based on fusing 2D votes in images and 3D votes in point clouds. Compared to prior work on multi-modal detection, we explicitly extract both geometric and semantic features from the 2D images. We leverage camera parameters to lift these features to 3D. To improve the synergy of 2D-3D feature fusion, we also propose a multi-tower training scheme. We validate our model on the challenging SUN RGB-D dataset, advancing state-of-the-art results by 5.7 mAP. We also provide rich ablation studies to analyze the contribution of each design choice.
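The abstract's step of using camera parameters to lift 2D image cues into 3D can be illustrated with a minimal pinhole-camera sketch. This is not the authors' implementation: the `Intrinsics` type, the function names, and the simplifying assumption that the voted object center lies at the same depth as the voting point are all illustrative choices made here.

```python
from typing import NamedTuple, Tuple

Point3 = Tuple[float, float, float]


class Intrinsics(NamedTuple):
    """Hypothetical pinhole camera parameters (focal lengths, principal point)."""
    fx: float
    fy: float
    cx: float
    cy: float


def backproject(u: float, v: float, depth: float, k: Intrinsics) -> Point3:
    """Map a pixel (u, v) with known depth to a 3D point in the camera frame."""
    x = (u - k.cx) * depth / k.fx
    y = (v - k.cy) * depth / k.fy
    return (x, y, depth)


def lift_vote(pixel: Tuple[float, float],
              center: Tuple[float, float],
              depth: float,
              k: Intrinsics) -> Point3:
    """Turn a 2D vote (pixel -> 2D object center) into a 3D offset.

    Assumption made for this sketch: the object center has the same depth
    as the voting point, so the offset has no component along the z axis.
    """
    du = center[0] - pixel[0]
    dv = center[1] - pixel[1]
    return (du * depth / k.fx, dv * depth / k.fy, 0.0)


if __name__ == "__main__":
    cam = Intrinsics(fx=500.0, fy=500.0, cx=320.0, cy=240.0)
    point = backproject(320.0, 240.0, 2.0, cam)
    vote = lift_vote((320.0, 240.0), (420.0, 190.0), 2.0, cam)
    print("3D point:", point)
    print("lifted vote:", vote)
```

Under the equal-depth assumption, the lifted vote equals the difference between the back-projected 2D center and the back-projected pixel, which is why the offset can be computed from the pixel displacement alone.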
Pages: 4403-4412
Number of pages: 10