ImVoteNet: Boosting 3D Object Detection in Point Clouds with Image Votes

被引:177
|
作者
Qi, Charles R.
Chen, Xinlei [1 ]
Litany, Or [1 ,2 ]
Guibas, Leonidas J. [1 ,2 ]
机构
[1] Facebook AI, Menlo Pk, CA 94025 USA
[2] Stanford Univ, Stanford, CA 94305 USA
关键词
HOUGH TRANSFORM; DATABASE;
D O I
10.1109/CVPR42600.2020.00446
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
3D object detection has seen quick progress thanks to advances in deep learning on point clouds. A few recent works have even shown state-of-the-art performance with just point clouds input (e.g. VOTENET). However, point cloud data have inherent limitations. They are sparse, lack color information and often suffer from sensor noise. Images, on the other hand, have high resolution and rich texture. Thus they can complement the 3D geometry provided by point clouds. Yet how to effectively use image information to assist point cloud based detection is still an open question. In this work, we build on top of VOTENET and propose a 3D detection architecture called IMVOTENET specialized for RGB-D scenes. IMVOTENET is based on fusing 2D votes in images and 3D votes in point clouds. Compared to prior work on multi-modal detection, we explicitly extract both geometric and semantic features from the 2D images. We leverage camera parameters to lift these features to 3D. To improve the synergy of 2D-3D feature fusion, we also propose a multi-tower training scheme. We validate our model on the challenging SUN RGB-D dataset, advancing state-of-the-art results by 5.7 mAP. We also provide rich ablation studies to analyze the contribution of each design choice.
引用
收藏
页码:4403 / 4412
页数:10
相关论文
共 50 条
  • [41] FuseNet: 3D Object Detection Network with Fused Information for Lidar Point Clouds
    Biao Liu
    Bihao Tian
    Hengyang Wang
    Junchao Qiao
    Zhi Wang
    Neural Processing Letters, 2022, 54 : 5063 - 5078
  • [42] 3D Object Detection Using Frustums and Attention Modules for Images and Point Clouds
    Li, Yiran
    Xie, Han
    Shin, Hyunchul
    SIGNALS, 2021, 2 (01): : 98 - 107
  • [43] TANet: Robust 3D Object Detection from Point Clouds with Triple Attention
    Liu, Zhe
    Zhao, Xin
    Huang, Tengteng
    Hu, Ruolan
    Zhou, Yu
    Bai, Xiang
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11677 - 11684
  • [44] GRNet: Geometric relation network for 3D object detection from point clouds
    Li, Ying
    Ma, Lingfei
    Tan, Weikai
    Sun, Chen
    Cao, Dongpu
    Li, Jonathan
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2020, 165 : 43 - 53
  • [45] Adversarial Training on Point Clouds for Sim-to-Real 3D Object Detection
    DeBortoli, Robert
    Li Fuxin
    Kapoor, Ashish
    Hollinger, Geoffrey A.
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (04): : 6662 - 6669
  • [46] Clusterformer: Cluster-based Transformer for 3D Object Detection in Point Clouds
    Pei, Yu
    Zhao, Xian
    Li, Hao
    Ma, Jingyuan
    Zhang, Jingwei
    Pu, Shiliang
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 6641 - 6650
  • [47] FuseNet: 3D Object Detection Network with Fused Information for Lidar Point Clouds
    Liu, Biao
    Tian, Bihao
    Wang, Hengyang
    Qiao, Junchao
    Wang, Zhi
    NEURAL PROCESSING LETTERS, 2022, 54 (06) : 5063 - 5078
  • [48] PIXOR: Real-time 3D Object Detection from Point Clouds
    Yang, Bin
    Luo, Wenjie
    Urtasun, Raquel
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7652 - 7660
  • [49] Efficient indoor 3D object detection in point clouds using the Kinect sensor
    Zhang, Xuesong
    Guo, Jiaqi
    Song, Cunli
    Zhuang, Yan
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2025, 36 (01)
  • [50] Object Detection in 3D Point Clouds via Local Correlation-Aware Point Embedding
    Wu, Chengzhi
    Pfrommer, Julius
    Beyerer, Juergen
    Li, Kangning
    Neubert, Boris
    2020 JOINT 9TH INTERNATIONAL CONFERENCE ON INFORMATICS, ELECTRONICS & VISION (ICIEV) AND 2020 4TH INTERNATIONAL CONFERENCE ON IMAGING, VISION & PATTERN RECOGNITION (ICIVPR), 2020,