Prototypical VoteNet for Few-Shot 3D Point Cloud Object Detection

被引:0
|
作者
Zhao, Shizhen [1 ]
Qi, Xiaojuan [1 ]
机构
[1] Univ Hong Kong, Hong Kong, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most existing 3D point cloud object detection approaches heavily rely on large amounts of labeled training data. However, the labeling process is costly and time-consuming. This paper considers few-shot 3D point cloud object detection, where only a few annotated samples of novel classes are needed with abundant samples of base classes. To this end, we propose Prototypical VoteNet to recognize and localize novel instances, which incorporates two new modules: Prototypical Vote Module (PVM) and Prototypical Head Module (PHM). Specifically, as the 3D basic geometric structures can be shared among categories, PVM is designed to leverage class-agnostic geometric prototypes, which are learned from base classes, to refine local features of novel categories. Then PHM is proposed to utilize class prototypes to enhance the global feature of each object, facilitating subsequent object localization and classification, which is trained by the episodic training strategy. To evaluate the model in this new setting, we contribute two new benchmark datasets, FS-ScanNet and FS-SUNRGBD. We conduct extensive experiments to demonstrate the effectiveness of Prototypical VoteNet, and our proposed method shows significant and consistent improvements compared to baselines on two benchmark datasets. This project will be available at https://shizhen-zhao.github.io/FS3D_page/.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] CP-VoteNet: Contrastive Prototypical VoteNet for Few-Shot Point Cloud Object Detection
    Li, Xuejing
    Zhang, Weijia
    Ma, Chao
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT VI, 2025, 15036 : 461 - 475
  • [2] Prototypical Variational Autoencoder for Few-shot 3D Point Cloud Object Detection
    Tang, Weiliang
    Yang, Biqi
    Li, Xianzhi
    Heng, Pheng-Ann
    Liu, Yunhui
    Fu, Chi-Wing
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [3] Generalized Few-Shot 3D Point Cloud Segmentation
    Yang, Shuqian
    Ding, Henhui
    Jiang, Xudong
    2024 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS 2024, 2024,
  • [4] Few-shot 3D Point Cloud Semantic Segmentation
    Zhao, Na
    Chua, Tat-Seng
    Lee, Gim Hee
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 8869 - 8878
  • [5] A Closer Look at Few-Shot 3D Point Cloud Classification
    Chuangguan Ye
    Hongyuan Zhu
    Bo Zhang
    Tao Chen
    International Journal of Computer Vision, 2023, 131 : 772 - 795
  • [6] Rethinking Few-shot 3D Point Cloud Semantic Segmentation
    An, Zhaochong
    Sun, Guolei
    Liu, Yun
    Liu, Fayao
    Wu, Zongwei
    Wang, Dan
    Van Gool, Luc
    Belongie, Serge
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, : 3996 - 4006
  • [7] Noisy Few-shot 3D Point Cloud Scene Segmentation
    Huang, Hao
    Yuan, Shuaihang
    Wen, CongCong
    Hao, Yu
    Fang, Yi
    2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2024), 2024, : 11070 - 11077
  • [8] Crossmodal Few-shot 3D Point Cloud Semantic Segmentation
    Zhao, Ziyu
    Wu, Zhenyao
    Wu, Xinyi
    Zhang, Canyu
    Wang, Song
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 4760 - 4768
  • [9] A Closer Look at Few-Shot 3D Point Cloud Classification
    Ye, Chuangguan
    Zhu, Hongyuan
    Zhang, Bo
    Chen, Tao
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 131 (03) : 772 - 795
  • [10] Few-shot 3D Point Cloud Semantic Segmentation with Prototype Alignment
    Wei, Maolin
    PROCEEDINGS OF 2023 8TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING TECHNOLOGIES, ICMLT 2023, 2023, : 195 - 200