Prototypical VoteNet for Few-Shot 3D Point Cloud Object Detection

Cited by: 0
Authors
Zhao, Shizhen [1]
Qi, Xiaojuan [1]
Affiliation
[1] Univ Hong Kong, Hong Kong, Peoples R China
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Most existing 3D point cloud object detection approaches rely heavily on large amounts of labeled training data. However, the labeling process is costly and time-consuming. This paper considers few-shot 3D point cloud object detection, where only a few annotated samples of novel classes are needed, given abundant samples of base classes. To this end, we propose Prototypical VoteNet to recognize and localize novel instances. It incorporates two new modules: the Prototypical Vote Module (PVM) and the Prototypical Head Module (PHM). Specifically, because basic 3D geometric structures can be shared among categories, PVM is designed to leverage class-agnostic geometric prototypes, which are learned from base classes, to refine local features of novel categories. PHM then uses class prototypes to enhance the global feature of each object, facilitating subsequent object localization and classification; it is trained with an episodic training strategy. To evaluate the model in this new setting, we contribute two new benchmark datasets, FS-ScanNet and FS-SUNRGBD. Extensive experiments demonstrate the effectiveness of Prototypical VoteNet: the proposed method shows significant and consistent improvements over baselines on both benchmark datasets. The project will be available at https://shizhen-zhao.github.io/FS3D_page/.
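The abstract does not say how the prototypes are computed or applied. As a rough illustration of the general prototype idea only, the sketch below follows the familiar nearest-prototype scheme: a class prototype is the mean of a few support feature vectors, and a query is assigned to the closest prototype. It is not the PVM or PHM design. The function names, the class labels, and the 2-D toy features are all hypothetical.

```python
from math import dist


def class_prototypes(support):
    """Average the support feature vectors of each class into one prototype.

    `support` maps a class label to a list of equal-length feature vectors.
    (Hypothetical helper for illustration; not taken from the paper.)
    """
    prototypes = {}
    for label, vectors in support.items():
        count = len(vectors)
        # zip(*vectors) iterates over feature dimensions.
        prototypes[label] = [sum(column) / count for column in zip(*vectors)]
    return prototypes


def nearest_prototype(query, prototypes):
    """Return the label whose prototype is closest to `query` (Euclidean)."""
    return min(prototypes, key=lambda label: dist(query, prototypes[label]))


# Toy few-shot episode: two classes, two support samples each (made-up data).
support = {
    "chair": [[1.0, 0.0], [0.8, 0.2]],
    "table": [[0.0, 1.0], [0.2, 0.8]],
}
prototypes = class_prototypes(support)
prediction = nearest_prototype([0.7, 0.1], prototypes)
print(prediction)
```

In an episodic training setup, a small support set like this would typically be resampled for every training step so that the model practices the few-shot condition it faces at test time.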
Pages: 14
Related papers
50 in total
  • [21] Few-Shot Object Detection of drones
    Zou Weibao
    Liu Xindi
    Yang Jitao
    Qu Wei
    INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER AND ENERGY TECHNOLOGIES (ICECET 2021), 2021, : 1030 - 1034
  • [22] Cross-Modality Feature Fusion Network for Few-Shot 3D Point Cloud Classification
    Yang, Minmin
    Chen, Jiajing
    Velipasalar, Senem
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 653 - 662
  • [23] Boosting Few-shot 3D Point Cloud Segmentation via Query-Guided Enhancement
    Ning, Zhenhua
    Tian, Zhuotao
    Lu, Guangming
    Pei, Wenjie
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 1895 - 1904
  • [24] Enhancing Few-Shot 3D Point Cloud Classification With Soft Interaction and Self-Attention
    Khan, Abdullah Aman
    Shao, Jie
    Shafiq, Sidra
    Zhu, Shuyuan
    Shen, Heng Tao
    IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 1127 - 1141
  • [25] Enhancing Few-Shot 3D Point Cloud Semantic Segmentation through Bidirectional Prototype Learning
    Guo, Xuehang
    Hu, Hao
    Yang, Xiaoxi
    Deng, Yancong
    PROCEEDINGS OF 2023 9TH INTERNATIONAL CONFERENCE ON ROBOTICS AND ARTIFICIAL INTELLIGENCE, ICRAI 2023, 2023, : 7 - 16
  • [26] FS-3DSSN: an efficient few-shot learning for single-stage 3D object detection on point clouds
    Tiwari, Alok Kumar
    Sharma, G. K.
    VISUAL COMPUTER, 2024, 40 (11): : 8125 - 8139
  • [27] Distribution Aware VoteNet for 3D Object Detection
    Liang, Junxiong
    An, Pei
    Ma, Jie
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 1583 - 1591
  • [28] IMPLICIT SHAPE BIASED FEW-SHOT LEARNING FOR 3D OBJECT GENERALIZATION
    Prasad, Shitala
    Li, Yiqun
    Lin, Dongyun
    Guo, Aiyuan
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 3436 - 3440
  • [29] Prototypical Networks for Few-shot Learning
    Snell, Jake
    Swersky, Kevin
    Zemel, Richard
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [30] Semantic Transportation Prototypical Network for Few-shot Intent Detection
    Xu, Weiyuan
    Zhou, Peilin
    You, Chenyu
    Zou, Yuexian
    INTERSPEECH 2021, 2021, : 251 - 255