FS-3DSSN: an efficient few-shot learning for single-stage 3D object detection on point clouds

被引:1
|
作者
Tiwari, Alok Kumar [1 ]
Sharma, G. K. [1 ]
机构
[1] ABV Indian Inst Informat Technol & Management, Dept Informat Technol, Gwalior, India
来源
VISUAL COMPUTER | 2024年 / 40卷 / 11期
关键词
Point cloud; 3D object detection; Few-shot learning; Single-stage detector; Autonomous driving;
D O I
10.1007/s00371-023-03228-8
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The current 3D object detectionmethods have achieved promising results for conventional tasks to detect frequently occurring objects like cars, pedestrians and cyclists. However, they require many annotated boundary boxes and class labels for training, which is very expensive and hard to obtain. Nevertheless, detecting infrequent occurring objects, such as police vehicles, is also essential for autonomous driving to be successful. Therefore, we explore the potential of few-shot learning to handle this challenge of detecting infrequent categories. The current 3D object detectors do not have the necessary architecture to support this type of learning. Thus, this paper presents a new method termed few-shot single-stage network for 3D object detection (FS-3DSSN) to predict infrequent categories of objects. FS-3DSSN uses a class-incremental few-shot learning approach to detect infrequent categories without compromising the detection accuracy of frequent categories. It consists of twomodules: (i) a single-stage network architecture for 3D object detection (3DSSN) using deformable convolutions to detect small objects and (ii) a class-incremental-based meta-learning module to learn and predict infrequent class categories. 3DSSN obtained 84.53 mAP(3D) on the KITTI car category and 73.4 NDS on the nuScenes dataset, outperforming previous state of the art. Further, the result of FS-3DSSN on nuScenes is also encouraging for detecting infrequent categories while maintaining accuracy in frequent classes.
引用
收藏
页码:8125 / 8139
页数:15
相关论文
共 50 条
  • [21] A Closer Look at Few-Shot 3D Point Cloud Classification
    Ye, Chuangguan
    Zhu, Hongyuan
    Zhang, Bo
    Chen, Tao
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 131 (03) : 772 - 795
  • [22] Few-shot 3D Point Cloud Semantic Segmentation with Prototype Alignment
    Wei, Maolin
    PROCEEDINGS OF 2023 8TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING TECHNOLOGIES, ICMLT 2023, 2023, : 195 - 200
  • [23] Hyperbolic prototype rectification for few-shot 3D point cloud classification
    Feng, Yuan-Zhi
    Lin, Shing-Ho J.
    Tang, Xuan
    Wang, Mu-Yu
    Zheng, Jian-Zhang
    He, Zi-Yao
    Pang, Zi-Yi
    Yang, Jian
    Chen, Ming-Song
    Wei, Xian
    PATTERN RECOGNITION, 2025, 158
  • [24] A Simple Framework of Few-Shot Learning Using Sparse Annotations for Semantic Segmentation of 3-D Point Clouds
    Huang, Rong
    Gao, Yang
    Xu, Yusheng
    Hoegner, Ludwig
    Tong, Xiaohua
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 5147 - 5158
  • [25] Enhancing Few-Shot 3D Point Cloud Semantic Segmentation through Bidirectional Prototype Learning
    Guo, Xuehang
    Hu, Hao
    Yang, Xiaoxi
    Deng, Yancong
    PROCEEDINGS OF 2023 9TH INTERNATIONAL CONFERENCE ON ROBOTICS AND ARTIFICIAL INTELLIGENCE, ICRAI 2023, 2023, : 7 - 16
  • [26] Efficient indoor 3D object detection in point clouds using the Kinect sensor
    Zhang, Xuesong
    Guo, Jiaqi
    Song, Cunli
    Zhuang, Yan
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2025, 36 (01)
  • [27] Few-Shot Segmentation of 3D Point Clouds Under Real-World Distributional Shifts in Railroad Infrastructure
    Fayjie, Abdur R.
    Lens, Mathijs
    Vandewalle, Patrick
    SENSORS, 2025, 25 (04)
  • [28] A Single-Stage 3D Object Detection Method Based on Sparse Attention Mechanism
    Jia, Songche
    Zhang, Zhenyu
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT III, 2024, 14427 : 414 - 425
  • [29] Rethinking IoU-based Optimization for Single-stage 3D Object Detection
    Sheng, Hualian
    Cai, Sijia
    Zhao, Na
    Deng, Bing
    Huang, Jianqiang
    Hua, Xian-Sheng
    Zhao, Min-Jian
    Lee, Gim Hee
    COMPUTER VISION, ECCV 2022, PT IX, 2022, 13669 : 544 - 561
  • [30] SMOKE: Single-Stage Monocular 3D Object Detection via Keypoint Estimation
    Liu, Zechen
    Wu, Zizhang
    Toth, Roland
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 4289 - 4298