3DSSD: Point-based 3D Single Stage Object Detector

被引:710
|
作者
Yang, Zetong [1 ]
Sun, Yanan [2 ]
Liu, Shu [3 ]
Jia, Jiaya [1 ,3 ]
机构
[1] Chinese Univ Hong Kong, Hong Kong, Peoples R China
[2] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
[3] SmartMore, Hong Kong, Peoples R China
关键词
D O I
10.1109/CVPR42600.2020.01105
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Prevalence of voxel-based 3D single-stage detectors contrast with underexplored point-based methods. In this paper, we present a lightweight point-based 3D single stage object detector 3DSSD to achieve decent balance of accuracy and efficiency. In this paradigm, all upsampling layers and the refinement stage, which are indispensable in all existing point-based methods, are abandoned. We instead propose a fusion sampling strategy in downsampling process to make detection on less representative points feasible. A delicate box prediction network, including a candidate generation layer and an anchor-free regression head with a 3D center-ness assignment strategy, is developed to meet the demand of high accuracy and speed. Our 3DSSD paradigm is an elegant single-stage anchor-free one. We evaluate it on widely used KITTI dataset and more challenging nuScenes dataset. Our method outperforms all state-of-the-art voxel-based single-stage methods by a large margin, and even yields comparable performance with two-stage point-based methods, with amazing inference speed of 25+ FPS, 2x faster than former state-of-the-art point-based methods.
引用
收藏
页码:11037 / 11045
页数:9
相关论文
共 50 条
  • [41] Generation of point-based 3D statistical shape models for anatomical objects
    Lorenz, C
    Krahnstöver, N
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2000, 77 (02) : 175 - 191
  • [42] Point-Based Deep Neural Network for 3D Facial Expression Recognition
    Trimech, Imen Hamrouni
    Maalej, Ahmed
    Ben Amara, Najoua Essoukri
    2020 INTERNATIONAL CONFERENCE ON CYBERWORLDS (CW 2020), 2020, : 164 - 171
  • [43] A Single Stage and Single View 3D Point Cloud Reconstruction Network Based on DetNet
    Li, Bin
    Zhu, Shiao
    Lu, Yi
    SENSORS, 2022, 22 (21)
  • [44] Evolutionary optimization of feature representation for 3D point-based model classification
    Tong, Xin
    Wong, Hau-san
    Ma, Bo
    Ip, Horace H. S.
    18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, PROCEEDINGS, 2006, : 707 - +
  • [45] HVPR: Hybrid Voxel-Point Representation for Single-stage 3D Object Detection
    Noh, Jongyoun
    Lee, Sanghoon
    Ham, Bumsub
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 14600 - 14609
  • [46] Efficient Point-based Pattern Search in 3D Motion Capture Databases
    Beecks, Christian
    Grass, Alexander
    2018 IEEE 6TH INTERNATIONAL CONFERENCE ON FUTURE INTERNET OF THINGS AND CLOUD (FICLOUD 2018), 2018, : 230 - 235
  • [47] Paraxial Geometric Optics in 3D Through Point-Based Geometric Algebra
    Dorst, Leo
    ADVANCES IN COMPUTER GRAPHICS, CGI 2023, PT IV, 2024, 14498 : 340 - 354
  • [48] Feature Point-Based 3D Mesh Watermarking that Withstands the Cropping Attack
    Sales, Mireia Montanola
    Darazi, Rony
    Giard, Joachim
    Alface, Patrice Rondao
    Macq, Benoit
    MEDIA WATERMARKING, SECURITY, AND FORENSICS III, 2011, 7880
  • [49] VFL3D: A Single-Stage Fine-Grained Lightweight Point Cloud 3D Object Detection Algorithm Based on Voxels
    Li, Bing
    Chen, Jie
    Li, Xinde
    Xu, Rui
    Li, Qian
    Cao, Yice
    Wu, Jun
    Qu, Lei
    Li, Yingsong
    Diniz, Paulo S. R.
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (09) : 12034 - 12048
  • [50] Improved 3D Object Detector Under Snowfall Weather Condition Based on LiDAR Point Cloud
    Lin, Jia
    Yin, Huilin
    Yan, Jun
    Ge, Wancheng
    Zhang, Hao
    Rigoll, Gerhard
    IEEE SENSORS JOURNAL, 2022, 22 (16) : 16276 - 16292