VFL3D: A Single-Stage Fine-Grained Lightweight Point Cloud 3D Object Detection Algorithm Based on Voxels

被引:3
|
作者
Li, Bing [1 ,2 ,3 ]
Chen, Jie [4 ,5 ]
Li, Xinde [3 ,6 ,7 ]
Xu, Rui [2 ]
Li, Qian [2 ]
Cao, Yice [2 ]
Wu, Jun [2 ]
Qu, Lei [2 ]
Li, Yingsong [2 ]
Diniz, Paulo S. R. [8 ,9 ]
机构
[1] Southeast Univ, Sch Automat, Nanjing 210096, Peoples R China
[2] Anhui Univ, Sch Elect & Informat Engn, Hefei 230601, Peoples R China
[3] Nanjing Ctr Appl Math, Nanjing 211135, Peoples R China
[4] Anhui Univ, Informat Mat & Intelligent Sensing Lab Anhui Prov, Hefei 230601, Peoples R China
[5] China Elect Technol Grp Corp, Res Inst 38, Hefei 230088, Peoples R China
[6] Southeast Univ, Sch Automat, Key Lab Measurement & Control CSE, Nanjing 210096, Peoples R China
[7] Southeast Univ, Shenzhen Res Inst, Shenzhen 518063, Peoples R China
[8] Univ Fed Rio de Janeiro, Program Elect Engn, COPPE Poli, BR-21941909 Rio De Janeiro, Brazil
[9] Univ Fed Rio de Janeiro, Dept Elect & Comp Engn, COPPE Poli, BR-21941909 Rio De Janeiro, Brazil
基金
中国国家自然科学基金;
关键词
Feature extraction; Point cloud compression; Three-dimensional displays; Object detection; Convolution; Data mining; Computational efficiency; Single-stage; fine-grained; lightweight; multibranch cross-sparse convolution network; compact fine-grained self-attention augmented module;
D O I
10.1109/TITS.2024.3373227
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
In this work, we propose a voxel-based single-stage fine-grained and efficient point cloud 3D object detection algorithm to address the inadequate granularity in point cloud feature extraction tasks and the imbalance between efficiency and accuracy in single-stage point cloud 3D object detection scenarios. We develop a lightweight multibranch cross-sparse convolution network (LMCCN) that is designed to preserve the feature granularity of the original point cloud while achieving enhanced extraction efficiency. Additionally, we introduce a compact fine-grained self-attention augmented bird's eye view (BEV) feature extraction module (CFSAM). This module aims to further refine BEV features, enabling the acquisition of both locally and globally enhanced features and thereby augmentingthe perceptual capabilities of the constructed model. Without bells and whistles, the proposed method attains excellent performance on many autonomous driving benchmarks, with detection accuracies of up to 81.67% on KITTI, 72.74% on ONCE, and 84.00% on nuScenes. Moreover, it reaches a peak detection speed of 46.08 FPS, effectively balancing accuracy with speed.
引用
收藏
页码:12034 / 12048
页数:15
相关论文
共 50 条
  • [41] 3D Guided Fine-Grained Face Manipulation
    Geng, Zhenglin
    Cao, Chen
    Tulyakov, Sergey
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9813 - 9822
  • [42] 3D Backscatter Localization for Fine-Grained Robotics
    Luo, Zhihong
    Zhang, Qiping
    Ma, Yunfei
    Singh, Manish
    Adib, Fadel
    PROCEEDINGS OF THE 16TH USENIX SYMPOSIUM ON NETWORKED SYSTEMS DESIGN AND IMPLEMENTATION, 2019, : 765 - 781
  • [43] Fine-Grained Categorization for 3D Scene Understanding
    Stark, Michael
    Krause, Jonathan
    Pepik, Bojan
    Meger, David
    Little, James J.
    Schiele, Bernt
    Koller, Daphne
    PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2012, 2012,
  • [44] Fine-grained metals from 3D printing
    Clarke, Amy J.
    NATURE, 2019, 576 (7785) : 41 - 42
  • [45] Coarse to fine-based image-point cloud fusion network for 3D object detection
    Hao, Meilan
    Zhang, Zhongkang
    Li, Lei
    Dong, Kejian
    Cheng, Long
    Tiwari, Prayag
    Ning, Xin
    INFORMATION FUSION, 2024, 112
  • [46] Fine-Grained Patch Segmentation and Rasterization for 3-D Point Cloud Attribute Compression
    Zhao, Baoquan
    Lin, Weisi
    Lv, Chenlei
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (12) : 4590 - 4602
  • [47] FS-3DSSN: an efficient few-shot learning for single-stage 3D object detection on point clouds
    Tiwari, Alok Kumar
    Sharma, G. K.
    VISUAL COMPUTER, 2024, 40 (11): : 8125 - 8139
  • [48] Object Volume Estimation Based on 3D Point Cloud
    Chang, Wen-Chung
    Wu, Chia-Hung
    Tsai, Ya-Hui
    Chiu, Wei-Yao
    2017 INTERNATIONAL AUTOMATIC CONTROL CONFERENCE (CACS), 2017,
  • [49] 3DSSD: Point-based 3D Single Stage Object Detector
    Yang, Zetong
    Sun, Yanan
    Liu, Shu
    Jia, Jiaya
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, : 11037 - 11045
  • [50] DCGNN: a single-stage 3D object detection network based on density clustering and graph neural network
    Xiong, Shimin
    Li, Bin
    Zhu, Shiao
    COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (03) : 3399 - 3408