STFNET: Sparse Temporal Fusion for 3D Object Detection in LiDAR Point Cloud

被引:0
|
作者
Meng, Xin [1 ]
Zhou, Yuan [2 ]
Ma, Jun [1 ]
Jiang, Fangdi [1 ]
Qi, Yongze [1 ]
Wang, Cui [3 ]
Kim, Jonghyuk [4 ]
Wang, Shifeng [1 ,3 ]
机构
[1] Changchun Univ Sci & Technol, Sch Optoelect Engn, Changchun 130022, Peoples R China
[2] Leapmotor, Hangzhou 310000, Peoples R China
[3] Changchun Univ Sci & Technol, Zhongshan Inst, Zhongshan 528400, Peoples R China
[4] Naif Arab Univ Secur Sci, Ctr Excellence Cybercrimes & Digital Forens, Riyadh 11452, Saudi Arabia
关键词
Feature extraction; Three-dimensional displays; Point cloud compression; Object detection; Laser radar; History; Sensors; Proposals; Heating systems; Fuses; 3D object detection; autonomous vehicle; LiDAR; point cloud;
D O I
10.1109/JSEN.2024.3519603
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In autonomous driving and robotics, 3D object detection using LiDAR point clouds is a critical task. However, existing single-frame 3D object detection methods face challenges such as noise, occlusions, and sparsity, which degrade detection performance. To address these, we propose the sparse temporal fusion network (STFNet), which leverages multiframe historical information to improve 3D object detection accuracy. The contribution of STFNet contains three core modules: multihistory feature alignment module (MFAM), sparse feature extraction module (SFEM), and temporal fusion transformer (TFformer). MFAM: Ego-motion is used for compensation to align frames, establishing correlations between adjacent frames along the temporal dimension. SFEM: Sparse extraction is performed on features from different time steps to obtain key features within the time series. TFformer: The advanced temporal fusion attention mechanism is introduced to facilitate deep interactions between the current and historical frames. We validated the effectiveness of STFNet on the nuScenes dataset, achieving 71.8% NuScenes detection score (NDS) and 67.0% mean average precision (mAP). Compared to the benchmark method, our method improves 1.6% NDS and 1.5% mAP. Extensive experiments demonstrate that STFNet significantly outperforms most existing methods, highlighting the superiority and generalizability of our approach.
引用
收藏
页码:5866 / 5877
页数:12
相关论文
共 50 条
  • [31] F-Transformer: Point Cloud Fusion Transformer for Cooperative 3D Object Detection
    Wang, Jie
    Luo, Guiyang
    Yuan, Quan
    Li, Jinglin
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT I, 2022, 13529 : 171 - 182
  • [32] A LiDAR-Camera Fusion 3D Object Detection Algorithm
    Liu, Leyuan
    He, Jian
    Ren, Keyan
    Xiao, Zhonghua
    Hou, Yibin
    INFORMATION, 2022, 13 (04)
  • [33] Stereo Point Cloud Refinement for 3D Object Detection
    Liu, Wangchao
    Wang, Teng
    Wang, Yang
    Zhang, Xiangyu
    Lou, Xin
    2021 IEEE ASIA PACIFIC CONFERENCE ON CIRCUITS AND SYSTEMS (APCCAS 2021) & 2021 IEEE CONFERENCE ON POSTGRADUATE RESEARCH IN MICROELECTRONICS AND ELECTRONICS (PRIMEASIA 2021), 2021, : 61 - 64
  • [34] A Lightweight Model for 3D Point Cloud Object Detection
    Li, Ziyi
    Li, Yang
    Wang, Yanping
    Xie, Guangda
    Qu, Hongquan
    Lyu, Zhuoyang
    APPLIED SCIENCES-BASEL, 2023, 13 (11):
  • [35] 3D object detection in voxelized point cloud scene
    Li Rui-long
    Wu Chuan
    Zhu Ming
    CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2022, 37 (10) : 1355 - 1363
  • [36] VPFNet: Improving 3D Object Detection With Virtual Point Based LiDAR and Stereo Data Fusion
    Zhu, Hanqi
    Deng, Jiajun
    Zhang, Yu
    Ji, Jianmin
    Mao, Qiuyu
    Li, Houqiang
    Zhang, Yanyong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 5291 - 5304
  • [37] DenseSphere: Multimodal 3D object detection under a sparse point cloud based on spherical coordinate
    Jung, Jong Won
    Yoon, Jae Hyun
    Yoo, Seok Bong
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 251
  • [38] PFENet: Towards precise feature extraction from sparse point cloud for 3D object detection
    Li, Yaochen
    Li, Qiao
    Gao, Cong
    Gao, Shengjing
    Wu, Hao
    Liu, Rui
    NEURAL NETWORKS, 2025, 185
  • [39] Boosting Sparse Point Cloud Object Detection via Image Fusion
    Shi, Weijing
    Rajkumar, Ragunathan
    2021 IEEE THIRD INTERNATIONAL CONFERENCE ON COGNITIVE MACHINE INTELLIGENCE (COGMI 2021), 2021, : 214 - 220
  • [40] SemanticVoxels: Sequential Fusion for 3D Pedestrian Detection using LiDAR Point Cloud and Semantic Segmentation
    Fei, Juncong
    Chen, Wenbo
    Heidenreich, Philipp
    Wirges, Sascha
    Stiller, Christoph
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTISENSOR FUSION AND INTEGRATION FOR INTELLIGENT SYSTEMS (MFI), 2020, : 185 - 190