STFNET: Sparse Temporal Fusion for 3D Object Detection in LiDAR Point Cloud

被引:0
|
作者
Meng, Xin [1 ]
Zhou, Yuan [2 ]
Ma, Jun [1 ]
Jiang, Fangdi [1 ]
Qi, Yongze [1 ]
Wang, Cui [3 ]
Kim, Jonghyuk [4 ]
Wang, Shifeng [1 ,3 ]
机构
[1] Changchun Univ Sci & Technol, Sch Optoelect Engn, Changchun 130022, Peoples R China
[2] Leapmotor, Hangzhou 310000, Peoples R China
[3] Changchun Univ Sci & Technol, Zhongshan Inst, Zhongshan 528400, Peoples R China
[4] Naif Arab Univ Secur Sci, Ctr Excellence Cybercrimes & Digital Forens, Riyadh 11452, Saudi Arabia
关键词
Feature extraction; Three-dimensional displays; Point cloud compression; Object detection; Laser radar; History; Sensors; Proposals; Heating systems; Fuses; 3D object detection; autonomous vehicle; LiDAR; point cloud;
D O I
10.1109/JSEN.2024.3519603
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In autonomous driving and robotics, 3D object detection using LiDAR point clouds is a critical task. However, existing single-frame 3D object detection methods face challenges such as noise, occlusions, and sparsity, which degrade detection performance. To address these, we propose the sparse temporal fusion network (STFNet), which leverages multiframe historical information to improve 3D object detection accuracy. The contribution of STFNet contains three core modules: multihistory feature alignment module (MFAM), sparse feature extraction module (SFEM), and temporal fusion transformer (TFformer). MFAM: Ego-motion is used for compensation to align frames, establishing correlations between adjacent frames along the temporal dimension. SFEM: Sparse extraction is performed on features from different time steps to obtain key features within the time series. TFformer: The advanced temporal fusion attention mechanism is introduced to facilitate deep interactions between the current and historical frames. We validated the effectiveness of STFNet on the nuScenes dataset, achieving 71.8% NuScenes detection score (NDS) and 67.0% mean average precision (mAP). Compared to the benchmark method, our method improves 1.6% NDS and 1.5% mAP. Extensive experiments demonstrate that STFNet significantly outperforms most existing methods, highlighting the superiority and generalizability of our approach.
引用
收藏
页码:5866 / 5877
页数:12
相关论文
共 50 条
  • [1] 3D Object Detection with Fusion Point Attention Mechanism in LiDAR Point Cloud
    Liu Weili
    Zhu Deli
    Luo Huahao
    Li Yi
    ACTA PHOTONICA SINICA, 2023, 52 (09)
  • [2] Real Pseudo-Lidar Point Cloud Fusion for 3D Object Detection
    Fan, Xiangsuo
    Xiao, Dachuan
    Cai, Dengsheng
    Ding, Wentao
    ELECTRONICS, 2023, 12 (18)
  • [3] Lidar Point Cloud Guided Monocular 3D Object Detection
    Peng, Liang
    Liu, Fei
    Yu, Zhengxu
    Yan, Senbo
    Deng, Dan
    Yang, Zheng
    Liu, Haifeng
    Cai, Deng
    COMPUTER VISION - ECCV 2022, PT I, 2022, 13661 : 123 - 139
  • [4] Boosting Lidar 3D Object Detection with Point Cloud Semantic Segmentation
    Zhang, Xuchong
    Min, Chong
    Jia, Yijie
    Chen, Liming
    Zhang, Jingmin
    Sun, Hongbin
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 7614 - 7621
  • [5] Monocular 3D Object Detection with Pseudo-LiDAR Point Cloud
    Weng, Xinshuo
    Kitani, Kris
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 857 - 866
  • [6] SemanticAnchors: Sequential Fusion using Lidar Point Cloud and Anchors with Semantic Annotations for 3D Object Detection
    Gao, Zhentong
    Wang, Qiantong
    Pan, Zongxu
    Long, Hui
    Hu, Yuxin
    Li, Zheng
    2022 IEEE 17TH CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA), 2022, : 1128 - 1133
  • [7] Multi-Layer Fusion 3D Object Detection via Lidar Point Cloud and Camera Image
    Guo, Yuhao
    Hu, Hui
    APPLIED SCIENCES-BASEL, 2024, 14 (04):
  • [8] 3D Point Cloud Object Detection Algorithm Based on Temporal Information Fusion and Uncertainty Estimation
    Xie, Guangda
    Li, Yang
    Wang, Yanping
    Li, Ziyi
    Qu, Hongquan
    REMOTE SENSING, 2023, 15 (12)
  • [9] 3D Object Detection Based on Feature Fusion of Point Cloud Sequences
    Zhai, Zhenyu
    Wang, Qiantong
    Pan, Zongxu
    Hu, Wenlong
    Hu, Yuxin
    2022 IEEE 17TH CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA), 2022, : 1240 - 1245
  • [10] End-to-End 3D Object Detection using LiDAR Point Cloud
    Raut, Gaurav
    Patole, Advait
    2024 IEEE 3RD INTERNATIONAL CONFERENCE ON COMPUTING AND MACHINE INTELLIGENCE, ICMI 2024, 2024,