Fully Sparse 3D Occupancy Prediction

Cited by: 0
Authors
Liu, Haisong [1, 2]
Chen, Yang [1]
Wang, Haiguang [1]
Yang, Zetong [2]
Li, Tianyu [2]
Zeng, Jia [2]
Chen, Li [2]
Li, Hongyang [2]
Wang, Limin [1, 2]
Affiliations
[1] Nanjing University, State Key Laboratory for Novel Software Technology, Nanjing, People's Republic of China
[2] Shanghai AI Laboratory, Shanghai, People's Republic of China
Source
Funding
National Key Research and Development Program of China; National Natural Science Foundation of China
Keywords
3D Occupancy Estimation; Semantic Scene Completion; 3D Reconstruction; Autonomous Driving;
DOI
10.1007/978-3-031-72698-9_4
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Occupancy prediction plays a pivotal role in autonomous driving. Previous methods typically construct dense 3D volumes, neglecting the inherent sparsity of the scene and incurring high computational costs. To bridge this gap, we introduce a novel fully sparse occupancy network, termed SparseOcc. SparseOcc first reconstructs a sparse 3D representation from visual inputs and then predicts semantic/instance occupancy from that sparse representation using sparse queries. A mask-guided sparse sampling is designed to let the sparse queries interact with 2D features in a fully sparse manner, thereby avoiding costly dense features or global attention. Additionally, we design a ray-based evaluation metric, namely RayIoU, to address the inconsistent penalty along depth that arises in the traditional voxel-level mIoU criterion. SparseOcc achieves a RayIoU of 34.0 while maintaining a real-time inference speed of 17.3 FPS with 7 history frames as input. Increasing the number of preceding frames to 15 further improves performance to 35.1 RayIoU without bells and whistles. Code is available at https://github.com/MCG-NJU/SparseOcc.
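The abstract contrasts RayIoU with the traditional voxel-level mIoU criterion but does not define either. As a point of reference, the following is a minimal sketch of the standard voxel-level mIoU only, not the authors' evaluation code and not RayIoU. The function name `voxel_miou`, its parameters, and the toy labels are hypothetical and chosen for illustration.

```python
def voxel_miou(pred, gt, num_classes, ignore_label=None):
    """Mean IoU over classes for two equally sized flat lists of voxel labels.

    Illustrative sketch of the standard voxel-level metric; all names here
    are assumptions, not taken from the SparseOcc code base.
    """
    if len(pred) != len(gt):
        raise ValueError("pred and gt must contain the same number of voxels")

    intersection = [0] * num_classes
    union = [0] * num_classes
    for p, g in zip(pred, gt):
        if g == ignore_label:
            continue  # skip voxels excluded from evaluation
        if p == g:
            # voxel belongs to both the predicted and the true set of class g
            intersection[g] += 1
            union[g] += 1
        else:
            # voxel belongs to the predicted set of p and the true set of g
            union[p] += 1
            union[g] += 1

    # average only over classes that appear in pred or gt
    ious = [intersection[c] / union[c] for c in range(num_classes) if union[c] > 0]
    return sum(ious) / len(ious) if ious else 0.0


# Toy example: five voxels along one viewing direction, 0 = free, 1 = occupied.
# The predicted surface is one voxel deeper than the true surface.
gt = [0, 0, 0, 1, 0]
pred = [0, 0, 0, 0, 1]
score = voxel_miou(pred, gt, num_classes=2)
```

In this toy case the occupied class has zero overlap, so a one-voxel shift in depth scores the same for that class as a prediction that misses the surface entirely. This is one plausible reading of the depth-related penalty the abstract refers to; the paper itself should be consulted for the exact argument and for the definition of RayIoU.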
Pages: 54-71
Page count: 18
Related papers
50 in total
  • [1] Fully Sparse 3D Object Detection
    Fan, Lue
    Wang, Feng
    Wang, Naiyan
    Zhang, Zhaoxiang
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [2] Fully Sparse Fusion for 3D Object Detection
    Li, Yingyan
    Fan, Lue
    Liu, Yang
    Huang, Zehao
    Chen, Yuntao
    Wang, Naiyan
    Zhang, Zhaoxiang
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (11) : 7217 - 7231
  • [3] COTR: Compact Occupancy TRansformer for Vision-based 3D Occupancy Prediction
    Ma, Qihang
    Tan, Xin
    Qu, Yanyun
    Ma, Lizhuang
    Zhang, Zhizhong
    Xie, Yuan
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 19936 - 19945
  • [4] LinkOcc: 3D Semantic Occupancy Prediction With Temporal Association
    Ouyang, Wenzhe
    Xu, Zenglin
    Shen, Bin
    Wang, Jinghua
    Xu, Yong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (02) : 1374 - 1384
  • [5] VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking
    Chen, Yukang
    Liu, Jianhui
    Zhang, Xiangyu
    Qi, Xiaojuan
    Jia, Jiaya
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 21674 - 21683
  • [6] OctOcc: High-Resolution 3D Occupancy Prediction with Octree
    Ouyang, Wenzhe
    Song, Xiaolin
    Feng, Bailan
    Xu, Zenglin
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 5, 2024, : 4369 - 4377
  • [7] Real-time 3D semantic occupancy prediction for autonomous vehicles using memory-efficient sparse convolution
    Sze, Samuel
    Kunze, Lars
    2024 35TH IEEE INTELLIGENT VEHICLES SYMPOSIUM, IEEE IV 2024, 2024, : 1286 - 1293
  • [8] A Fully Automatic Framework for Prediction of 3D Facial Rejuvenation
    Shah, Syed Afaq Ali
    Bennamoun, Mohammed
    Molton, Michael
    2018 INTERNATIONAL CONFERENCE ON IMAGE AND VISION COMPUTING NEW ZEALAND (IVCNZ), 2018,
  • [9] SAFDNet: A Simple and Effective Network for Fully Sparse 3D Object Detection
    Zhang, Gang
    Chen, Junnan
    Gao, Guohuan
    Li, Jianmin
    Liu, Si
    Hu, Xiaolin
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 14477 - 14486
  • [10] POP-3D: Open-Vocabulary 3D Occupancy Prediction from Images
    Vobecky, Antonin
    Simeoni, Oriane
    Hurych, David
    Gidaris, Spyros
    Bursuc, Andrei
    Perez, Patrick
    Sivic, Josef
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,