Fully Sparse 3D Occupancy Prediction

被引:0
|
作者
Liu, Haisong [1 ,2 ]
Chen, Yang [1 ]
Wang, Haiguang [1 ]
Yang, Zetong [2 ]
Li, Tianyu [2 ]
Zeng, Jia [2 ]
Chen, Li [2 ]
Li, Hongyang [2 ]
Wang, Limin [1 ,2 ]
机构
[1] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing, Peoples R China
[2] Shanghai AI Lab, Shanghai, Peoples R China
来源
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
3D Occupancy Estimation; Semantic Scene Completion; 3D Reconstruction; Autonomous Driving;
D O I
10.1007/978-3-031-72698-9_4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Occupancy prediction plays a pivotal role in autonomous driving. Previous methods typically construct dense 3D volumes, neglecting the inherent sparsity of the scene and suffering high computational costs. To bridge the gap, we introduce a novel fully sparse occupancy network, termed SparseOcc. SparseOcc initially reconstructs a sparse 3D representation from visual inputs and subsequently predicts semantic/instance occupancy from the 3D sparse representation by sparse queries. A mask-guided sparse sampling is designed to enable sparse queries to interact with 2D features in a fully sparse manner, thereby circumventing costly dense features or global attention. Additionally, we design a thoughtful ray-based evaluation metric, namely RayIoU, to solve the inconsistency penalty along depths raised in traditional voxel-level mIoU criteria. SparseOcc demonstrates its effectiveness by achieving a RayIoU of 34.0, while maintaining a real-time inference speed of 17.3 FPS, with 7 history frames inputs. By incorporating more preceding frames to 15, SparseOcc continuously improves its performance to 35.1 RayIoU without bells and whistles. Code is available at https://github. com/MCG- NJU/SparseOcc.
引用
收藏
页码:54 / 71
页数:18
相关论文
共 50 条
  • [31] Rapid 3D Visualization of Indoor Scenes Using 3D Occupancy Grid Isosurfaces
    Zask, Ran
    Dailey, Matthew N.
    ECTI-CON: 2009 6TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING/ELECTRONICS, COMPUTER, TELECOMMUNICATIONS AND INFORMATION TECHNOLOGY, VOLS 1 AND 2, 2009, : 632 - 635
  • [32] FSD V2: Improving Fully Sparse 3D Object Detection With Virtual Voxels
    Fan, Lue
    Wang, Feng
    Wang, Naiyan
    Zhang, Zhaoxiang
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, 47 (02) : 1279 - 1292
  • [33] Cyclist Intent Prediction using 3D LIDAR Sensors for Fully Automated Vehicles
    Saleh, K.
    Abobakr, A.
    Nahavandi, D.
    Iskander, J.
    Attia, M.
    Hossny, M.
    Nahavandi, S.
    2019 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2019, : 2020 - 2026
  • [34] 3D PROBRABILISTIC OCCUPANCY GRID TO ROBOTIC MAPPING
    Souza, Anderson
    Goncalves, Luiz
    ICINCO 2011: PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS, VOL 2, 2011, : 264 - 269
  • [35] Learning to Reconstruct 3D Structures for Occupancy Mapping
    Guizilini, Vitor
    Ramos, Fabio
    ROBOTICS: SCIENCE AND SYSTEMS XIII, 2017,
  • [36] OccFormer: Dual-path Transformer for Vision-based 3D Semantic Occupancy Prediction
    Zhang, Yunpeng
    Zhu, Zheng
    Du, Dalong
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 9399 - 9409
  • [37] SOccDPT: 3D Semantic Occupancy From Dense Prediction Transformers Trained Under Memory Constraints
    Ganesh, Aditya Nalgunda
    ADVANCES IN ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING, 2024, 4 (02): : 2201 - 2212
  • [38] TDOcc: Exploit machine learning and big data in multi-view 3D occupancy prediction
    Shan, Chun
    Zeng, Jian
    Liu, Hongming
    Chen, Chuixing
    Du, Xiaojiang
    Guizani, Mohsen
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2025, 164
  • [39] HybridOcc: NeRF Enhanced Transformer-Based Multi-Camera 3D Occupancy Prediction
    Zhao, Xiao
    Chen, Bo
    Sun, Mingyang
    Yang, Dingkang
    Wang, Youxing
    Zhang, Xukun
    Li, Mingcheng
    Kou, Dongliang
    Wei, Xiaoyi
    Zhang, Lihua
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (09): : 7867 - 7874
  • [40] Dense and Sparse 3D Deformation Signatures for 3D Dynamic Face Recognition
    Shabayek, Abd El Rahman
    Aouada, Djamila
    IEEE ACCESS, 2021, 9 : 38687 - 38705