Fully Sparse 3D Occupancy Prediction

被引:0
|
作者
Liu, Haisong [1 ,2 ]
Chen, Yang [1 ]
Wang, Haiguang [1 ]
Yang, Zetong [2 ]
Li, Tianyu [2 ]
Zeng, Jia [2 ]
Chen, Li [2 ]
Li, Hongyang [2 ]
Wang, Limin [1 ,2 ]
机构
[1] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing, Peoples R China
[2] Shanghai AI Lab, Shanghai, Peoples R China
来源
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
3D Occupancy Estimation; Semantic Scene Completion; 3D Reconstruction; Autonomous Driving;
D O I
10.1007/978-3-031-72698-9_4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Occupancy prediction plays a pivotal role in autonomous driving. Previous methods typically construct dense 3D volumes, neglecting the inherent sparsity of the scene and suffering high computational costs. To bridge the gap, we introduce a novel fully sparse occupancy network, termed SparseOcc. SparseOcc initially reconstructs a sparse 3D representation from visual inputs and subsequently predicts semantic/instance occupancy from the 3D sparse representation by sparse queries. A mask-guided sparse sampling is designed to enable sparse queries to interact with 2D features in a fully sparse manner, thereby circumventing costly dense features or global attention. Additionally, we design a thoughtful ray-based evaluation metric, namely RayIoU, to solve the inconsistency penalty along depths raised in traditional voxel-level mIoU criteria. SparseOcc demonstrates its effectiveness by achieving a RayIoU of 34.0, while maintaining a real-time inference speed of 17.3 FPS, with 7 history frames inputs. By incorporating more preceding frames to 15, SparseOcc continuously improves its performance to 35.1 RayIoU without bells and whistles. Code is available at https://github. com/MCG- NJU/SparseOcc.
引用
收藏
页码:54 / 71
页数:18
相关论文
共 50 条
  • [41] Sparse Representation for 3D Face Recognition
    Guo, Zhe
    Fan, Yang-Yu
    2013 FOURTH WORLD CONGRESS ON SOFTWARE ENGINEERING (WCSE), 2013, : 336 - 339
  • [42] Infinite, Sparse 3D Modelling Volumes
    Funk, Eugen
    Boerner, Anko
    COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS, VISIGRAPP 2016, 2017, 693 : 593 - 605
  • [43] 3D sparse signal recovery via 3D orthogonal matching pursuit
    Huo, Yingqiu
    Fang, Yong
    Huang, Lei
    JOURNAL OF SYSTEMS ARCHITECTURE, 2016, 64 : 3 - 10
  • [44] SPARSE REPRESENTATION OF DEFORMABLE 3D ORGANS
    Wang, Dan
    Tewfik, Ahmed H.
    2009 IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING: FROM NANO TO MACRO, VOLS 1 AND 2, 2009, : 887 - 890
  • [45] 3D COPRIME ARRAYS IN SPARSE SENSING
    Li, Conghui
    Gan, Lu
    Ling, Cong
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 4200 - 4204
  • [46] Sparse layout of irregular 3D clusters
    Romanova, Tatiana
    Pankratov, Alexander
    Litvinchev, Igor
    Dubinskyi, Vladimir
    Infante, Luis
    JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2023, 74 (01) : 351 - 361
  • [47] Super Sparse 3D Object Detection
    Fan, Lue
    Yang, Yuxue
    Wang, Feng
    Wang, Naiyan
    Zhang, Zhaoxiang
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (10) : 12490 - 12505
  • [48] nuCraft: Crafting High Resolution 3D Semantic Occupancy for Unified 3D Scene Understanding
    Zhu, Benjin
    Wang, Zhe
    Li, Hongsheng
    COMPUTER VISION - ECCV 2024, PT V, 2025, 15063 : 125 - 141
  • [49] Optimization of a fully 3D single scatter simulation algorithm for 3D PET
    Accorsi, R
    Adam, LE
    Werner, ME
    Karp, JS
    PHYSICS IN MEDICINE AND BIOLOGY, 2004, 49 (12): : 2577 - 2598
  • [50] 3D printing of fully degradable materials
    Dove, Andrew
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2018, 256