4D-Former: Multimodal 4D Panoptic Segmentation

被引:0
|
作者
Athar, Ali [1 ,3 ]
Li, Enxu [1 ,2 ]
Casas, Sergio [1 ,2 ]
Urtasun, Raquel [1 ,2 ]
机构
[1] Waabi, Toronto, ON, Canada
[2] Univ Toronto, Toronto, ON M5S 1A1, Canada
[3] Rhein Westfal TH Aachen, Aachen, Germany
来源
关键词
Panoptic Segmentation; Sensor Fusion; Temporal Reasoning; Autonomous Driving;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
4D panoptic segmentation is a challenging but practically useful task that requires every point in a LiDAR point-cloud sequence to be assigned a semantic class label, and individual objects to be segmented and tracked over time. Existing approaches utilize only LiDAR inputs which convey limited information in regions with point sparsity. This problem can, however, be mitigated by utilizing RGB camera images which offer appearance-based information that can reinforce the geometry-based LiDAR features. Motivated by this, we propose 4D-Former: a novel method for 4D panoptic segmentation which leverages both LiDAR and image modalities, and predicts semantic masks as well as temporally consistent object masks for the input point-cloud sequence. We encode semantic classes and objects using a set of concise queries which absorb feature information from both data modalities. Additionally, we propose a learned mechanism to associate object tracks over time which reasons over both appearance and spatial location. We apply 4D-Former to the nuScenes and SemanticKITTI datasets where it achieves state-of-the-art results. For more information, visit the project website: https://waabi.ai/4dformer.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] A Multistage Processing Procedure for 4D Breast MRI Segmentation
    Qi, Wang
    Hui, Ding
    Guang-zhi, Wang
    2008 30TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-8, 2008, : 3036 - 3039
  • [42] 4D CNN for Semantic Segmentation of Cardiac Volumetric Sequences
    Myronenko, Andriy
    Yang, Dong
    Buch, Varun
    Xu, Daguang
    Ihsani, Alvin
    Doyle, Sean
    Michalski, Mark
    Tenenholtz, Neil
    Roth, Holger
    STATISTICAL ATLASES AND COMPUTATIONAL MODELS OF THE HEART: MULTI-SEQUENCE CMR SEGMENTATION, CRT-EPIGGY AND LV FULL QUANTIFICATION CHALLENGES, 2020, 12009 : 72 - 80
  • [44] 4D IN ARCHITECTURE
    Lupeikis, Kestutis
    Maciulis, Algimantas M.
    JOURNAL OF ARCHITECTURE AND URBANISM, 2011, 35 (01) : 28 - 37
  • [45] 4D and tomotherapy
    Ramsey, C.
    RADIOTHERAPY AND ONCOLOGY, 2007, 84 : S107 - S107
  • [46] Identity in 4D
    Thomas Sattig
    Philosophical Studies, 2008, 140 : 179 - 195
  • [47] Microstructures in 4D
    Offerman, SE
    SCIENCE, 2004, 305 (5681) : 190 - 191
  • [48] 4D technology
    Hart's E and P, 2000, 73 (08):
  • [50] 4D imaging
    van Herk, M.
    RADIOTHERAPY AND ONCOLOGY, 2006, 81 : S174 - S174