4D-Former: Multimodal 4D Panoptic Segmentation

被引:0
|
作者
Athar, Ali [1 ,3 ]
Li, Enxu [1 ,2 ]
Casas, Sergio [1 ,2 ]
Urtasun, Raquel [1 ,2 ]
机构
[1] Waabi, Toronto, ON, Canada
[2] Univ Toronto, Toronto, ON M5S 1A1, Canada
[3] Rhein Westfal TH Aachen, Aachen, Germany
来源
关键词
Panoptic Segmentation; Sensor Fusion; Temporal Reasoning; Autonomous Driving;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
4D panoptic segmentation is a challenging but practically useful task that requires every point in a LiDAR point-cloud sequence to be assigned a semantic class label, and individual objects to be segmented and tracked over time. Existing approaches utilize only LiDAR inputs which convey limited information in regions with point sparsity. This problem can, however, be mitigated by utilizing RGB camera images which offer appearance-based information that can reinforce the geometry-based LiDAR features. Motivated by this, we propose 4D-Former: a novel method for 4D panoptic segmentation which leverages both LiDAR and image modalities, and predicts semantic masks as well as temporally consistent object masks for the input point-cloud sequence. We encode semantic classes and objects using a set of concise queries which absorb feature information from both data modalities. Additionally, we propose a learned mechanism to associate object tracks over time which reasons over both appearance and spatial location. We apply 4D-Former to the nuScenes and SemanticKITTI datasets where it achieves state-of-the-art results. For more information, visit the project website: https://waabi.ai/4dformer.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] 4D Cardiac Segmentation of the Epicardium and Left Ventricle
    Moll, G. Pons
    Tadmor, G.
    MacLeod, R. S.
    Rosenhahn, B.
    Brooks, D. H.
    WORLD CONGRESS ON MEDICAL PHYSICS AND BIOMEDICAL ENGINEERING, VOL 25, PT 4: IMAGE PROCESSING, BIOSIGNAL PROCESSING, MODELLING AND SIMULATION, BIOMECHANICS, 2010, 25 : 2004 - 2007
  • [22] Aortic Root Segmentation in 4D Transesophageal Echocardiography
    Chechani, Shubham
    Suresh, Rahul
    Patwardhan, Kedar A.
    MEDICAL IMAGING 2018: COMPUTER-AIDED DIAGNOSIS, 2018, 10575
  • [23] 4D Relationships: The Missing Link in 4D Scheduling
    Trang Dang
    Bargstaedt, Hans-Joachim
    JOURNAL OF CONSTRUCTION ENGINEERING AND MANAGEMENT, 2016, 142 (02)
  • [24] 4D Multimodal Speaker Model for Remote Speech Diagnosis
    Krecichwost, Michal
    Sage, Agata
    Miodonska, Zuzanna
    Badura, Pawel
    IEEE ACCESS, 2022, 10 : 93187 - 93202
  • [26] Advances in 4D medical imaging and 4D radiation therapy
    Li, Guang
    Citrin, Deborah
    Camphausen, Kevin
    Mueller, Boris
    Burman, Chandra
    Mychalczak, Borys
    Miller, Robert W.
    Song, Yulin
    TECHNOLOGY IN CANCER RESEARCH & TREATMENT, 2008, 7 (01) : 67 - 81
  • [27] Application of 4D at field - Applying 4D to control materials
    Sota, J. V.
    Zegarra, S. V.
    EWORK AND EBUSINESS IN ARCHITECTURE, ENGINEERIN G AND CONSTRUCTION, 2006, : 135 - +
  • [28] 4D X-Ray DSA and 4D Fluoroscopy
    Mistretta, C.
    MEDICAL PHYSICS, 2012, 39 (06) : 3869 - 3869
  • [29] Implementing 4D XCAT Phantom for 4D Radiotherapy Research
    Panta, R.
    Segars, W.
    Yin, F.
    Cai, J.
    MEDICAL PHYSICS, 2012, 39 (06) : 3686 - 3686
  • [30] An Approach to Convert 4D Geometry into a 4D CT Scan
    Villard, P. F.
    Beuve, M.
    Shariat, B.
    WSCG 2006: SHORT PAPERS PROCEEDINGS: 14TH INTERNATIONAL CONFERENCE IN CENTRAL EUROPE ON COMPUTER GRAPHICS, VISUALIZATION AND COMPUTER VISION 2006, 2006, : 163 - 169