BEVSegFormer: Bird's Eye View Semantic Segmentation From Arbitrary Camera Rigs

被引:29
|
作者
Peng, Lang [1 ]
Chen, Zhirong [1 ]
Fu, Zhangjie [1 ]
Liang, Pengpeng [2 ]
Cheng, Erkang [1 ]
机构
[1] Nullmax, Beijing, Peoples R China
[2] Zhengzhou Univ, Zhengzhou, Peoples R China
关键词
D O I
10.1109/WACV56688.2023.00588
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semantic segmentation in bird's eye view (BEV) is an important task for autonomous driving. Though this task has attracted a large amount of research efforts, it is still challenging to flexibly cope with arbitrary (single or multiple) camera sensors equipped on the autonomous vehicle. In this paper, we present BEVSegFormer, an effective transformer-based method for BEV semantic segmentation from arbitrary camera rigs. Specifically, our method first encodes image features from arbitrary cameras with a shared backbone. These image features are then enhanced by a deformable transformer-based encoder. Moreover, we introduce a BEV transformer decoder module to parse BEV semantic segmentation results. An efficient multi-camera deformable attention unit is designed to carry out the BEV-to-image view transformation. Finally, the queries are reshaped according to the layout of grids in the BEV, and upsampled to produce the semantic segmentation result in a supervised manner. We evaluate the proposed algorithm on the public nuScenes dataset and a self-collected dataset. Experimental results show that our method achieves promising performance on BEV semantic segmentation from arbitrary camera rigs. We also demonstrate the effectiveness of each component via ablation study.
引用
收藏
页码:5924 / 5932
页数:9
相关论文
共 50 条
  • [1] Camera-view supervision for bird's-eye-view semantic segmentation
    Yang, Bowen
    Yu, Linlin
    Chen, Feng
    FRONTIERS IN BIG DATA, 2024, 7
  • [2] LaRa: Latents and Rays for Multi-Camera Bird's-Eye-View Semantic Segmentation
    Bartoccioni, Florent
    Zablocki, Eloi
    Bursuc, Andrei
    Perez, Patrick
    Cord, Matthieu
    Alahari, Karteek
    CONFERENCE ON ROBOT LEARNING, VOL 205, 2022, 205 : 1663 - 1672
  • [3] Improving Bird's Eye View Semantic Segmentation by Task Decomposition
    Zhao, Tianhao
    Chen, Yongcan
    Wu, Yu
    Liu, Tianyang
    Du, Bo
    Xiao, Peilun
    Qiu, Shi
    Yang, Hongda
    Li, Guozhen
    Yang, Yi
    Lin, Yutian
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 15512 - 15521
  • [4] Efficient Semantic Segmentation for Visual Bird's-Eye View Interpretation
    Saemann, Timo
    Amende, Karl
    Milz, Stefan
    Witt, Christian
    Simon, Martin
    Petzold, Johannes
    INTELLIGENT AUTONOMOUS SYSTEMS 15, IAS-15, 2019, 867 : 679 - 688
  • [5] CoBEVT: Cooperative Bird's Eye View Semantic Segmentation with Sparse Transformers
    Xu, Runsheng
    Tu, Zhengzhong
    Xiang, Hao
    Shao, Wei
    Zhou, Bolei
    Ma, Jiaqi
    CONFERENCE ON ROBOT LEARNING, VOL 205, 2022, 205 : 989 - 1000
  • [6] Bird's Eye View Semantic Segmentation based on Improved Transformer for Automatic Annotation
    Liang, Tianjiao
    Pan, Weiguo
    Bao, Hong
    Fan, Xinyue
    Li, Han
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2023, 17 (08): : 1996 - 2015
  • [7] Knowledge Distillation from 3D to Bird's-Eye-View for LiDAR Semantic Segmentation
    Jiang, Feng
    Gao, Heng
    Qiu, Shoumeng
    Zhang, Haiqiang
    Wan, Ru
    Pu, Jian
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 402 - 407
  • [8] DVT: Decoupled Dual-Branch View Transformation for Monocular Bird's Eye View Semantic Segmentation
    Du, Jiayuan
    Pan, Xianghui
    Shen, Mengjiao
    Su, Shuai
    Yang, Jingwei
    Liu, Chengju
    Chen, Qijun
    2024 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2024), 2024, : 9769 - 9776
  • [9] Self Bird's Eye View with Omnidirectional Camera on HMD
    Funahashi, Kenji
    Sumida, Naoki
    Mizuno, Shinji
    2019 26TH IEEE CONFERENCE ON VIRTUAL REALITY AND 3D USER INTERFACES (VR), 2019, : 935 - 936
  • [10] BAEFormer: Bi-directional and Early Interaction Transformers for Bird's Eye View Semantic Segmentation
    Pan, Cong
    He, Yonghao
    Peng, Junran
    Zhang, Qian
    Sui, Wei
    Zhang, Zhaoxiang
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 9590 - 9599