BEVSegFormer: Bird's Eye View Semantic Segmentation From Arbitrary Camera Rigs

被引：29

作者：

Peng, Lang ^{[1
]}

Chen, Zhirong ^{[1
]}

Fu, Zhangjie ^{[1
]}

Liang, Pengpeng ^{[2
]}

Cheng, Erkang ^{[1
]}

机构：

[1] Nullmax, Beijing, Peoples R China

[2] Zhengzhou Univ, Zhengzhou, Peoples R China

来源：

2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV) | 2023年

关键词：

D O I：

10.1109/WACV56688.2023.00588

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Semantic segmentation in bird's eye view (BEV) is an important task for autonomous driving. Though this task has attracted a large amount of research efforts, it is still challenging to flexibly cope with arbitrary (single or multiple) camera sensors equipped on the autonomous vehicle. In this paper, we present BEVSegFormer, an effective transformer-based method for BEV semantic segmentation from arbitrary camera rigs. Specifically, our method first encodes image features from arbitrary cameras with a shared backbone. These image features are then enhanced by a deformable transformer-based encoder. Moreover, we introduce a BEV transformer decoder module to parse BEV semantic segmentation results. An efficient multi-camera deformable attention unit is designed to carry out the BEV-to-image view transformation. Finally, the queries are reshaped according to the layout of grids in the BEV, and upsampled to produce the semantic segmentation result in a supervised manner. We evaluate the proposed algorithm on the public nuScenes dataset and a self-collected dataset. Experimental results show that our method achieves promising performance on BEV semantic segmentation from arbitrary camera rigs. We also demonstrate the effectiveness of each component via ablation study.

引用

页码：5924 / 5932

页数：9

共 50 条

[1] Camera-view supervision for bird's-eye-view semantic segmentation
Yang, Bowen
Yu, Linlin
Chen, Feng
FRONTIERS IN BIG DATA, 2024, 7
[2] LaRa: Latents and Rays for Multi-Camera Bird's-Eye-View Semantic Segmentation
Bartoccioni, Florent
Zablocki, Eloi
Bursuc, Andrei
Perez, Patrick
Cord, Matthieu
Alahari, Karteek
CONFERENCE ON ROBOT LEARNING, VOL 205, 2022, 205 : 1663 - 1672
[3] Improving Bird's Eye View Semantic Segmentation by Task Decomposition
Zhao, Tianhao
Chen, Yongcan
Wu, Yu
Liu, Tianyang
Du, Bo
Xiao, Peilun
Qiu, Shi
Yang, Hongda
Li, Guozhen
Yang, Yi
Lin, Yutian
2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 15512 - 15521
[4] Efficient Semantic Segmentation for Visual Bird's-Eye View Interpretation
Saemann, Timo
Amende, Karl
Milz, Stefan
Witt, Christian
Simon, Martin
Petzold, Johannes
INTELLIGENT AUTONOMOUS SYSTEMS 15, IAS-15, 2019, 867 : 679 - 688
[5] CoBEVT: Cooperative Bird's Eye View Semantic Segmentation with Sparse Transformers
Xu, Runsheng
Tu, Zhengzhong
Xiang, Hao
Shao, Wei
Zhou, Bolei
Ma, Jiaqi
CONFERENCE ON ROBOT LEARNING, VOL 205, 2022, 205 : 989 - 1000
[6] Bird's Eye View Semantic Segmentation based on Improved Transformer for Automatic Annotation
Liang, Tianjiao
Pan, Weiguo
Bao, Hong
Fan, Xinyue
Li, Han
KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2023, 17 (08): : 1996 - 2015
[7] Knowledge Distillation from 3D to Bird's-Eye-View for LiDAR Semantic Segmentation
Jiang, Feng
Gao, Heng
Qiu, Shoumeng
Zhang, Haiqiang
Wan, Ru
Pu, Jian
2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 402 - 407
[8] DVT: Decoupled Dual-Branch View Transformation for Monocular Bird's Eye View Semantic Segmentation
Du, Jiayuan
Pan, Xianghui
Shen, Mengjiao
Su, Shuai
Yang, Jingwei
Liu, Chengju
Chen, Qijun
2024 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2024), 2024, : 9769 - 9776
[9] Self Bird's Eye View with Omnidirectional Camera on HMD
Funahashi, Kenji
Sumida, Naoki
Mizuno, Shinji
2019 26TH IEEE CONFERENCE ON VIRTUAL REALITY AND 3D USER INTERFACES (VR), 2019, : 935 - 936
[10] BAEFormer: Bi-directional and Early Interaction Transformers for Bird's Eye View Semantic Segmentation
Pan, Cong
He, Yonghao
Peng, Junran
Zhang, Qian
Sui, Wei
Zhang, Zhaoxiang
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 9590 - 9599

← 1 2 3 4 5 →