BEVStereo: Enhancing Depth Estimation in Multi-View 3D Object Detection with Temporal Stereo

被引:0
|
作者
Li, Yinhao [1 ,3 ]
Bao, Han [2 ,3 ]
Ge, Zheng [4 ]
Yang, Jinrong [5 ]
Sun, Jianjian [4 ]
Li, Zeming [4 ]
机构
[1] Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Informat Proc, Beijing, Peoples R China
[2] Chinese Acad Sci, Inst Comp Technol, State Key Lab Processors, Beijing, Peoples R China
[3] Univ Chinese Acad Sci, Beijing, Peoples R China
[4] MEGVII Technol, Beijing, Peoples R China
[5] Huazhong Univ Sci & Technol, Wuhan, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Restricted by the ability of depth perception, all Multi-view 3D object detection methods fall into the bottleneck of depth accuracy. By constructing temporal stereo, depth estimation is quite reliable in indoor scenarios. However, there are two difficulties in directly integrating temporal stereo into outdoor multi-view 3D object detectors: 1) The construction of temporal stereos for all views results in high computing costs. 2) Unable to adapt to challenging outdoor scenarios. In this study, we propose an effective method for creating temporal stereo by dynamically determining the center and range of the temporal stereo. The most confident center is found using the EM algorithm. Numerous experiments on nuScenes have shown the BEVStereo's ability to deal with complex outdoor scenarios that other stereo-based methods are unable to handle. For the first time, a stereo-based approach shows superiority in scenarios like a static ego vehicle and moving objects. BEVStereo achieves the new state-of-the-art in the camera-only track of nuScenes dataset while maintaining memory efficiency. Codes have been released(1).
引用
收藏
页码:1486 / 1494
页数:9
相关论文
共 50 条
  • [1] BEVDepth: Acquisition of Reliable Depth for Multi-View 3D Object Detection
    Li, Yinhao
    Ge, Zheng
    Yu, Guanyi
    Yang, Jinrong
    Wang, Zengran
    Shi, Yukang
    Sun, Jianjian
    Li, Zeming
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 2, 2023, : 1477 - 1485
  • [2] Multi-View Attentive Contextualization for Multi-View 3D Object Detection
    Liu, Xianpeng
    Zheng, Ce
    Qian, Ming
    Xue, Nan
    Chen, Chen
    Zhang, Zhebin
    Li, Chen
    Wu, Tianfu
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 16688 - 16698
  • [3] Continuous Depth Estimation for Multi-view Stereo
    Liu, Yebin
    Cao, Xun
    Dai, Qionghai
    Xu, Wenli
    CVPR: 2009 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-4, 2009, : 2121 - 2128
  • [4] Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection
    Wang, Shihao
    Liu, Yingfei
    Wang, Tiancai
    Li, Ying
    Zhang, Xiangyu
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 3598 - 3608
  • [5] Viewpoint Equivariance for Multi-View 3D Object Detection
    Chen, Dian
    Li, Jie
    Guizilini, Vitor
    Ambrus, Rares
    Gaidon, Adrien
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 9213 - 9222
  • [6] Adaptive depth estimation for pyramid multi-view stereo
    Liao, Jie
    Fu, Yanping
    Yan, Qingan
    Luo, Fei
    Xiao, Chunxia
    COMPUTERS & GRAPHICS-UK, 2021, 97 : 268 - 278
  • [7] REVISED DEPTH MAP ESTIMATION FOR MULTI-VIEW STEREO
    Yao, Yao
    Zhu, Hao
    Nie, Yongming
    Ji, Xiaoli
    Cao, Xun
    2014 INTERNATIONAL CONFERENCE ON 3D IMAGING (IC3D), 2014,
  • [8] Improving 3D Object Detection for Pedestrians with Virtual Multi-View Synthesis Orientation Estimation
    Ku, Jason
    Pon, Alex D.
    Walsh, Sean
    Waslander, Steven L.
    2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 3459 - 3466
  • [9] Multi-View Stereo 3D Edge Reconstruction
    Bignoli, Andrea
    Romanoni, Andrea
    Matteucci, Matteo
    2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, : 867 - 875
  • [10] Confidence Guided Stereo 3D Object Detection with Split Depth Estimation
    Li, Chengyao
    Ku, Jason
    Waslander, Steven L.
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 5776 - 5783