Semantic Shape and Trajectory Reconstruction for Monocular Cooperative 3D Object Detection

Authors
Cserni, Marton [1]
Rovid, Andras [1]
Affiliations
[1] Budapest Univ Technol & Econ BME, Fac Transportat Engn & Vehicle Engn, Dept Automot Technol, H-1111 Budapest, Hungary
Source
IEEE ACCESS | 2024 / Vol. 12
Keywords
Semantics; Three-dimensional displays; Image reconstruction; Solid modeling; Trajectory; Pose estimation; Accuracy; Cameras; Computational modeling; Autonomous driving; shape aware monocular 3D object detection; trajectory reconstruction; semantic keypoints; cooperative perception
DOI
10.1109/ACCESS.2024.3484672
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Currently, state-of-the-art monocular 3D object detectors use machine learning to estimate the 6DOF pose and shape of vehicles. This requires large amounts of precisely annotated 3D data for training and significant computing power for inference. Alternatively, there are methods that attempt to reconstruct target vehicle shapes and scales using projective geometry and classically detected feature points such as SURF and ORB. These methods rely on specific camera motion or geometric constraints that cannot always be assumed. The resulting model is an unstructured point cloud that contains no semantic information, which makes it inconvenient to use in a distributed perception system. In this study, the applicability of semantic keypoints to vehicle shape and trajectory estimation is explored. A novel method is presented that is capable of reconstructing the semantic shape and trajectory of the target vehicle from a sequence of images with state-of-the-art accuracy. The resulting semantic vertex model is then used for monocular, single-frame 6DOF pose estimation with high accuracy. Building on this, a cooperative perception framework is also introduced. The algorithm is tested in both in-vehicle and infrastructure-mounted mono-camera sensor setups. In addition to achieving state-of-the-art depth accuracy in vehicle trajectory reconstruction on the Argoverse dataset, our method outperforms the state-of-the-art shape-aware deep learning method in pose estimation in a cooperative perception scenario, both in simulation and in real-world experiments.
Pages: 167153-167167
Page count: 15