Semantic Shape and Trajectory Reconstruction for Monocular Cooperative 3D Object Detection

被引：0

作者：

Cserni, Marton ^{[1
]}

Rovid, Andras ^{[1
]}

机构：

[1] Budapest Univ Technol & Econ BME, Fac Transportat Engn & Vehicle Engn, Dept Automot Technol, H-1111 Budapest, Hungary

来源：

IEEE ACCESS | 2024年 / 12卷

关键词：

Semantics; Three-dimensional displays; Image reconstruction; Solid modeling; Trajectory; Pose estimation; Accuracy; Cameras; Computational modeling; Autonomous driving; shape aware monocular 3D object detection; trajectory reconstruction; semantic keypoints; cooperative perception;

D O I：

10.1109/ACCESS.2024.3484672

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Currently the state-of-the-art monocular 3D object detectors use machine learning to estimate the 6DOF pose and shape of vehicles. This requires large amounts of precisely annotated 3D data for the training process and significant computing power for inference. Alternatively, there exist methods, which attempt to reconstruct target vehicle shapes and scales using projective geometry and classically detected feature points such as SURF and ORB. These methods use specific camera motion or geometrical constraints which cannot always be assumed. The resulting model is an unstructured point cloud which contains no semantic information, making its utility inconvenient in a distributed perception system. In this study, the applicability of semantic keypoints for vehicle shape and trajectory estimation is explored. A novel method is presented, which is capable reconstructing the semantic shape and trajectory of the target vehicle from a sequence of images with state-of-the art accuracy. The resulting semantic vertex model is then used for monocular, single frame 6DOF pose estimation with high accuracy. Building on this, a cooperative perception framework is also introduced. The algorithm is tested in both in-vehicle and infrastructure mounted mono-camera sensor setups. In addition to achieving state of the art depth accuracy in vehicle trajectory reconstruction on the Argoverse dataset, our method outperforms the state of the art shape-aware deep learning method in pose estimation in a cooperative perception scenario both in simulation and in real-world experiments.

引用

页码：167153 / 167167

页数：15

共 50 条

[1] Monocular 3D Object Detection Leveraging Accurate Proposals and Shape Reconstruction
Ku, Jason
Pon, Alex D.
Waslander, Steven L.
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 11859 - 11868
[2] Probabilistic instance shape reconstruction with sparse LiDAR for monocular 3D object detection
Ji, Chaofeng
Wu, Han
Liu, Guizhong
NEUROCOMPUTING, 2023, 529 : 92 - 100
[3] Shape-Aware Monocular 3D Object Detection
Chen, Wei
Zhao, Jie
Zhao, Wan-Lei
Wu, Song-Yuan
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (06) : 6416 - 6424
[4] Monocular 3D Vehicle Trajectory Reconstruction Using Terrain Shape Constraints
Bullinger, Sebastian
Bodensteiner, Christoph
Arens, Michael
2018 21ST INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2018, : 1122 - 1128
[5] MonoRUn: Monocular 3D Object Detection by Reconstruction and Uncertainty Propagation
Chen, Hansheng
Huang, Yuyao
Tian, Wei
Gao, Zhong
Xiong, Lu
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 10374 - 10383
[6] Aerial Monocular 3D Object Detection
Hu, Yue
Fang, Shaoheng
Xie, Weidi
Chen, Siheng
IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (04) : 1959 - 1966
[7] Disentangling Monocular 3D Object Detection
Simonelli, Andrea
Bulo, Samuel Rota
Porzi, Lorenzo
Lopez-Antequera, Manuel
Kontschieder, Peter
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1991 - 1999
[8] Monocular 3D Object Reconstruction with GAN Inversion
Zhang, Junzhe
Ren, Daxuan
Cai, Zhongang
Yeo, Chai Kiat
Dai, Bo
Loy, Chen Change
COMPUTER VISION - ECCV 2022, PT I, 2022, 13661 : 673 - 689
[9] Efficient and Robust 3D Object Reconstruction Based on Monocular SLAM and CNN Semantic Segmentation
Weber, Thomas
Triputen, Sergey
Gopal, Atmaraaj
Eissler, Steffen
Hoefert, Christian
Schreve, Kristiaan
Raetsch, Matthias
ROBOT WORLD CUP XXIII, ROBOCUP 2019, 2019, 11531 : 351 - 363
[10] Monocular 3D Object Detection for Autonomous Driving
Chen, Xiaozhi
Kundu, Kaustav
Zhang, Ziyu
Ma, Huimin
Fidler, Sanja
Urtasun, Raquel
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 2147 - 2156

← 1 2 3 4 5 →