Semantic Shape and Trajectory Reconstruction for Monocular Cooperative 3D Object Detection

Cited by: 0

Authors:

Cserni, Marton [1]

Rovid, Andras [1]

Affiliations:

[1] Budapest Univ Technol & Econ BME, Fac Transportat Engn & Vehicle Engn, Dept Automot Technol, H-1111 Budapest, Hungary

Source:

IEEE ACCESS | 2024 / Vol. 12

Keywords:

Semantics; Three-dimensional displays; Image reconstruction; Solid modeling; Trajectory; Pose estimation; Accuracy; Cameras; Computational modeling; Autonomous driving; shape aware monocular 3D object detection; trajectory reconstruction; semantic keypoints; cooperative perception;

DOI:

10.1109/ACCESS.2024.3484672

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812 ;

Abstract:

Currently, state-of-the-art monocular 3D object detectors use machine learning to estimate the 6DOF pose and shape of vehicles. This requires large amounts of precisely annotated 3D data for training and significant computing power for inference. Alternatively, there exist methods that attempt to reconstruct target vehicle shapes and scales using projective geometry and classically detected feature points such as SURF and ORB. These methods rely on specific camera motion or geometrical constraints which cannot always be assumed. The resulting model is an unstructured point cloud that contains no semantic information, which makes it inconvenient to use in a distributed perception system. In this study, the applicability of semantic keypoints for vehicle shape and trajectory estimation is explored. A novel method is presented that is capable of reconstructing the semantic shape and trajectory of the target vehicle from a sequence of images with state-of-the-art accuracy. The resulting semantic vertex model is then used for monocular, single-frame 6DOF pose estimation with high accuracy. Building on this, a cooperative perception framework is also introduced. The algorithm is tested in both in-vehicle and infrastructure-mounted mono-camera sensor setups. In addition to achieving state-of-the-art depth accuracy in vehicle trajectory reconstruction on the Argoverse dataset, our method outperforms the state-of-the-art shape-aware deep learning method in pose estimation in a cooperative perception scenario, both in simulation and in real-world experiments.
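The abstract does not say which solver is used for the single-frame 6DOF step, so the following is only an illustrative sketch of the general idea: given a known 3D vertex model and the matching 2D keypoints in one image, the pose can be recovered from the 3D-2D correspondences. The sketch swaps in a textbook Direct Linear Transform (DLT), not the authors' method. The keypoint coordinates, the intrinsics `K`, and the function names `project` and `estimate_pose_dlt` are all hypothetical.

```python
import numpy as np

def project(points_3d, R, t, K):
    """Pinhole projection of Nx3 model points into pixel coordinates."""
    cam = points_3d @ R.T + t            # model frame -> camera frame
    uvw = cam @ K.T                      # camera frame -> homogeneous pixels
    return uvw[:, :2] / uvw[:, 2:3]

def estimate_pose_dlt(points_3d, pixels, K):
    """Recover (R, t) from >= 6 non-coplanar 3D-2D correspondences via DLT."""
    n = len(points_3d)
    # Normalised image coordinates remove the intrinsics from the problem.
    pix_h = np.hstack([pixels, np.ones((n, 1))])
    xy = (np.linalg.inv(K) @ pix_h.T).T
    X = np.hstack([points_3d, np.ones((n, 1))])
    rows = []
    for Xi, (x, y, _) in zip(X, xy):
        rows.append(np.concatenate([Xi, np.zeros(4), -x * Xi]))
        rows.append(np.concatenate([np.zeros(4), Xi, -y * Xi]))
    # The projection matrix is the null vector of the stacked constraints.
    _, _, Vt = np.linalg.svd(np.array(rows))
    P = Vt[-1].reshape(3, 4)             # equals s * [R | t] up to sign
    if np.linalg.det(P[:, :3]) < 0:      # fix the sign so that s > 0
        P = -P
    # Project the 3x3 part onto the nearest rotation and undo the scale.
    U, S, Vt_m = np.linalg.svd(P[:, :3])
    R = U @ Vt_m
    t = P[:, 3] / S.mean()
    return R, t

# Hypothetical semantic vertex model of a car, in metres (x right, y down, z forward).
MODEL = np.array([
    [-0.8,  0.0,  1.3], [0.8,  0.0,  1.3],   # front wheel centres
    [-0.8,  0.0, -1.3], [0.8,  0.0, -1.3],   # rear wheel centres
    [-0.6, -1.4,  0.5], [0.6, -1.4,  0.5],   # front roof corners
    [-0.7, -0.7,  2.0], [0.7, -0.7,  2.0],   # headlights
    [-0.7, -0.9, -2.1],                      # left tail light
])

# Assumed camera intrinsics and a ground-truth pose used to synthesise keypoints.
K = np.array([[1000.0, 0.0, 640.0], [0.0, 1000.0, 360.0], [0.0, 0.0, 1.0]])
yaw = np.deg2rad(30.0)
R_true = np.array([[np.cos(yaw), 0.0, np.sin(yaw)],
                   [0.0, 1.0, 0.0],
                   [-np.sin(yaw), 0.0, np.cos(yaw)]])
t_true = np.array([1.0, 0.5, 12.0])

pixels = project(MODEL, R_true, t_true, K)   # noise-free synthetic detections
R_est, t_est = estimate_pose_dlt(MODEL, pixels, K)
```

With noise-free synthetic keypoints the recovered pose matches the ground truth to numerical precision. With real detections, a robust and nonlinearly refined solver would normally replace plain DLT.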


Pages: 167153-167167

Number of pages: 15
