Semantic Shape and Trajectory Reconstruction for Monocular Cooperative 3D Object Detection

被引:0
|
作者
Cserni, Marton [1 ]
Rovid, Andras [1 ]
机构
[1] Budapest Univ Technol & Econ BME, Fac Transportat Engn & Vehicle Engn, Dept Automot Technol, H-1111 Budapest, Hungary
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Semantics; Three-dimensional displays; Image reconstruction; Solid modeling; Trajectory; Pose estimation; Accuracy; Cameras; Computational modeling; Autonomous driving; shape aware monocular 3D object detection; trajectory reconstruction; semantic keypoints; cooperative perception;
D O I
10.1109/ACCESS.2024.3484672
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Currently the state-of-the-art monocular 3D object detectors use machine learning to estimate the 6DOF pose and shape of vehicles. This requires large amounts of precisely annotated 3D data for the training process and significant computing power for inference. Alternatively, there exist methods, which attempt to reconstruct target vehicle shapes and scales using projective geometry and classically detected feature points such as SURF and ORB. These methods use specific camera motion or geometrical constraints which cannot always be assumed. The resulting model is an unstructured point cloud which contains no semantic information, making its utility inconvenient in a distributed perception system. In this study, the applicability of semantic keypoints for vehicle shape and trajectory estimation is explored. A novel method is presented, which is capable reconstructing the semantic shape and trajectory of the target vehicle from a sequence of images with state-of-the art accuracy. The resulting semantic vertex model is then used for monocular, single frame 6DOF pose estimation with high accuracy. Building on this, a cooperative perception framework is also introduced. The algorithm is tested in both in-vehicle and infrastructure mounted mono-camera sensor setups. In addition to achieving state of the art depth accuracy in vehicle trajectory reconstruction on the Argoverse dataset, our method outperforms the state of the art shape-aware deep learning method in pose estimation in a cooperative perception scenario both in simulation and in real-world experiments.
引用
收藏
页码:167153 / 167167
页数:15
相关论文
共 50 条
  • [41] Monocular 3D object detection for construction scene analysis
    Shen, Jie
    Jiao, Lang
    Zhang, Cong
    Peng, Keran
    COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2024, 39 (09) : 1370 - 1389
  • [42] Delving into Localization Errors for Monocular 3D Object Detection
    Ma, Xinzhu
    Zhang, Yinmin
    Xu, Dan
    Zhou, Dongzhan
    Yi, Shuai
    Li, Haojie
    Ouyang, Wanli
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 4719 - 4728
  • [43] Competition for roadside camera monocular 3D object detection
    Jinrang Jia
    Yifeng Shi
    Yuli Qu
    Rui Wang
    Xing Xu
    Hai Zhang
    NationalScienceReview, 2023, 10 (06) : 34 - 37
  • [44] MonoGRNet: A General Framework for Monocular 3D Object Detection
    Qin, Zengyi
    Wang, Jinglu
    Lu, Yan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (09) : 5170 - 5184
  • [45] 3D Reconstruction and Object Detection for HoloLens
    Wu, Zequn
    Zhao, Tianhao
    Nguyen, Chuong
    2020 DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA), 2020,
  • [46] Object-Aware Centroid Voting for Monocular 3D Object Detection
    Bao, Wentao
    Yu, Qi
    Kong, Yu
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 2197 - 2204
  • [47] Monocular 3D Shape Reconstruction using Deep Neural Networks
    Rao, Qing
    Krueger, Lars
    Dietmayer, Klaus
    2016 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2016, : 310 - 315
  • [48] Semantic Consistency Networks for 3D Object Detection
    Wei, Wenwen
    Wei, Ping
    Zheng, Nanning
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 2861 - 2869
  • [49] 3D Reconstruction of a Smooth Articulated Trajectory from a Monocular Image Sequence
    Park, Hyun Soo
    Sheikh, Yaser
    2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2011, : 201 - 208
  • [50] SGM3D: Stereo Guided Monocular 3D Object Detection
    Zhou, Zheyuan
    Du, Liang
    Ye, Xiaoqing
    Zou, Zhikang
    Tan, Xiao
    Zhang, Li
    Xue, Xiangyang
    Feng, Jianfeng
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (04) : 10478 - 10485