AutoShape: Real-Time Shape-Aware Monocular 3D Object Detection

被引:64
|
作者
Liu, Zongdai [1 ]
Zhou, Dingfu [1 ]
Lu, Feixiang [1 ]
Fang, Jin [1 ]
Zhang, Liangjun [1 ]
机构
[1] Baidu Res, Natl Engn Lab Deep Learning Technol & Applicat, Robot & Autonomous Driving Lab, Beijing, Peoples R China
关键词
ACCURATE;
D O I
10.1109/ICCV48922.2021.01535
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing deep learning-based approaches for monocular 3D object detection in autonomous driving often model the object as a rotated 3D cuboid while the object's geometric shape has been ignored. In this work, we propose an approach for incorporating the shape-aware 2D/3D constraints into the 3D detection framework. Specifically, we employ the deep neural network to learn distinguished 2D keypoints in the 2D image domain and regress their corresponding 3D coordinates in the local 3D object coordinate first. Then the 2D/3D geometric constraints are built by these correspondences for each object to boost the detection performance. For generating the ground truth of 2D/3D keypoints, an automatic model-fitting approach has been proposed by fitting the deformed 3D object model and the object mask in the 2D image. The proposed framework has been verified on the public KITTI dataset and the experimental results demonstrate that by using additional geometrical constraints the detection performance has been significantly improved as compared to the baseline method. More importantly, the proposed framework achieves state-of-the-art performance with real time. Data and code will be available at https://github.com/ zongdai/AutoShape
引用
收藏
页码:15621 / 15630
页数:10
相关论文
共 50 条
  • [31] An Approach to 3D Object Detection in Real-Time for Cognitive Robotics Experiments
    Vidal-Soroa, Daniel
    Furelos, Pedro
    Bellas, Francisco
    Antonio Becerra, Jose
    ROBOT2022: FIFTH IBERIAN ROBOTICS CONFERENCE: ADVANCES IN ROBOTICS, VOL 1, 2023, 589 : 283 - 294
  • [32] PIXOR: Real-time 3D Object Detection from Point Clouds
    Yang, Bin
    Luo, Wenjie
    Urtasun, Raquel
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7652 - 7660
  • [33] SADIR: Shape-Aware Diffusion Models for 3D Image Reconstruction
    Jayakumar, Nivetha
    Hossain, Tonmoy
    Zhang, Miaomiao
    SHAPE IN MEDICAL IMAGING, SHAPEMI 2023, 2023, 14350 : 287 - 300
  • [34] Towards locally and globally shape-aware reverse 3D modeling
    Goyal, Manish
    Murugappan, Sundar
    Piya, Cecil
    Benjamin, William
    Fang, Yi
    Liu, Min
    Ramani, Karthik
    COMPUTER-AIDED DESIGN, 2012, 44 (06) : 537 - 553
  • [35] Real-Time Complex Object 3D Measurement
    Li, Zhongwei
    Shi, Yusheng
    Wang, Congjun
    2009 INTERNATIONAL CONFERENCE ON COMPUTER MODELING AND SIMULATION, PROCEEDINGS, 2009, : 191 - 193
  • [36] MonoCAPE: Monocular 3D object detection with coordinate-aware position embeddings
    Chen, Wenyu
    Chen, Mu
    Fang, Jian
    Zhao, Huaici
    Wang, Guogang
    COMPUTERS & ELECTRICAL ENGINEERING, 2024, 120
  • [37] Shape Priors for Real-Time Monocular Object Localization in Dynamic Environments
    Murthy, J. Krishna
    Sharma, Sarthak
    Krishna, K. Madhava
    2017 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2017, : 1768 - 1774
  • [38] Occlusion-Aware Plane-Constraints for Monocular 3D Object Detection
    Yao, Hongdou
    Chen, Jun
    Wang, Zheng
    Wang, Xiao
    Han, Pengfei
    Chai, Xiaoyu
    Qiu, Yansheng
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (05) : 4593 - 4605
  • [39] MonoGAE: Roadside Monocular 3D Object Detection With Ground-Aware Embeddings
    Yang, Lei
    Zhang, Xinyu
    Yu, Jiaxin
    Li, Jun
    Zhao, Tong
    Wang, Li
    Huang, Yi
    Zhang, Chuang
    Wang, Hong
    Li, Yiming
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (11) : 17587 - 17601
  • [40] VKP-P3D: Real-Time Monocular Pseudo 3D Object Detection Based on Visible Key Points and Camera Geometry
    Sun, Changliang
    Liu, Hongli
    Xiao, Weichu
    Shi, Bo
    Qiu, Yuan
    IEEE ACCESS, 2024, 12 : 41883 - 41895