AutoShape: Real-Time Shape-Aware Monocular 3D Object Detection

被引:64
|
作者
Liu, Zongdai [1 ]
Zhou, Dingfu [1 ]
Lu, Feixiang [1 ]
Fang, Jin [1 ]
Zhang, Liangjun [1 ]
机构
[1] Baidu Res, Natl Engn Lab Deep Learning Technol & Applicat, Robot & Autonomous Driving Lab, Beijing, Peoples R China
关键词
ACCURATE;
D O I
10.1109/ICCV48922.2021.01535
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing deep learning-based approaches for monocular 3D object detection in autonomous driving often model the object as a rotated 3D cuboid while the object's geometric shape has been ignored. In this work, we propose an approach for incorporating the shape-aware 2D/3D constraints into the 3D detection framework. Specifically, we employ the deep neural network to learn distinguished 2D keypoints in the 2D image domain and regress their corresponding 3D coordinates in the local 3D object coordinate first. Then the 2D/3D geometric constraints are built by these correspondences for each object to boost the detection performance. For generating the ground truth of 2D/3D keypoints, an automatic model-fitting approach has been proposed by fitting the deformed 3D object model and the object mask in the 2D image. The proposed framework has been verified on the public KITTI dataset and the experimental results demonstrate that by using additional geometrical constraints the detection performance has been significantly improved as compared to the baseline method. More importantly, the proposed framework achieves state-of-the-art performance with real time. Data and code will be available at https://github.com/ zongdai/AutoShape
引用
收藏
页码:15621 / 15630
页数:10
相关论文
共 50 条
  • [41] SACINet: Semantic-Aware Cross-Modal Interaction Network for Real-Time 3D Object Detection
    Yang, Ying
    Yin, Hui
    Chong, Ai-Xin
    Wan, Jin
    Liu, Qing-Yi
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (02): : 3917 - 3927
  • [42] PlanarRecon: Real-time 3D Plane Detection and Reconstruction from Posed Monocular Videos
    Xie, Yiming
    Gadelha, Matheus
    Yang, Fengting
    Zhou, Xiaowei
    Jiang, Huaizu
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 6209 - 6218
  • [43] Real-time monocular object SLAM
    Galvez-Lopez, Dorian
    Salas, Marta
    Tardos, Juan D.
    Montiel, J. M. M.
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2016, 75 : 435 - 449
  • [44] Risk-aware Real-time Object Detection
    Santana, Misael Alpizar
    Calinescu, Radu
    Paterson, Colin
    2022 18TH EUROPEAN DEPENDABLE COMPUTING CONFERENCE (EDCC 2022), 2022, : 105 - 108
  • [45] Real-time 3D features reconstruction through monocular vision
    Liverani, Alfredo
    Leali, Francesco
    Pellicciari, Marcello
    INTERNATIONAL JOURNAL OF INTERACTIVE DESIGN AND MANUFACTURING - IJIDEM, 2010, 4 (02): : 103 - 112
  • [46] Real-time monocular 3D perception with ORB-Features
    Ji, Babing
    Cao, Qixin
    INDUSTRIAL ROBOT-THE INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH AND APPLICATION, 2018, 45 (06): : 776 - 783
  • [47] DeepSDP: A Real-Time Deep Stereo Detection and Positioning Method for 3D Object Detection
    Moradi, Homayoun
    Karami, Mohammad
    Shamaghdari, Saeed
    2020 28TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2020, : 1309 - 1313
  • [48] Real-Time 3D Reconstruction Method Based on Monocular Vision
    Jia, Qingyu
    Chang, Liang
    Qiang, Baohua
    Zhang, Shihao
    Xie, Wu
    Yang, Xianyi
    Sun, Yangchang
    Yang, Minghao
    SENSORS, 2021, 21 (17)
  • [49] Real-time active 3D shape reconstruction for 3D video
    Wu, X
    Matsuyama, T
    ISPA 2003: PROCEEDINGS OF THE 3RD INTERNATIONAL SYMPOSIUM ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS, PTS 1 AND 2, 2003, : 186 - 191
  • [50] Probabilistic instance shape reconstruction with sparse LiDAR for monocular 3D object detection
    Ji, Chaofeng
    Wu, Han
    Liu, Guizhong
    NEUROCOMPUTING, 2023, 529 : 92 - 100