Augmented Autoencoders: Implicit 3D Orientation Learning for 6D Object Detection

被引:0
|
作者
Martin Sundermeyer
Zoltan-Csaba Marton
Maximilian Durner
Rudolph Triebel
机构
[1] German Aerospace Center (DLR),
[2] Technical University of Munich,undefined
来源
International Journal of Computer Vision | 2020年 / 128卷
关键词
6D object detection; Pose estimation; Domain randomization; Autoencoder; Synthetic data; Symmetries;
D O I
暂无
中图分类号
学科分类号
摘要
We propose a real-time RGB-based pipeline for object detection and 6D pose estimation. Our novel 3D orientation estimation is based on a variant of the Denoising Autoencoder that is trained on simulated views of a 3D model using Domain Randomization. This so-called Augmented Autoencoder has several advantages over existing methods: It does not require real, pose-annotated training data, generalizes to various test sensors and inherently handles object and view symmetries. Instead of learning an explicit mapping from input images to object poses, it provides an implicit representation of object orientations defined by samples in a latent space. Our pipeline achieves state-of-the-art performance on the T-LESS dataset both in the RGB and RGB-D domain. We also evaluate on the LineMOD dataset where we can compete with other synthetically trained approaches. We further increase performance by correcting 3D orientation estimates to account for perspective errors when the object deviates from the image center and show extended results. Our code is available here https://github.com/DLR-RM/AugmentedAutoencoder.
引用
收藏
页码:714 / 729
页数:15
相关论文
共 50 条
  • [21] 3D Mapping and 6D Pose Computation for Real Time Augmented Reality on Cylindrical Objects
    Tang, Fulin
    Wu, Yihong
    Hou, Xiaohui
    Ling, Haibin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (09) : 2887 - 2899
  • [22] Implicit learning in 3D object recognition: The importance of temporal context
    Becker, S
    NEURAL COMPUTATION, 1999, 11 (02) : 347 - 374
  • [23] Unsupervised Joint 3D Object Model Learning and 6D Pose Estimation for Depth-Based Instance Segmentation
    Wu, Yuanwei
    Marks, Tim K.
    Cherian, Anoop
    Chen, Siheng
    Feng, Chen
    Wang, Guanghui
    Sullivan, Alan
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 2777 - 2786
  • [24] Joint stereo 3D object detection and implicit surface reconstruction
    Li, Shichao
    Huang, Xijie
    Liu, Zechun
    Cheng, Kwang-Ting
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [25] Weighted Unsupervised Learning for 3D Object Detection
    Kowsari, Kamran
    Alassaf, Manal H.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2016, 7 (01) : 584 - 593
  • [26] Learning Occupancy for Monocular 3D Object Detection
    Peng, Liang
    Xu, Junkai
    Cheng, Haoran
    Yang, Zheng
    Wu, Xiaopei
    Qian, Wei
    Wang, Wenxiao
    Wu, Boxi
    Cai, Deng
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 10281 - 10292
  • [27] Learning 2D to 3D Lifting for Object Detection in 3D for Autonomous Vehicles
    Srivastava, Siddharth
    Jurie, Frederic
    Sharma, Gaurav
    2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 4504 - 4511
  • [28] Omni6D: Large-Vocabulary 3D Object Dataset for Category-Level 6D Object Pose Estimation
    Zhang, Mengchen
    Wu, Tong
    Wang, Tai
    Wang, Tengfei
    Liu, Ziwei
    Lin, Dahua
    COMPUTER VISION - ECCV 2024, PT XXV, 2025, 15083 : 216 - 232
  • [29] TrackAgent: 6D Object Tracking via Reinforcement Learning
    Roehrl, Konstantin
    Bauer, Dominik
    Patten, Timothy
    Vincze, Markus
    COMPUTER VISION SYSTEMS, ICVS 2023, 2023, 14253 : 323 - 335
  • [30] 6D stereo: A robust and rapid approach to object detection
    Lorei, Marcus
    Wurz-Wessel, Alexander
    Heger, Thomas
    VDI Berichte, 2008, (2038): : 85 - 92