Certifiable Object Pose Estimation: Foundations, Learning Models, and Self-Training

被引:3
|
作者
Talak, Rajat [1 ]
Peng, Lisa R. [1 ,2 ]
Carlone, Luca [1 ]
机构
[1] MIT, Lab Informat & Decis Syst, Cambridge, MA 02139 USA
[2] Ample, San Francisco, CA 94107 USA
基金
美国国家科学基金会;
关键词
Certifiable models; computer vision; 3D robot vision; object pose estimation; safe perception; self-supervised learning; PREDICTION;
D O I
10.1109/TRO.2023.3271568
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
In this article, we consider a certifiable object pose estimation problem, where-given a partial point cloud of an object-the goal is to not only estimate the object pose, but also provide a certificate of correctness for the resulting estimate. Our first contribution is a general theory of certification for end-to-end perception models. In particular, we introduce the notion of ?-correctness, which bounds the distance between an estimate and the ground truth. We then show that ?-correctness can be assessed by implementing two certificates: 1) a certificate of observable correctness, which asserts if the model output is consistent with the input data and prior information; and 2) a certificate of nondegeneracy, which asserts whether the input data are sufficient to compute a unique estimate. Our second contribution is to apply this theory and design a new learning-based certifiable pose estimator. In particular, we propose C-3PO, a semantic-keypoint-based pose estimation model, augmented with the two certificates, to solve the certifiable pose estimation problem. C-3PO also includes a keypoint corrector, implemented as a differentiable optimization layer, that can correct large detection errors (e.g., due to the sim-to-real gap). Our third contribution is a novel self-supervised training approach that uses our certificate of observable correctness to provide the supervisory signal to C-3PO during training. In it, the model trains only on the observably correct input-output pairs produced in each batch and at each iteration. As training progresses, we see that the observably correct input-output pairs grow, eventually reaching near 100% in many cases. We conduct extensive experiments to evaluate the performance of the corrector, the certification, and the proposed self-supervised training using the ShapeNet and YCB datasets. The experiments show that 1) standard semantic-keypoint-based methods (which constitute the backbone of C-3PO) outperform more recent alternatives in challenging problem instances; 2) C-3PO further improves performance and significantly outperforms all the baselines; and 3) C-3PO's certificates are able to discern correct pose estimates.(1)
引用
收藏
页码:2805 / 2824
页数:20
相关论文
共 50 条
  • [1] SLAM-Supported Self-Training for 6D Object Pose Estimation
    Lu, Ziqi
    Zhang, Yihao
    Doherty, Kevin
    Severinsen, Odin
    Yang, Ethan
    Leonard, John
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 2833 - 2840
  • [2] Semi-supervised self-training of object detection models
    Rosenberg, C
    Hebert, M
    Schneiderman, H
    WACV 2005: SEVENTH IEEE WORKSHOP ON APPLICATIONS OF COMPUTER VISION, PROCEEDINGS, 2005, : 29 - 36
  • [3] Unsupervised Learning of Object Landmarks via Self-Training Correspondence
    Mallis, Dimitrios
    Sanchez, Enrique
    Bell, Matt
    Tzimiropoulos, Georgios
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [4] Adaptive Self-Training for Object Detection
    Vandeghen, Renaud
    Louppe, Gilles
    Van Droogenbroeck, Marc
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 914 - 923
  • [5] Sim-to-Real 6D Object Pose Estimation via Iterative Self-training for Robotic Bin Picking
    Chen, Kai
    Cao, Rui
    James, Stephen
    Li, Yichuan
    Liu, Yun-Hui
    Abbeel, Pieter
    Dou, Qi
    COMPUTER VISION, ECCV 2022, PT XXXIX, 2022, 13699 : 533 - 550
  • [6] Bridging the Domain Gap in Satellite Pose Estimation: A Self-Training Approach Based on Geometrical Constraints
    Wang, Zi
    Chen, Minglin
    Guo, Yulan
    Li, Zhang
    Yu, Qifeng
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2024, 60 (03) : 2500 - 2514
  • [7] Rethinking the Data Annotation Process for Multi-view 3D Pose Estimation with Active Learning and Self-Training
    Feng, Qi
    He, Kun
    Wen, He
    Keskin, Cem
    Ye, Yuting
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 5684 - 5693
  • [8] Manifold-Aware Self-Training for Unsupervised Domain Adaptation on Regressing 6D Object Pose
    Zhang, Yichen
    Lin, Jiehong
    Chen, Ke
    Xu, Zelin
    Wang, Yaowei
    Jia, Kui
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 1740 - 1748
  • [9] Domain Adaptive Object Detection via Balancing Between Self-Training and Adversarial Learning
    Munir, Muhammad Akhtar
    Khan, Muhammad Haris
    Sarfraz, M. Saquib
    Ali, Mohsen
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (12) : 14353 - 14365
  • [10] SSAL: Synergizing between Self-Training and Adversarial Learning for Domain Adaptive Object Detection
    Munir, Muhammad Akhtar
    Khan, Muhammad Haris
    Sarfraz, M. Saquib
    Ali, Mohsen
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021,