Certifiable Object Pose Estimation: Foundations, Learning Models, and Self-Training

被引：3

作者：

Talak, Rajat ^{[1
]}

Peng, Lisa R. ^{[1
,2
]}

Carlone, Luca ^{[1
]}

机构：

[1] MIT, Lab Informat & Decis Syst, Cambridge, MA 02139 USA

[2] Ample, San Francisco, CA 94107 USA

来源：

IEEE TRANSACTIONS ON ROBOTICS | 2023年 / 39卷 / 04期

基金：

美国国家科学基金会;

关键词：

Certifiable models; computer vision; 3D robot vision; object pose estimation; safe perception; self-supervised learning; PREDICTION;

D O I：

10.1109/TRO.2023.3271568

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

In this article, we consider a certifiable object pose estimation problem, where-given a partial point cloud of an object-the goal is to not only estimate the object pose, but also provide a certificate of correctness for the resulting estimate. Our first contribution is a general theory of certification for end-to-end perception models. In particular, we introduce the notion of ?-correctness, which bounds the distance between an estimate and the ground truth. We then show that ?-correctness can be assessed by implementing two certificates: 1) a certificate of observable correctness, which asserts if the model output is consistent with the input data and prior information; and 2) a certificate of nondegeneracy, which asserts whether the input data are sufficient to compute a unique estimate. Our second contribution is to apply this theory and design a new learning-based certifiable pose estimator. In particular, we propose C-3PO, a semantic-keypoint-based pose estimation model, augmented with the two certificates, to solve the certifiable pose estimation problem. C-3PO also includes a keypoint corrector, implemented as a differentiable optimization layer, that can correct large detection errors (e.g., due to the sim-to-real gap). Our third contribution is a novel self-supervised training approach that uses our certificate of observable correctness to provide the supervisory signal to C-3PO during training. In it, the model trains only on the observably correct input-output pairs produced in each batch and at each iteration. As training progresses, we see that the observably correct input-output pairs grow, eventually reaching near 100% in many cases. We conduct extensive experiments to evaluate the performance of the corrector, the certification, and the proposed self-supervised training using the ShapeNet and YCB datasets. The experiments show that 1) standard semantic-keypoint-based methods (which constitute the backbone of C-3PO) outperform more recent alternatives in challenging problem instances; 2) C-3PO further improves performance and significantly outperforms all the baselines; and 3) C-3PO's certificates are able to discern correct pose estimates.(1)

引用

页码：2805 / 2824

页数：20

共 50 条

[1] SLAM-Supported Self-Training for 6D Object Pose Estimation
Lu, Ziqi
Zhang, Yihao
Doherty, Kevin
Severinsen, Odin
Yang, Ethan
Leonard, John
2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 2833 - 2840
[2] Semi-supervised self-training of object detection models
Rosenberg, C
Hebert, M
Schneiderman, H
WACV 2005: SEVENTH IEEE WORKSHOP ON APPLICATIONS OF COMPUTER VISION, PROCEEDINGS, 2005, : 29 - 36
[3] Unsupervised Learning of Object Landmarks via Self-Training Correspondence
Mallis, Dimitrios
Sanchez, Enrique
Bell, Matt
Tzimiropoulos, Georgios
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
[4] Adaptive Self-Training for Object Detection
Vandeghen, Renaud
Louppe, Gilles
Van Droogenbroeck, Marc
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 914 - 923
[5] Sim-to-Real 6D Object Pose Estimation via Iterative Self-training for Robotic Bin Picking
Chen, Kai
Cao, Rui
James, Stephen
Li, Yichuan
Liu, Yun-Hui
Abbeel, Pieter
Dou, Qi
COMPUTER VISION, ECCV 2022, PT XXXIX, 2022, 13699 : 533 - 550
[6] Bridging the Domain Gap in Satellite Pose Estimation: A Self-Training Approach Based on Geometrical Constraints
Wang, Zi
Chen, Minglin
Guo, Yulan
Li, Zhang
Yu, Qifeng
IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2024, 60 (03) : 2500 - 2514
[7] Rethinking the Data Annotation Process for Multi-view 3D Pose Estimation with Active Learning and Self-Training
Feng, Qi
He, Kun
Wen, He
Keskin, Cem
Ye, Yuting
2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 5684 - 5693
[8] Manifold-Aware Self-Training for Unsupervised Domain Adaptation on Regressing 6D Object Pose
Zhang, Yichen
Lin, Jiehong
Chen, Ke
Xu, Zelin
Wang, Yaowei
Jia, Kui
PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 1740 - 1748
[9] Domain Adaptive Object Detection via Balancing Between Self-Training and Adversarial Learning
Munir, Muhammad Akhtar
Khan, Muhammad Haris
Sarfraz, M. Saquib
Ali, Mohsen
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (12) : 14353 - 14365
[10] SSAL: Synergizing between Self-Training and Adversarial Learning for Domain Adaptive Object Detection
Munir, Muhammad Akhtar
Khan, Muhammad Haris
Sarfraz, M. Saquib
Ali, Mohsen
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021,

← 1 2 3 4 5 →