Certifiable Object Pose Estimation: Foundations, Learning Models, and Self-Training

被引：3

作者：

Talak, Rajat ^{[1
]}

Peng, Lisa R. ^{[1
,2
]}

Carlone, Luca ^{[1
]}

机构：

[1] MIT, Lab Informat & Decis Syst, Cambridge, MA 02139 USA

[2] Ample, San Francisco, CA 94107 USA

来源：

IEEE TRANSACTIONS ON ROBOTICS | 2023年 / 39卷 / 04期

基金：

美国国家科学基金会;

关键词：

Certifiable models; computer vision; 3D robot vision; object pose estimation; safe perception; self-supervised learning; PREDICTION;

D O I：

10.1109/TRO.2023.3271568

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

In this article, we consider a certifiable object pose estimation problem, where-given a partial point cloud of an object-the goal is to not only estimate the object pose, but also provide a certificate of correctness for the resulting estimate. Our first contribution is a general theory of certification for end-to-end perception models. In particular, we introduce the notion of ?-correctness, which bounds the distance between an estimate and the ground truth. We then show that ?-correctness can be assessed by implementing two certificates: 1) a certificate of observable correctness, which asserts if the model output is consistent with the input data and prior information; and 2) a certificate of nondegeneracy, which asserts whether the input data are sufficient to compute a unique estimate. Our second contribution is to apply this theory and design a new learning-based certifiable pose estimator. In particular, we propose C-3PO, a semantic-keypoint-based pose estimation model, augmented with the two certificates, to solve the certifiable pose estimation problem. C-3PO also includes a keypoint corrector, implemented as a differentiable optimization layer, that can correct large detection errors (e.g., due to the sim-to-real gap). Our third contribution is a novel self-supervised training approach that uses our certificate of observable correctness to provide the supervisory signal to C-3PO during training. In it, the model trains only on the observably correct input-output pairs produced in each batch and at each iteration. As training progresses, we see that the observably correct input-output pairs grow, eventually reaching near 100% in many cases. We conduct extensive experiments to evaluate the performance of the corrector, the certification, and the proposed self-supervised training using the ShapeNet and YCB datasets. The experiments show that 1) standard semantic-keypoint-based methods (which constitute the backbone of C-3PO) outperform more recent alternatives in challenging problem instances; 2) C-3PO further improves performance and significantly outperforms all the baselines; and 3) C-3PO's certificates are able to discern correct pose estimates.(1)

引用

页码：2805 / 2824

页数：20

共 50 条

[31] Learning to Classify Skin Lesions via Self-Training and Self-Paced Learning
Asare, Sarpong Kwadwo
You, Fei
Nartey, Obed Tettey
2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 963 - 967
[32] Appearance Based Object Pose Estimation Using Regression Models
Saito, Mamoru
Kitaguchi, Katsuhisa
PROCEEDINGS OF THE SECOND INTERNATIONAL SYMPOSIUM ON TEST AUTOMATION AND INSTRUMENTATION, VOL 4, 2008, : 1987 - 1991
[33] A Study on Systematic Improvement of Transformer Models for Object Pose Estimation
Lee, Jungwoo
Suh, Jinho
SENSORS, 2025, 25 (04)
[34] Appearance Based Object Pose Estimation Using Regression Models
Saito, Mamoru
Kitaguchi, Katsuhisa
2008 PROCEEDINGS OF SICE ANNUAL CONFERENCE, VOLS 1-7, 2008, : 1844 - 1847
[35] Multi-Task Self-Training for Learning General Representations
Ghiasi, Golnaz
Zoph, Barret
Cubuk, Ekin D.
Quoc V Le
Lin, Tsung-Yi
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 8836 - 8845
[36] Neighborhood-Regularized Self-Training for Learning with Few Labels
Xu, Ran
Yu, Yue
Cui, Hejie
Kan, Xuan
Zhu, Yanqiao
Ho, Joyce
Zhang, Chao
Yang, Carl
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 9, 2023, : 10611 - 10619
[37] Dynamic self-training with less uncertainty for graph imbalance learning
Juan, Xin
Peng, Meixin
Wang, Xin
EXPERT SYSTEMS WITH APPLICATIONS, 2025, 271
[38] Deep unsupervised shadow detection with curriculum learning and self-training
Zhang, Qiang
Guo, Hongyuan
Li, Guanghe
Zhang, Tianlu
Jiao, Qiang
COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 248
[39] Semi-supervised Continual Learning with Meta Self-training
Ho, Stella
Liu, Ming
Du, Lan
Li, Yunfeng
Gao, Longxiang
Gao, Shang
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 4024 - 4028
[40] Synthetic Learning Set for Object Pose Estimation: Initial Experiments
Lee, Joo-Haeng
Yun, Woo-Han
Lee, Jaeyeon
Kim, Jaehong
2017 14TH INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS AND AMBIENT INTELLIGENCE (URAI), 2017, : 106 - 108

← 1 2 3 4 5 →