Dual Branch PnP Based Network for Monocular 6D Pose Estimation

Cited by: 2

Authors:

Liang, Jia-Yu [1]

Zhang, Hong-Bo [1]

Lei, Qing [2]

Du, Ji-Xiang [3]

Lin, Tian-Liang [4]

Affiliations:

[1] Huaqiao Univ, Dept Comp Sci & Technol, Xiamen 361000, Peoples R China

[2] Huaqiao Univ, Xiamen Key Lab Comp Vis & Pattern Recognit, Xiamen 361000, Peoples R China

[3] Huaqiao Univ, Fujian Key Lab Big Data Intelligence & Secur, Xiamen 361000, Peoples R China

[4] Coll Mech Engn & Automat, Xiamen 361000, Peoples R China

Source:

INTELLIGENT AUTOMATION AND SOFT COMPUTING | 2023 / Vol. 36 / No. 03

Funding:

National Natural Science Foundation of China;

Keywords:

6D pose; monocular RGB; edge enhancement; dual-branch PnP; 2D-3D correspondence;

DOI:

10.32604/iasc.2023.035812

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

Monocular 6D pose estimation is a functional task in the field of computer vision and robotics. In recent years, 2D-3D correspondence-based methods have achieved improved performance in multiview and depth data-based scenes. However, for monocular 6D pose estimation, these methods are affected by the prediction results of the 2D-3D correspondences and the robustness of the perspective-n-point (PnP) algorithm, and a gap remains between their results and the expected estimation performance. To obtain a more effective feature representation, edge enhancement is proposed to increase the shape information of the object; this choice follows from analyzing the influence of inaccurate 2D-3D matching on 6D pose regression and comparing the effectiveness of intermediate representations. Furthermore, although the transformation matrix from 3D model points to 2D pixel points is composed of rotation and translation matrices, the two variables are essentially different, and the same network cannot be used for both in the regression process. Therefore, to improve the effectiveness of the PnP algorithm, this paper designs a dual-branch PnP network to predict rotation and translation information. Finally, the proposed method is verified on the public LM, LM-O and YCB-Video datasets. The ADD(-S) values of the proposed method are 94.2 and 62.84 on the LM and LM-O datasets, respectively, and the AUC of ADD(-S) on YCB-Video is 81.1. These experimental results show that the proposed method outperforms similar methods.
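
The abstract reports ADD(-S) scores without defining the metric. The sketch below illustrates the definition commonly used in the 6D pose literature, not code from the paper. ADD averages the distance between model points transformed by the ground-truth pose and by the estimated pose. ADD-S, used for symmetric objects, averages the distance from each ground-truth point to its closest estimated point. The function names, the pure-Python point representation, and the 10% of object diameter threshold in `is_correct` are assumptions made for illustration.

```python
import math

def transform(points, R, t):
    """Apply the rigid transform x -> R x + t to each 3D point."""
    return [
        tuple(sum(R[i][j] * p[j] for j in range(3)) + t[i] for i in range(3))
        for p in points
    ]

def add_metric(points, R_gt, t_gt, R_est, t_est):
    """ADD: mean distance between corresponding transformed model points."""
    gt = transform(points, R_gt, t_gt)
    est = transform(points, R_est, t_est)
    return sum(math.dist(a, b) for a, b in zip(gt, est)) / len(points)

def add_s_metric(points, R_gt, t_gt, R_est, t_est):
    """ADD-S: mean distance from each ground-truth point to the closest estimated point."""
    gt = transform(points, R_gt, t_gt)
    est = transform(points, R_est, t_est)
    return sum(min(math.dist(a, b) for b in est) for a in gt) / len(points)

def is_correct(error, diameter, threshold=0.1):
    """Assumed convention: a pose counts as correct if error < threshold * diameter."""
    return error < threshold * diameter

# Toy example: a two-point model that is symmetric under a 180-degree turn about z.
IDENTITY = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]
ROT_Z_180 = [[-1, 0, 0], [0, -1, 0], [0, 0, 1]]
MODEL = [(1, 0, 0), (-1, 0, 0)]
ZERO = (0, 0, 0)

add_error = add_metric(MODEL, IDENTITY, ZERO, ROT_Z_180, ZERO)
add_s_error = add_s_metric(MODEL, IDENTITY, ZERO, ROT_Z_180, ZERO)
```

In the toy example the estimated pose maps the model onto itself with the two points swapped, so ADD penalizes it heavily while ADD-S does not. This is why the combined ADD(-S) score applies ADD-S to symmetric objects and ADD to the rest.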

Citation

Pages: 3243-3256

Page count: 14

