Dual Branch PnP Based Network for Monocular 6D Pose Estimation

被引：2

作者：

Liang, Jia-Yu ^{[1
]}

Zhang, Hong-Bo ^{[1
]}

Lei, Qing ^{[2
]}

Du, Ji-Xiang ^{[3
]}

Lin, Tian-Liang ^{[4
]}

机构：

[1] Huaqiao Univ, Dept Comp Sci & Technol, Xiamen 361000, Peoples R China

[2] Huaqiao Univ, Xiamen Key Lab Comp Vis & Pattern Recognit, Xiamen 361000, Peoples R China

[3] Huaqiao Univ, Fujian Key Lab Big Data Intelligence & Secur, Xiamen 361000, Peoples R China

[4] Coll Mech Engn & Automat, Xiamen 361000, Peoples R China

来源：

INTELLIGENT AUTOMATION AND SOFT COMPUTING | 2023年 / 36卷 / 03期

基金：

中国国家自然科学基金;

关键词：

6D pose; monocular RGB; edge enhancement; dual-branch PnP; 2D-3D correspondence;

D O I：

10.32604/iasc.2023.035812

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Monocular 6D pose estimation is a functional task in the field of com-puter vision and robotics. In recent years, 2D-3D correspondence-based methods have achieved improved performance in multiview and depth data-based scenes. However, for monocular 6D pose estimation, these methods are affected by the prediction results of the 2D-3D correspondences and the robustness of the per-spective-n-point (PnP) algorithm. There is still a difference in the distance from the expected estimation effect. To obtain a more effective feature representation result, edge enhancement is proposed to increase the shape information of the object by analyzing the influence of inaccurate 2 D-3D matching on 6D pose regression and comparing the effectiveness of the intermediate representation. Furthermore, although the transformation matrix is composed of rotation and translation matrices from 3D model points to 2D pixel points, the two variables are essentially different and the same network cannot be used for both variables in the regression process. Therefore, to improve the effectiveness of the PnP algo-rithm, this paper designs a dual-branch PnP network to predict rotation and trans-lation information. Finally, the proposed method is verified on the public LM, LM-O and YCB-Video datasets. The ADD(S) values of the proposed method are 94.2 and 62.84 on the LM and LM-O datasets, respectively. The AUC of ADD(-S) value on YCB-Video is 81.1. These experimental results show that the performance of the proposed method is superior to that of similar methods.

引用

页码：3243 / 3256

页数：14

共 50 条

[31] Robust 6D Pose Estimation Using Dual Active Marker System
Choi, Hyeon-Ju
Kim, Yeong-Bin
Park, Bum Yong
Lee, Dong-Hyun
INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2025, 23 (02) : 382 - 391
[32] YOLO-6D-Pose: Enhancing YOLO for Single-Stage Monocular Multi-Object 6D Pose Estimation
Maji, Debapriya
Nagori, Soyeb
Mathew, Manu
Poddar, Deepak
2024 INTERNATIONAL CONFERENCE IN 3D VISION, 3DV 2024, 2024, : 1616 - 1625
[33] DualPoseNet: Category-level 6D Object Pose and Size Estimation Using Dual Pose Network with Refined Learning of Pose Consistency
Lin, Jiehong
Wei, Zewei
Li, Zhihao
Xu, Songcen
Jia, Kui
Li, Yuanqing
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 3540 - 3549
[34] A Transformer-based multi-modal fusion network for 6D pose estimation
Hong, Jia-Xin
Zhang, Hong-Bo
Liu, Jing-Hua
Lei, Qing
Yang, Li-Jie
Du, Ji-Xiang
INFORMATION FUSION, 2024, 105
[35] TPSFusion: A Transformer-based pyramid screening fusion network for 6D pose estimation
Zhu, Jiaqi
Li, Bin
Zhao, Xinhua
IMAGE AND VISION COMPUTING, 2025, 154
[36] A Lightweight 6D Pose Estimation Network Based on Improved Atrous Spatial Pyramid Pooling
Wang, Fupan
Tang, Xiaohang
Wu, Yadong
Wang, Yinfan
Chen, Huarong
Wang, Guijuan
Liao, Jing
ELECTRONICS, 2024, 13 (07)
[37] DProST: Dynamic Projective Spatial Transformer Network for 6D Pose Estimation
Park, Jaewoo
Cho, Nam Ik
COMPUTER VISION - ECCV 2022, PT VI, 2022, 13666 : 363 - 379
[38] Mobile Robot 6D Pose Estimation Using a Wireless Localization Network
Dobrev, Yassen
Reustle, Christoph
Pavlenko, Tatiana
Cordes, Florian
Vossiek, Martin
2016 IEEE MTT-S INTERNATIONAL CONFERENCE ON MICROWAVES FOR INTELLIGENT MOBILITY (ICMIM), 2016,
[39] An efficient network for category-level 6D object pose estimation
Sun, Shantong
Liu, Rongke
Sun, Shuqiao
Yang, Xinxin
Lu, Guangshan
SIGNAL IMAGE AND VIDEO PROCESSING, 2021, 15 (07) : 1643 - 1651
[40] An efficient network for category-level 6D object pose estimation
Shantong Sun
Rongke Liu
Shuqiao Sun
Xinxin Yang
Guangshan Lu
Signal, Image and Video Processing, 2021, 15 : 1643 - 1651

← 1 2 3 4 5 →