Dual Branch PnP Based Network for Monocular 6D Pose Estimation

被引:2
|
作者
Liang, Jia-Yu [1 ]
Zhang, Hong-Bo [1 ]
Lei, Qing [2 ]
Du, Ji-Xiang [3 ]
Lin, Tian-Liang [4 ]
机构
[1] Huaqiao Univ, Dept Comp Sci & Technol, Xiamen 361000, Peoples R China
[2] Huaqiao Univ, Xiamen Key Lab Comp Vis & Pattern Recognit, Xiamen 361000, Peoples R China
[3] Huaqiao Univ, Fujian Key Lab Big Data Intelligence & Secur, Xiamen 361000, Peoples R China
[4] Coll Mech Engn & Automat, Xiamen 361000, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
6D pose; monocular RGB; edge enhancement; dual-branch PnP; 2D-3D correspondence;
D O I
10.32604/iasc.2023.035812
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Monocular 6D pose estimation is a functional task in the field of com-puter vision and robotics. In recent years, 2D-3D correspondence-based methods have achieved improved performance in multiview and depth data-based scenes. However, for monocular 6D pose estimation, these methods are affected by the prediction results of the 2D-3D correspondences and the robustness of the per-spective-n-point (PnP) algorithm. There is still a difference in the distance from the expected estimation effect. To obtain a more effective feature representation result, edge enhancement is proposed to increase the shape information of the object by analyzing the influence of inaccurate 2 D-3D matching on 6D pose regression and comparing the effectiveness of the intermediate representation. Furthermore, although the transformation matrix is composed of rotation and translation matrices from 3D model points to 2D pixel points, the two variables are essentially different and the same network cannot be used for both variables in the regression process. Therefore, to improve the effectiveness of the PnP algo-rithm, this paper designs a dual-branch PnP network to predict rotation and trans-lation information. Finally, the proposed method is verified on the public LM, LM-O and YCB-Video datasets. The ADD(S) values of the proposed method are 94.2 and 62.84 on the LM and LM-O datasets, respectively. The AUC of ADD(-S) value on YCB-Video is 81.1. These experimental results show that the performance of the proposed method is superior to that of similar methods.
引用
收藏
页码:3243 / 3256
页数:14
相关论文
共 50 条
  • [41] Estimation of 6D Pose of Objects Based on a Variant Adversarial Autoencoder
    Huang, Dan
    Ahn, Hyemin
    Li, Shile
    Hu, Yueming
    Lee, Dongheui
    NEURAL PROCESSING LETTERS, 2023, 55 (07) : 9581 - 9596
  • [42] Reconstruction-based 6D pose estimation for robotic assembly
    Shi, Zhongchen
    Xu, Kai
    Li, Zhang
    Guan, Banglei
    Wang, Gang
    Shang, Yang
    APPLIED OPTICS, 2020, 59 (31) : 9824 - 9835
  • [43] Estimation of 6D Pose of Objects Based on a Variant Adversarial Autoencoder
    Dan Huang
    Hyemin Ahn
    Shile Li
    Yueming Hu
    Dongheui Lee
    Neural Processing Letters, 2023, 55 : 9581 - 9596
  • [44] BDR6D: Bidirectional Deep Residual Fusion Network for 6D Pose Estimation
    Liu, Penglei
    Zhang, Qieshi
    Cheng, Jun
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 21 (02) : 1793 - 1804
  • [45] DON6D: a decoupled one-stage network for 6D pose estimation
    Wang, Zheng
    Tu, Hangyao
    Qian, Yutong
    Zhao, Yanwei
    SCIENTIFIC REPORTS, 2024, 14 (01)
  • [46] FFB6D: A Full Flow Bidirectional Fusion Network for 6D Pose Estimation
    He, Yisheng
    Huang, Haibin
    Fan, Haoqiang
    Chen, Qifeng
    Sun, Jian
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 3002 - 3012
  • [47] EFN6D: an efficient RGB-D fusion network for 6D pose estimation
    Wang Y.
    Jiang X.
    Fujita H.
    Fang Z.
    Qiu X.
    Chen J.
    Journal of Ambient Intelligence and Humanized Computing, 2024, 15 (01) : 75 - 88
  • [48] Single Shot 6D Object Pose Estimation
    Kleeberger, Kilian
    Huber, Marco F.
    2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 6239 - 6245
  • [49] BOP: Benchmark for 6D Object Pose Estimation
    Hodan, Tomas
    Michel, Frank
    Brachmann, Eric
    Kehl, Wadim
    Buch, Anders Glent
    Kraft, Dirk
    Drost, Bertram
    Vidal, Joel
    Ihrke, Stephan
    Zabulis, Xenophon
    Sahin, Caner
    Manhardt, Fabian
    Tombari, Federico
    Kim, Tae-Kyun
    Matas, Jiri
    Rother, Carsten
    COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 : 19 - 35
  • [50] Survey on 6D Pose Estimation of Rigid Object
    Chen, Jiale
    Zhang, Lijun
    Liu, Yi
    Xu, Chi
    PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 7440 - 7445