Dual Branch PnP Based Network for Monocular 6D Pose Estimation

被引:2
|
作者
Liang, Jia-Yu [1 ]
Zhang, Hong-Bo [1 ]
Lei, Qing [2 ]
Du, Ji-Xiang [3 ]
Lin, Tian-Liang [4 ]
机构
[1] Huaqiao Univ, Dept Comp Sci & Technol, Xiamen 361000, Peoples R China
[2] Huaqiao Univ, Xiamen Key Lab Comp Vis & Pattern Recognit, Xiamen 361000, Peoples R China
[3] Huaqiao Univ, Fujian Key Lab Big Data Intelligence & Secur, Xiamen 361000, Peoples R China
[4] Coll Mech Engn & Automat, Xiamen 361000, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
6D pose; monocular RGB; edge enhancement; dual-branch PnP; 2D-3D correspondence;
D O I
10.32604/iasc.2023.035812
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Monocular 6D pose estimation is a functional task in the field of com-puter vision and robotics. In recent years, 2D-3D correspondence-based methods have achieved improved performance in multiview and depth data-based scenes. However, for monocular 6D pose estimation, these methods are affected by the prediction results of the 2D-3D correspondences and the robustness of the per-spective-n-point (PnP) algorithm. There is still a difference in the distance from the expected estimation effect. To obtain a more effective feature representation result, edge enhancement is proposed to increase the shape information of the object by analyzing the influence of inaccurate 2 D-3D matching on 6D pose regression and comparing the effectiveness of the intermediate representation. Furthermore, although the transformation matrix is composed of rotation and translation matrices from 3D model points to 2D pixel points, the two variables are essentially different and the same network cannot be used for both variables in the regression process. Therefore, to improve the effectiveness of the PnP algo-rithm, this paper designs a dual-branch PnP network to predict rotation and trans-lation information. Finally, the proposed method is verified on the public LM, LM-O and YCB-Video datasets. The ADD(S) values of the proposed method are 94.2 and 62.84 on the LM and LM-O datasets, respectively. The AUC of ADD(-S) value on YCB-Video is 81.1. These experimental results show that the performance of the proposed method is superior to that of similar methods.
引用
收藏
页码:3243 / 3256
页数:14
相关论文
共 50 条
  • [31] Robust 6D Pose Estimation Using Dual Active Marker System
    Choi, Hyeon-Ju
    Kim, Yeong-Bin
    Park, Bum Yong
    Lee, Dong-Hyun
    INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2025, 23 (02) : 382 - 391
  • [32] YOLO-6D-Pose: Enhancing YOLO for Single-Stage Monocular Multi-Object 6D Pose Estimation
    Maji, Debapriya
    Nagori, Soyeb
    Mathew, Manu
    Poddar, Deepak
    2024 INTERNATIONAL CONFERENCE IN 3D VISION, 3DV 2024, 2024, : 1616 - 1625
  • [33] DualPoseNet: Category-level 6D Object Pose and Size Estimation Using Dual Pose Network with Refined Learning of Pose Consistency
    Lin, Jiehong
    Wei, Zewei
    Li, Zhihao
    Xu, Songcen
    Jia, Kui
    Li, Yuanqing
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 3540 - 3549
  • [34] A Transformer-based multi-modal fusion network for 6D pose estimation
    Hong, Jia-Xin
    Zhang, Hong-Bo
    Liu, Jing-Hua
    Lei, Qing
    Yang, Li-Jie
    Du, Ji-Xiang
    INFORMATION FUSION, 2024, 105
  • [35] TPSFusion: A Transformer-based pyramid screening fusion network for 6D pose estimation
    Zhu, Jiaqi
    Li, Bin
    Zhao, Xinhua
    IMAGE AND VISION COMPUTING, 2025, 154
  • [36] A Lightweight 6D Pose Estimation Network Based on Improved Atrous Spatial Pyramid Pooling
    Wang, Fupan
    Tang, Xiaohang
    Wu, Yadong
    Wang, Yinfan
    Chen, Huarong
    Wang, Guijuan
    Liao, Jing
    ELECTRONICS, 2024, 13 (07)
  • [37] DProST: Dynamic Projective Spatial Transformer Network for 6D Pose Estimation
    Park, Jaewoo
    Cho, Nam Ik
    COMPUTER VISION - ECCV 2022, PT VI, 2022, 13666 : 363 - 379
  • [38] Mobile Robot 6D Pose Estimation Using a Wireless Localization Network
    Dobrev, Yassen
    Reustle, Christoph
    Pavlenko, Tatiana
    Cordes, Florian
    Vossiek, Martin
    2016 IEEE MTT-S INTERNATIONAL CONFERENCE ON MICROWAVES FOR INTELLIGENT MOBILITY (ICMIM), 2016,
  • [39] An efficient network for category-level 6D object pose estimation
    Sun, Shantong
    Liu, Rongke
    Sun, Shuqiao
    Yang, Xinxin
    Lu, Guangshan
    SIGNAL IMAGE AND VIDEO PROCESSING, 2021, 15 (07) : 1643 - 1651
  • [40] An efficient network for category-level 6D object pose estimation
    Shantong Sun
    Rongke Liu
    Shuqiao Sun
    Xinxin Yang
    Guangshan Lu
    Signal, Image and Video Processing, 2021, 15 : 1643 - 1651