Homologous multimodal fusion network with geometric constraint keypoints selection for 6D pose estimation

被引:0
|
作者
Guo, Yi [1 ]
Wang, Fei [2 ]
Ding, Qichuan [2 ]
机构
[1] Northeastern Univ, Coll Informat Sci & Engn, Shenyang 110004, Liaoning, Peoples R China
[2] Northeastern Univ, Fac Robot Sci & Engn, Shenyang 110004, Liaoning, Peoples R China
关键词
6D pose estimation; Homologous multimodal fusion; Rotation-invariant; Geometric constraint; Visual grasp; ROBUST; DEPTH;
D O I
10.1016/j.eswa.2024.126022
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Estimating the 6D pose of objects from RGB-D images is a fundamental problem in computer vision, with the primary challenge lying ineffectively fusing these two modalities of information: color and depth. In this work, we present a novel homologous multimodal fusion framework for 6D pose estimation from RGBD images. Unlike existing methods, our approach directly utilizes homologous RGB-D as input to exploit the innate semantic similarity between them through hierarchical global and local feature fusion. This approach avoids performance loss caused by point cloud transformation. Additionally, we introduce a rotation- invariant residual network and geometric constraint loss for calculating object keypoints, further enhancing the accuracy and robustness of localization. Extensive comparative experiments and ablation studies validate the effectiveness of the proposed method, achieving state-of-the-art performance on the LineMOD (99.9%), Occlusion-LineMOD (79.2%), and YCB-Video datasets (97.1%). Finally, we validate the effectiveness of our method through recognition and grasping experiments in cluttered real-world scenarios. Video is available at https://youtu.be/LS_m4N9b5tU.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Orientation Keypoints for 6D Human Pose Estimation
    Fisch, Martin
    Clark, Ronald
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (12) : 10145 - 10158
  • [2] GCCN: Geometric Constraint Co-attention Network for 6D Object Pose Estimation
    Wen, Yongming
    Fang, Yiquan
    Cai, Junhao
    Tung, Kimwa
    Cheng, Hui
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 2671 - 2679
  • [3] 6D Pose Estimation with Correlation Fusion
    Cheng, Yi
    Zhu, Hongyuan
    Sun, Ying
    Acar, Cihan
    Jing, Wei
    Wu, Yan
    Li, Liyuan
    Tan, Cheston
    Lim, Joo-Hwee
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 2988 - 2994
  • [4] Deep Fusion Transformer Network with Weighted Vector-Wise Keypoints Voting for Robust 6D Object Pose Estimation
    Zhou, Jun
    Chen, Kai
    Xu, Linlin
    Dou, Qi
    Qin, Jing
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 13921 - 13931
  • [5] Multi-View Keypoints for Reliable 6D Object Pose Estimation
    Li, Alan
    Schoellig, Angela P.
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 6988 - 6994
  • [6] Shape Enhanced Keypoints Learning with Geometric Prior for 6D Object Pose Tracking
    Majcher, Mateusz
    Kwolek, Bogdan
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 2985 - 2991
  • [7] 6D Object Pose Estimation Using Keypoints and Part Affinity Fields
    Zappel, Moritz
    Bultmann, Simon
    Behnke, Sven
    ROBOT WORLD CUP XXIV, ROBOCUP 2021, 2022, 13132 : 78 - 90
  • [8] A modal fusion network with dual attention mechanism for 6D pose estimation
    Wei, Liangrui
    Xie, Feifei
    Sun, Lin
    Chen, Jinpeng
    Zhang, Zhipeng
    VISUAL COMPUTER, 2024, 40 (10): : 7411 - 7425
  • [9] The 6D Pose Estimation of the Aircraft Using Geometric Property
    Fu, Daoyong
    Han, Songchen
    Liang, Binbin
    Li, Wei
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (07) : 3358 - 3368
  • [10] BDR6D: Bidirectional Deep Residual Fusion Network for 6D Pose Estimation
    Liu, Penglei
    Zhang, Qieshi
    Cheng, Jun
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 21 (02) : 1793 - 1804