Homologous multimodal fusion network with geometric constraint keypoints selection for 6D pose estimation

Cited by: 0
Authors
Guo, Yi [1]
Wang, Fei [2]
Ding, Qichuan [2]
Affiliations
[1] Northeastern Univ, Coll Informat Sci & Engn, Shenyang 110004, Liaoning, Peoples R China
[2] Northeastern Univ, Fac Robot Sci & Engn, Shenyang 110004, Liaoning, Peoples R China
Keywords
6D pose estimation; Homologous multimodal fusion; Rotation-invariant; Geometric constraint; Visual grasp; ROBUST; DEPTH;
DOI
10.1016/j.eswa.2024.126022
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Estimating the 6D pose of objects from RGB-D images is a fundamental problem in computer vision, with the primary challenge lying in effectively fusing the two modalities of information: color and depth. In this work, we present a novel homologous multimodal fusion framework for 6D pose estimation from RGB-D images. Unlike existing methods, our approach directly utilizes homologous RGB-D as input to exploit the innate semantic similarity between them through hierarchical global and local feature fusion. This approach avoids the performance loss caused by point cloud transformation. Additionally, we introduce a rotation-invariant residual network and a geometric constraint loss for calculating object keypoints, further enhancing the accuracy and robustness of localization. Extensive comparative experiments and ablation studies validate the effectiveness of the proposed method, achieving state-of-the-art performance on the LineMOD (99.9%), Occlusion-LineMOD (79.2%), and YCB-Video (97.1%) datasets. Finally, we validate the effectiveness of our method through recognition and grasping experiments in cluttered real-world scenarios. Video is available at https://youtu.be/LS_m4N9b5tU.
Pages: 13