Homologous multimodal fusion network with geometric constraint keypoints selection for 6D pose estimation

被引:0
|
作者
Guo, Yi [1 ]
Wang, Fei [2 ]
Ding, Qichuan [2 ]
机构
[1] Northeastern Univ, Coll Informat Sci & Engn, Shenyang 110004, Liaoning, Peoples R China
[2] Northeastern Univ, Fac Robot Sci & Engn, Shenyang 110004, Liaoning, Peoples R China
关键词
6D pose estimation; Homologous multimodal fusion; Rotation-invariant; Geometric constraint; Visual grasp; ROBUST; DEPTH;
D O I
10.1016/j.eswa.2024.126022
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Estimating the 6D pose of objects from RGB-D images is a fundamental problem in computer vision, with the primary challenge lying ineffectively fusing these two modalities of information: color and depth. In this work, we present a novel homologous multimodal fusion framework for 6D pose estimation from RGBD images. Unlike existing methods, our approach directly utilizes homologous RGB-D as input to exploit the innate semantic similarity between them through hierarchical global and local feature fusion. This approach avoids performance loss caused by point cloud transformation. Additionally, we introduce a rotation- invariant residual network and geometric constraint loss for calculating object keypoints, further enhancing the accuracy and robustness of localization. Extensive comparative experiments and ablation studies validate the effectiveness of the proposed method, achieving state-of-the-art performance on the LineMOD (99.9%), Occlusion-LineMOD (79.2%), and YCB-Video datasets (97.1%). Finally, we validate the effectiveness of our method through recognition and grasping experiments in cluttered real-world scenarios. Video is available at https://youtu.be/LS_m4N9b5tU.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] A Pose Proposal and Refinement Network for Better 6D Object Pose Estimation
    Trabelsi, Ameni
    Chaabane, Mohamed
    Blanchard, Nathaniel
    Beveridge, Ross
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, : 2381 - 2390
  • [22] MPF6D: masked pyramid fusion 6D pose estimation
    Pereira, Nuno
    Alexandre, Luis A.
    PATTERN ANALYSIS AND APPLICATIONS, 2023, 26 (03) : 1363 - 1373
  • [23] PointPoseNet: Point Pose Network for Robust 6D Object Pose Estimation
    Chen, Wei
    Duan, Jinming
    Basevi, Hector
    Chang, Hyung Jin
    Leonardis, Ales
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 2813 - 2822
  • [24] MagicCubePose, A more comprehensive 6D pose estimation network
    Li, Fudong
    Gao, Dongyang
    Huang, Qiang
    Li, Wei
    Yang, Yuequan
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [25] Graph neural network for 6D object pose estimation
    Yin, Pengshuai
    Ye, Jiayong
    Lin, Guoshen
    Wu, Qingyao
    KNOWLEDGE-BASED SYSTEMS, 2021, 218
  • [26] Generalizable and Accurate 6D Object Pose Estimation Network
    Fu, Shouxu
    Li, Xiaoning
    Yu, Xiangdong
    Cao, Lu
    Li, Xingxing
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT III, 2024, 14427 : 312 - 324
  • [27] A Transformer-based multi-modal fusion network for 6D pose estimation
    Hong, Jia-Xin
    Zhang, Hong-Bo
    Liu, Jing-Hua
    Lei, Qing
    Yang, Li-Jie
    Du, Ji-Xiang
    INFORMATION FUSION, 2024, 105
  • [28] TPSFusion: A Transformer-based pyramid screening fusion network for 6D pose estimation
    Zhu, Jiaqi
    Li, Bin
    Zhao, Xinhua
    IMAGE AND VISION COMPUTING, 2025, 154
  • [29] DenseFusion: 6D Object Pose Estimation by Iterative Dense Fusion
    Wang, Chen
    Xu, Danfei
    Zhu, Yuke
    Martin-Martin, Roberto
    Lu, Cewu
    Li Fei-Fei
    Savarese, Silvio
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3338 - 3347
  • [30] Selective Embedding with Gated Fusion for 6D Object Pose Estimation
    Shantong Sun
    Rongke Liu
    Qiuchen Du
    Shuqiao Sun
    Neural Processing Letters, 2020, 51 : 2417 - 2436