Homologous multimodal fusion network with geometric constraint keypoints selection for 6D pose estimation

被引:0
|
作者
Guo, Yi [1 ]
Wang, Fei [2 ]
Ding, Qichuan [2 ]
机构
[1] Northeastern Univ, Coll Informat Sci & Engn, Shenyang 110004, Liaoning, Peoples R China
[2] Northeastern Univ, Fac Robot Sci & Engn, Shenyang 110004, Liaoning, Peoples R China
关键词
6D pose estimation; Homologous multimodal fusion; Rotation-invariant; Geometric constraint; Visual grasp; ROBUST; DEPTH;
D O I
10.1016/j.eswa.2024.126022
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Estimating the 6D pose of objects from RGB-D images is a fundamental problem in computer vision, with the primary challenge lying ineffectively fusing these two modalities of information: color and depth. In this work, we present a novel homologous multimodal fusion framework for 6D pose estimation from RGBD images. Unlike existing methods, our approach directly utilizes homologous RGB-D as input to exploit the innate semantic similarity between them through hierarchical global and local feature fusion. This approach avoids performance loss caused by point cloud transformation. Additionally, we introduce a rotation- invariant residual network and geometric constraint loss for calculating object keypoints, further enhancing the accuracy and robustness of localization. Extensive comparative experiments and ablation studies validate the effectiveness of the proposed method, achieving state-of-the-art performance on the LineMOD (99.9%), Occlusion-LineMOD (79.2%), and YCB-Video datasets (97.1%). Finally, we validate the effectiveness of our method through recognition and grasping experiments in cluttered real-world scenarios. Video is available at https://youtu.be/LS_m4N9b5tU.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Deep Fusion for Multi-Modal 6D Pose Estimation
    Lin, Shifeng
    Wang, Zunran
    Zhang, Shenghao
    Ling, Yonggen
    Yang, Chenguang
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 21 (04) : 6540 - 6549
  • [32] 6D Object Pose Estimation With Color/Geometry Attention Fusion
    Yuan, Honglin
    Veltkamp, Remco C.
    16TH IEEE INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV 2020), 2020, : 529 - 535
  • [33] Complex Human Pose Estimation via Keypoints Association Constraint Network
    Zhu, Xuan
    Guo, Zhenpeng
    Liu, Xin
    Li, Bin
    Peng, Jinye
    Chen, Peirong
    Wang, Rongzhi
    IEEE ACCESS, 2020, 8 : 205938 - 205947
  • [34] Shape-Constraint Recurrent Flow for 6D Object Pose Estimation
    Hai, Yang
    Song, Rui
    Li, Jiaojiao
    Hu, Yinlin
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 4831 - 4840
  • [35] TGNF-Net: Two-Stage Geometric Neighborhood Fusion Network for Category-Level 6D Pose Estimation
    Zhao, Xiaolong
    Yan, Feihu
    Zhao, Guangzhe
    Wang, Caiyong
    INFORMATION, 2025, 16 (02)
  • [36] Selective Embedding with Gated Fusion for 6D Object Pose Estimation
    Sun, Shantong
    Liu, Rongke
    Du, Qiuchen
    Sun, Shuqiao
    NEURAL PROCESSING LETTERS, 2020, 51 (03) : 2417 - 2436
  • [37] CMFF6D: Cross-modality multiscale feature fusion network for 6D pose estimation
    Han, Zongwang
    Chen, Long
    Wu, Shiqing
    NEUROCOMPUTING, 2025, 623
  • [38] Revisiting Fully Convolutional Geometric Features for Object 6D Pose Estimation
    Corsetti, Jaime
    Boscaini, Davide
    Poiesi, Fabio
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 2095 - 2104
  • [39] Learning stereopsis from geometric synthesis for 6D object pose estimation
    State Key Laboratory of Industrial Control Technology and Institue of Cyber-Systems and Control, Zhejiang University, Zhejiang, China
    arXiv, 1600,
  • [40] A Projective Geometric View for 6D Pose Estimation in mmWave MIMO Systems
    Shen, Shengqiang
    Wymeersch, Henk
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (08) : 9144 - 9159