PoseDiffusion: A Coarse-to-Fine Framework for Unseen Object 6-DoF Pose Estimation

被引:2
|
作者
Zhou, Jiaming [1 ,2 ]
Zhu, Qing [1 ,2 ]
Wang, Yaonan [1 ,2 ]
Feng, Mingtao [3 ]
Wu, Chengzhong [4 ]
Liu, Xuebing [1 ,2 ]
Huang, Jianan [1 ,2 ]
Mian, Ajmal [5 ]
机构
[1] Hunan Univ, Coll Elect & Informat Engn, Changsha 410012, Peoples R China
[2] Natl Engn Res Ctr Robot Visual Percept & Control, Changsha 410082, Peoples R China
[3] Xidian Univ, Sch Artificial Intelligence, Xian 710071, Peoples R China
[4] Jiangxi Prov Commun Terminal Ind Co Ltd, Jian 343000, Peoples R China
[5] Univ Western Australia, Dept Comp Sci & Software Engn, Perth, WA 6009, Australia
关键词
Diffusion model; robotic grasping; transformer; unseen object pose estimation;
D O I
10.1109/TII.2024.3399886
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Accurately estimating the six-degrees of freedom (DoF) pose of unseen objects is crucial for successful robotic manipulation in industrial automation. Some existing methods for this task rely on prior knowledge of individual objects, i.e., the model must be trained on the exact object instance or object category. Others perform unseen object pose estimation but are limited in their feature learning and pose refinement ability. To address these problems, we propose an unseen object pose estimation method that follows a coarse-to-fine framework and leverages the powerful learning ability of diffusion models. We introduce a diffusion model for generating object poses, and conduct a comparison between the generated poses and the original pose to determine the optimal one. We design a novel pose estimation module to provide coarse poses for the PoseDiffusion. This module comprises two feature extraction modules that extract global and masked features. In addition, we propose a strategy to estimate the pose by comparing the similarity between rendered and query poses. The renderings of an unseen object from various viewpoints are generated from its computer-aided design (CAD) model. Our method requires a CAD model of the unseen object only during inference, a scenario well suited to industrial applications. Experimental evaluation on benchmark datasets demonstrates that the proposed framework outperforms existing approaches, achieving state-of-the-art performance in six-DoF object pose estimation.
引用
收藏
页码:11127 / 11138
页数:12
相关论文
共 50 条
  • [41] OA-Pose: Occlusion-aware monocular 6-DoF object pose estimation under geometry alignment for robot manipulation
    Wang, Jikun
    Luo, Luqing
    Liang, Weixiang
    Yang, Zhi-Xin
    PATTERN RECOGNITION, 2024, 154
  • [42] Toward 6 DOF Object Pose Estimation with Minimum Dataset
    Suzui, Kota
    Yoshiyasu, Yusuke
    Gabas, Antonio
    Kanehiro, Fumio
    Yoshida, Eiichi
    2019 IEEE/SICE INTERNATIONAL SYMPOSIUM ON SYSTEM INTEGRATION (SII), 2019, : 462 - 467
  • [43] Object aspect classification and 6DoF pose estimation
    Dede, Muhammet Ali
    Genc, Yakup
    IMAGE AND VISION COMPUTING, 2022, 124
  • [44] Robust Category-Level 6D Pose Estimation with Coarse-to-Fine Rendering of Neural Features
    Ma, Wufei
    Wang, Angtian
    Yuille, Alan
    Kortylewski, Adam
    COMPUTER VISION, ECCV 2022, PT IX, 2022, 13669 : 492 - 508
  • [45] An automatic framework for estimating the pose of the catheter distal section using a coarse-to-fine network
    Peng, Wenjia
    Wu, Wei
    Zhang, Jingyang
    Xie, Hongzhi
    Zhang, Shuyang
    Gu, Lixu
    Computer Methods and Programs in Biomedicine, 2022, 225
  • [46] An automatic framework for estimating the pose of the catheter distal section using a coarse-to-fine network
    Peng, Wenjia
    Wu, Wei
    Zhang, Jingyang
    Xie, Hongzhi
    Zhang, Shuyang
    Gu, Lixu
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2022, 225
  • [47] Gen6D: Generalizable Model-Free 6-DoF Object Pose Estimation from RGB Images
    Liu, Yuan
    Wen, Yilin
    Peng, Sida
    Lin, Cheng
    Long, Xiaoxiao
    Komura, Taku
    Wang, Wenping
    COMPUTER VISION - ECCV 2022, PT XXXII, 2022, 13692 : 298 - 315
  • [48] Real-Time Facial Pose Estimation and Tracking by Coarse-to-Fine Iterative Optimization
    Yang, Xiaolong
    Jia, Xiaohong
    Yuan, Mengke
    Yan, Dong-Ming
    TSINGHUA SCIENCE AND TECHNOLOGY, 2020, 25 (05) : 690 - 700
  • [49] 6-DOF Vehicle Pose Estimation Considering Lidar Odometry Initial Condition
    Yang, Chanuk
    Huh, Kunsoo
    IEEE ACCESS, 2024, 12 : 77791 - 77799
  • [50] 6-DoF Pose Estimation for Axisymmetric Objects Using Deep Learning with Uncertainty
    Hashimoto, Shintaro
    Hirano, Daichi
    Ishihama, Naoki
    2020 IEEE AEROSPACE CONFERENCE (AEROCONF 2020), 2020,