PoseDiffusion: A Coarse-to-Fine Framework for Unseen Object 6-DoF Pose Estimation

被引:2
|
作者
Zhou, Jiaming [1 ,2 ]
Zhu, Qing [1 ,2 ]
Wang, Yaonan [1 ,2 ]
Feng, Mingtao [3 ]
Wu, Chengzhong [4 ]
Liu, Xuebing [1 ,2 ]
Huang, Jianan [1 ,2 ]
Mian, Ajmal [5 ]
机构
[1] Hunan Univ, Coll Elect & Informat Engn, Changsha 410012, Peoples R China
[2] Natl Engn Res Ctr Robot Visual Percept & Control, Changsha 410082, Peoples R China
[3] Xidian Univ, Sch Artificial Intelligence, Xian 710071, Peoples R China
[4] Jiangxi Prov Commun Terminal Ind Co Ltd, Jian 343000, Peoples R China
[5] Univ Western Australia, Dept Comp Sci & Software Engn, Perth, WA 6009, Australia
关键词
Diffusion model; robotic grasping; transformer; unseen object pose estimation;
D O I
10.1109/TII.2024.3399886
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Accurately estimating the six-degrees of freedom (DoF) pose of unseen objects is crucial for successful robotic manipulation in industrial automation. Some existing methods for this task rely on prior knowledge of individual objects, i.e., the model must be trained on the exact object instance or object category. Others perform unseen object pose estimation but are limited in their feature learning and pose refinement ability. To address these problems, we propose an unseen object pose estimation method that follows a coarse-to-fine framework and leverages the powerful learning ability of diffusion models. We introduce a diffusion model for generating object poses, and conduct a comparison between the generated poses and the original pose to determine the optimal one. We design a novel pose estimation module to provide coarse poses for the PoseDiffusion. This module comprises two feature extraction modules that extract global and masked features. In addition, we propose a strategy to estimate the pose by comparing the similarity between rendered and query poses. The renderings of an unseen object from various viewpoints are generated from its computer-aided design (CAD) model. Our method requires a CAD model of the unseen object only during inference, a scenario well suited to industrial applications. Experimental evaluation on benchmark datasets demonstrates that the proposed framework outperforms existing approaches, achieving state-of-the-art performance in six-DoF object pose estimation.
引用
收藏
页码:11127 / 11138
页数:12
相关论文
共 50 条
  • [1] A probabilistic framework for object search with 6-DOF pose estimation
    Ma, Jeremy
    Chung, Timothy H.
    Burdick, Joel
    INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2011, 30 (10): : 1209 - 1228
  • [2] Efficient Monocular Coarse-to-Fine Object Pose Estimation
    Feng, Rong
    Zhang, Hong
    2016 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION, 2016, : 1617 - 1622
  • [3] ZebraPose: Coarse to Fine Surface Encoding for 6DoF Object Pose Estimation
    Su, Yongzhi
    Saleh, Mahdi
    Fetzer, Torben
    Rambach, Jason
    Navab, Nassir
    Busam, Benjamin
    Stricker, Didier
    Tombari, Federico
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 6728 - 6738
  • [4] Object 6-DoF pose estimation using auxiliary learning
    Chen M.
    Gai S.
    Da F.
    Yu J.
    Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2024, 32 (06): : 901 - 914
  • [5] CFVS: Coarse-to-Fine Visual Servoing for 6-DoF Object-Agnostic Peg-In-Hole Assembly
    Lu, Bo-Siang
    Chen, Tung-I
    Lee, Hsin-Ying
    Hsu, Winston H.
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 12402 - 12408
  • [6] Coarse-to-fine Animal Pose and Shape Estimation
    Li, Chen
    Lee, Gim Hee
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [7] Deep object 6-DoF pose estimation using instance segmentation
    Pujolle, Victor
    Hayashi, Eiji
    PROCEEDINGS OF THE 2020 INTERNATIONAL CONFERENCE ON ARTIFICIAL LIFE AND ROBOTICS (ICAROB2020), 2020, : 241 - 244
  • [8] Coarse-to-fine animal pose and shape estimation
    Li, Chen
    Lee, Gim Hee
    arXiv, 2021,
  • [9] RNNPose: 6-DoF Object Pose Estimation via Recurrent Correspondence Field Estimation and Pose Optimization
    Xu, Yan
    Lin, Kwan-Yee
    Zhang, Guofeng
    Wang, Xiaogang
    Li, Hongsheng
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (07) : 4669 - 4683
  • [10] Two-Steps Framework for Highly Accurate 6-DoF Pose Estimation
    Piriyatharawet, Teerawat
    Teo, Wei-De
    Chong, Shin-Horng
    2024 33RD INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS, ISIE 2024, 2024,