Generative Category-Level Shape and Pose Estimation with Semantic Primitives

被引:0
|
作者
Li, Guanglin [1 ,2 ,4 ]
Li, Yifeng [2 ]
Ye, Zhichao [1 ]
Zhang, Qihang [3 ]
Kong, Tao [2 ]
Cui, Zhaopeng [1 ]
Zhang, Guofeng [1 ]
机构
[1] Zhejiang Univ, State Key Lab CAD&CG, Hangzhou, Zhejiang, Peoples R China
[2] Chinese Univ Hong Kong, ByteDance AI Lab, Hong Kong, Peoples R China
[3] Chinese Univ Hong Kong, Multimedia Lab, Hong Kong, Peoples R China
[4] ByteDance, Hong Kong, Peoples R China
来源
关键词
Category-level Pose Estimation; Shape Estimation;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Empowering autonomous agents with 3D understanding for daily objects is a grand challenge in robotics applications. When exploring in an unknown environment, existing methods for object pose estimation are still not satisfactory due to the diversity of object shapes. In this paper, we propose a novel framework for category-level object shape and pose estimation from a single RGB-D image. To handle the intra-category variation, we adopt a semantic primitive representation that encodes diverse shapes into a unified latent space, which is the key to establish reliable correspondences between observed point clouds and estimated shapes. Then, by using a SIM(3)-invariant shape descriptor, we gracefully decouple the shape and pose of an object, thus supporting latent shape optimization of target objects in arbitrary poses. Extensive experiments show that the proposed method achieves SOTA pose estimation performance and better generalization in the real-world dataset. Code and video are available at https://zju3dv.github.io/gCasp.
引用
收藏
页码:1390 / 1400
页数:11
相关论文
共 50 条
  • [31] Category-Level Object Detection, Pose Estimation and Reconstruction from Stereo Images
    Zhang, Chuanrui
    Ling, Yonggen
    Lu, Minglei
    Qin, Minghan
    Wang, Haoqian
    COMPUTER VISION - ECCV 2024, PT XXXIV, 2025, 15092 : 332 - 349
  • [32] An efficient network for category-level 6D object pose estimation
    Shantong Sun
    Rongke Liu
    Shuqiao Sun
    Xinxin Yang
    Guangshan Lu
    Signal, Image and Video Processing, 2021, 15 : 1643 - 1651
  • [33] GeoReF: Geometric Alignment Across Shape Variation for Category-level Object Pose Refinement
    Zheng, Linfang
    Tse, Tze Ho Elden
    Wang, Chen
    Sun, Yinghan
    Chen, Hua
    Leonardis, Ales
    Zhang, Wei
    Chang, Hyung Jin
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 10693 - 10703
  • [34] Fine segmentation and difference-aware shape adjustment for category-level 6DoF object pose estimation
    Chongpei Liu
    Wei Sun
    Jian Liu
    Xing Zhang
    Shimeng Fan
    Qiang Fu
    Applied Intelligence, 2023, 53 : 23711 - 23728
  • [35] Bi-directional attention based RGB-D fusion for category-level object pose and shape estimation
    Tang, Kaifeng
    Xu, Chi
    Chen, Ming
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (17) : 53043 - 53063
  • [36] Fine segmentation and difference-aware shape adjustment for category-level 6DoF object pose estimation
    Liu, Chongpei
    Sun, Wei
    Liu, Jian
    Zhang, Xing
    Fan, Shimeng
    Fu, Qiang
    APPLIED INTELLIGENCE, 2023, 53 (20) : 23711 - 23728
  • [37] Self-Supervised Category-Level 6D Object Pose Estimation with Deep Implicit Shape Representation
    Peng, Wanli
    Yan, Jianhang
    Wen, Hongtao
    Sun, Yi
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 2082 - 2090
  • [38] UDA-COPE: Unsupervised Domain Adaptation for Category-level Object Pose Estimation
    Lee, Taeyeop
    Lee, Byeong-Uk
    Shin, Inkyu
    Choe, Jaesung
    Shin, Ukcheol
    Kweon, In So
    Yoon, Kuk-Jin
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 14871 - 14880
  • [39] Bi-directional attention based RGB-D fusion for category-level object pose and shape estimation
    Kaifeng Tang
    Chi Xu
    Ming Chen
    Multimedia Tools and Applications, 2024, 83 : 53043 - 53063
  • [40] Category-Level Adversaries for Semantic Domain Adaptation
    Ruan, Congcong
    Wang, Wei
    Hu, Haifeng
    Chen, Dihu
    IEEE ACCESS, 2019, 7 : 83198 - 83208