KPA-Tracker: Towards Robust and Real-Time Category-Level Articulated Object 6D Pose Tracking

被引:0
|
作者
Liu, Liu [1 ]
Huang, Anran [1 ]
Wu, Qi [2 ]
Guo, Dan [1 ]
Yang, Xun [3 ]
Wang, Meng [1 ]
机构
[1] Hefei Univ Technol, Hefei, Peoples R China
[2] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[3] Univ Sci & Technol China, Hefei, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Our life is populated with articulated objects. Current category-level articulation estimation works largely focus on predicting part-level 6D poses on static point cloud observations. In this paper, we tackle the problem of category-level online robust and real-time 6D pose tracking of articulated objects, where we propose KPA-Tracker, a novel 3D KeyPoint based Articulated object pose Tracker. Given an RGB-D image or a partial point cloud at the current frame as well as the estimated per-part 6D poses from the last frame, our KPA-Tracker can effectively update the poses with learned 3D keypoints between the adjacent frames. Specifically, we first canonicalize the input point cloud and formulate the pose tracking as an inter-frame pose increment estimation task. To learn consistent and separate 3D keypoints for every rigid part, we build KPA-Gen that outputs the high-quality ordered 3D keypoints in an unsupervised manner. During pose tracking on the whole video, we further propose a keypoint-based articulation tracking algorithm that mines keyframes as reference for accurate pose updating. We pro-vide extensive experiments on validating our KPA-Tracker on various datasets ranging from synthetic point cloud observation to real-world scenarios, which demonstrates the superior performance and robustness of the KPA-Tracker. We believe that our work has the potential to be applied in many fields including robotics, embodied intelligence and augmented reality. All the datasets and codes are available at https://github.com/hhhhhar/KPA-Tracker.
引用
收藏
页码:3684 / 3692
页数:9
相关论文
共 50 条
  • [31] SAR-Net: Shape Alignment and Recovery Network for Category-level 6D Object Pose and Size Estimation
    Lin, Haitao
    Liu, Zichang
    Cheang, Chilam
    Fu, Yanwei
    Guo, Guodong
    Xue, Xiangyang
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 6697 - 6707
  • [32] ROFT: Real-Time Optical Flow-Aided 6D Object Pose and Velocity Tracking
    Piga, Nicola A.
    Onyshchuk, Yuriy
    Pasquale, Giulia
    Pattacini, Ugo
    Natale, Lorenzo
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (01) : 159 - 166
  • [33] Category-Level 6D Object Pose Estimation in the Wild: A Semi-Supervised Learning Approach and A New Dataset
    Fu, Yang
    Wang, Xiaolong
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [34] Instance-Adaptive and Geometric-Aware Keypoint Learning for Category-Level 6D Object Pose Estimation
    Lin, Xiao
    Yang, Wenfei
    Gao, Yuan
    Zhan, Tianzhu
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 21040 - 21049
  • [35] Real-Time Seamless Single Shot 6D Object Pose Prediction
    Tekin, Bugra
    Sinha, Sudipta N.
    Fua, Pascal
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 292 - 301
  • [36] CPPF: Towards Robust Category-Level 9D Pose Estimation in the Wild
    You, Yang
    Shi, Ruoxi
    Wang, Weiming
    Lu, Cewu
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 6856 - 6865
  • [37] Kinematic sets for real-time robust articulated object tracking
    Comport, Andrew I.
    Marchand, Eric
    Chaumette, Francois
    IMAGE AND VISION COMPUTING, 2007, 25 (03) : 374 - 391
  • [38] 6D-ViT: Category-Level 6D Object Pose Estimation via Transformer-Based Instance Representation Learning
    Zou, Lu
    Huang, Zhangjin
    Gu, Naijie
    Wang, Guoping
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 6907 - 6921
  • [39] VoCAPTER: Voting-based Pose Tracking for Category-level Articulated Object via Inter-frame Priors
    Zhang, Li
    Han, Zean
    Zhong, Yan
    Yu, Qiaojun
    Wu, Xingyu
    Wang, Xue
    Wang, Rujing
    MM 2024 - Proceedings of the 32nd ACM International Conference on Multimedia, : 8942 - 8951
  • [40] Category-Level 6D Object Pose and Size Estimation Using Self-supervised Deep Prior Deformation Networks
    Lin, Jiehong
    Wei, Zewei
    Ding, Changxing
    Jia, Kui
    COMPUTER VISION, ECCV 2022, PT IX, 2022, 13669 : 19 - 34