KPA-Tracker: Towards Robust and Real-Time Category-Level Articulated Object 6D Pose Tracking

被引：0

作者：

Liu, Liu ^{[1
]}

Huang, Anran ^{[1
]}

Wu, Qi ^{[2
]}

Guo, Dan ^{[1
]}

Yang, Xun ^{[3
]}

Wang, Meng ^{[1
]}

机构：

[1] Hefei Univ Technol, Hefei, Peoples R China

[2] Shanghai Jiao Tong Univ, Shanghai, Peoples R China

[3] Univ Sci & Technol China, Hefei, Peoples R China

来源：

THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 4 | 2024年

基金：

国家重点研发计划; 中国国家自然科学基金;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Our life is populated with articulated objects. Current category-level articulation estimation works largely focus on predicting part-level 6D poses on static point cloud observations. In this paper, we tackle the problem of category-level online robust and real-time 6D pose tracking of articulated objects, where we propose KPA-Tracker, a novel 3D KeyPoint based Articulated object pose Tracker. Given an RGB-D image or a partial point cloud at the current frame as well as the estimated per-part 6D poses from the last frame, our KPA-Tracker can effectively update the poses with learned 3D keypoints between the adjacent frames. Specifically, we first canonicalize the input point cloud and formulate the pose tracking as an inter-frame pose increment estimation task. To learn consistent and separate 3D keypoints for every rigid part, we build KPA-Gen that outputs the high-quality ordered 3D keypoints in an unsupervised manner. During pose tracking on the whole video, we further propose a keypoint-based articulation tracking algorithm that mines keyframes as reference for accurate pose updating. We pro-vide extensive experiments on validating our KPA-Tracker on various datasets ranging from synthetic point cloud observation to real-world scenarios, which demonstrates the superior performance and robustness of the KPA-Tracker. We believe that our work has the potential to be applied in many fields including robotics, embodied intelligence and augmented reality. All the datasets and codes are available at https://github.com/hhhhhar/KPA-Tracker.

引用

页码：3684 / 3692

页数：9

共 50 条

[31] SAR-Net: Shape Alignment and Recovery Network for Category-level 6D Object Pose and Size Estimation
Lin, Haitao
Liu, Zichang
Cheang, Chilam
Fu, Yanwei
Guo, Guodong
Xue, Xiangyang
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 6697 - 6707
[32] ROFT: Real-Time Optical Flow-Aided 6D Object Pose and Velocity Tracking
Piga, Nicola A.
Onyshchuk, Yuriy
Pasquale, Giulia
Pattacini, Ugo
Natale, Lorenzo
IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (01) : 159 - 166
[33] Category-Level 6D Object Pose Estimation in the Wild: A Semi-Supervised Learning Approach and A New Dataset
Fu, Yang
Wang, Xiaolong
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[34] Instance-Adaptive and Geometric-Aware Keypoint Learning for Category-Level 6D Object Pose Estimation
Lin, Xiao
Yang, Wenfei
Gao, Yuan
Zhan, Tianzhu
2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 21040 - 21049
[35] Real-Time Seamless Single Shot 6D Object Pose Prediction
Tekin, Bugra
Sinha, Sudipta N.
Fua, Pascal
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 292 - 301
[36] CPPF: Towards Robust Category-Level 9D Pose Estimation in the Wild
You, Yang
Shi, Ruoxi
Wang, Weiming
Lu, Cewu
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 6856 - 6865
[37] Kinematic sets for real-time robust articulated object tracking
Comport, Andrew I.
Marchand, Eric
Chaumette, Francois
IMAGE AND VISION COMPUTING, 2007, 25 (03) : 374 - 391
[38] 6D-ViT: Category-Level 6D Object Pose Estimation via Transformer-Based Instance Representation Learning
Zou, Lu
Huang, Zhangjin
Gu, Naijie
Wang, Guoping
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 6907 - 6921
[39] VoCAPTER: Voting-based Pose Tracking for Category-level Articulated Object via Inter-frame Priors
Zhang, Li
Han, Zean
Zhong, Yan
Yu, Qiaojun
Wu, Xingyu
Wang, Xue
Wang, Rujing
MM 2024 - Proceedings of the 32nd ACM International Conference on Multimedia, : 8942 - 8951
[40] Category-Level 6D Object Pose and Size Estimation Using Self-supervised Deep Prior Deformation Networks
Lin, Jiehong
Wei, Zewei
Ding, Changxing
Jia, Kui
COMPUTER VISION, ECCV 2022, PT IX, 2022, 13669 : 19 - 34

← 1 2 3 4 5 →