KPA-Tracker: Towards Robust and Real-Time Category-Level Articulated Object 6D Pose Tracking

被引：0

作者：

Liu, Liu ^{[1
]}

Huang, Anran ^{[1
]}

Wu, Qi ^{[2
]}

Guo, Dan ^{[1
]}

Yang, Xun ^{[3
]}

Wang, Meng ^{[1
]}

机构：

[1] Hefei Univ Technol, Hefei, Peoples R China

[2] Shanghai Jiao Tong Univ, Shanghai, Peoples R China

[3] Univ Sci & Technol China, Hefei, Peoples R China

来源：

THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 4 | 2024年

基金：

国家重点研发计划; 中国国家自然科学基金;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Our life is populated with articulated objects. Current category-level articulation estimation works largely focus on predicting part-level 6D poses on static point cloud observations. In this paper, we tackle the problem of category-level online robust and real-time 6D pose tracking of articulated objects, where we propose KPA-Tracker, a novel 3D KeyPoint based Articulated object pose Tracker. Given an RGB-D image or a partial point cloud at the current frame as well as the estimated per-part 6D poses from the last frame, our KPA-Tracker can effectively update the poses with learned 3D keypoints between the adjacent frames. Specifically, we first canonicalize the input point cloud and formulate the pose tracking as an inter-frame pose increment estimation task. To learn consistent and separate 3D keypoints for every rigid part, we build KPA-Gen that outputs the high-quality ordered 3D keypoints in an unsupervised manner. During pose tracking on the whole video, we further propose a keypoint-based articulation tracking algorithm that mines keyframes as reference for accurate pose updating. We pro-vide extensive experiments on validating our KPA-Tracker on various datasets ranging from synthetic point cloud observation to real-world scenarios, which demonstrates the superior performance and robustness of the KPA-Tracker. We believe that our work has the potential to be applied in many fields including robotics, embodied intelligence and augmented reality. All the datasets and codes are available at https://github.com/hhhhhar/KPA-Tracker.

引用

页码：3684 / 3692

页数：9

共 50 条

[1] Category-Level 6D Object Pose Recovery in Depth Images
Sahin, Caner
Kim, Tae-Kyun
COMPUTER VISION - ECCV 2018 WORKSHOPS, PT I, 2019, 11129 : 665 - 681
[2] An efficient network for category-level 6D object pose estimation
Sun, Shantong
Liu, Rongke
Sun, Shuqiao
Yang, Xinxin
Lu, Guangshan
SIGNAL IMAGE AND VIDEO PROCESSING, 2021, 15 (07) : 1643 - 1651
[3] CatFormer: Category-Level 6D Object Pose Estimation with Transformer
Yu, Sheng
Zhai, Di-Hua
Xia, Yuanqing
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 7, 2024, : 6808 - 6816
[4] RANSAC Optimization for Category-level 6D Object Pose Estimation
Chen, Ying
Kang, Guixia
Wang, Yiping
2020 5TH INTERNATIONAL CONFERENCE ON MECHANICAL, CONTROL AND COMPUTER ENGINEERING (ICMCCE 2020), 2020, : 50 - 56
[5] An efficient network for category-level 6D object pose estimation
Shantong Sun
Rongke Liu
Shuqiao Sun
Xinxin Yang
Guangshan Lu
Signal, Image and Video Processing, 2021, 15 : 1643 - 1651
[6] Normalized Object Coordinate Space for Category-Level 6D Object Pose and Size Estimation
Wang, He
Sridhar, Srinath
Huang, Jingwei
Valentin, Julien
Song, Shuran
Guibas, Leonidas J.
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 2637 - 2646
[7] Category-Level 6D Object Pose Estimation With Structure Encoder and Reasoning Attention
Liu, Jierui
Cao, Zhiqiang
Tang, Yingbo
Liu, Xilong
Tan, Min
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (10) : 6728 - 6740
[8] SD-Pose: Structural Discrepancy Aware Category-Level 6D Object Pose Estimation
Li, Guowei
Zhu, Dongchen
Zhang, Guanghui
Shi, Wenjun
Zhang, Tianyu
Zhang, Xiaolin
Li, Jiamao
2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 5674 - 5683
[9] CatTrack: Single-Stage Category-Level 6D Object Pose Tracking via Convolution and Vision Transformer
Yu, Sheng
Zhai, Di-Hua
Xia, Yuanqing
Li, Dong
Zhao, Shiqi
IEEE Transactions on Multimedia, 2024, 26 : 1665 - 1680
[10] GSNet: Model Reconstruction Network for Category-level 6D Object Pose and Size Estimation
Liu, Penglei
Zhang, Qieshi
Cheng, Jun
2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 2898 - 2904

← 1 2 3 4 5 →