Skeleton Cluster Tracking for robust multi-view multi-person 3D human pose estimation

被引:3
|
作者
Niu, Zehai [1 ]
Lu, Ke [1 ,2 ]
Xue, Jian [1 ]
Wang, Jinbao [3 ,4 ]
机构
[1] Univ Chinese Acad Sci, Sch Engn Sci, 19A Yuquan Rd, Beijing 100049, Peoples R China
[2] Peng Cheng Lab, Vanke Cloud City Phase I Bldg 8,Xili St, Shenzhen 518055, Guangdong, Peoples R China
[3] Shenzhen Univ, Natl Engn Lab Big Data Syst Comp Technol, Shenzhen 518060, Peoples R China
[4] Guangdong Prov Key Lab Intelligent Informat Proc, Shenzhen 518060, Peoples R China
关键词
3D human pose estimation; Motion capture; Deep learning;
D O I
10.1016/j.cviu.2024.104059
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The multi -view 3D human pose estimation task relies on 2D human pose estimation for each view; however, severe occlusion, truncation, and human interaction lead to incorrect 2D human pose estimation for some views. The traditional "Matching-Lifting-Tracking"paradigm amplifies the incorrect 2D human pose into an incorrect 3D human pose, which significantly challenges the robustness of multi -view 3D human pose estimation. In this paper, we propose a novel method that tackles the inherent difficulties of the traditional paradigm. This method is rooted in the newly devised "Skeleton Pooling -Clustering -Tracking (SPCT)"paradigm. It initiates a 2D human pose estimation for each perspective. Then a symmetrical dilated network is created for skeleton pool estimation. Upon clustering the skeleton pool, we introduce and implement an innovative tracking method that is explicitly designed for the SPCT paradigm. The tracking method refines and filters the skeleton clusters, thereby enhancing the robustness of the multi -person 3D human pose estimation results. By coupling the skeleton pool with the tracking refinement process, our method obtains high -quality multi -person 3D human pose estimation results despite severe occlusions that produce erroneous 2D and 3D estimates. By employing the proposed SPCT paradigm and a computationally efficient network architecture, our method outperformed existing approaches regarding robustness on the Shelf, 4D Association, and CMU Panoptic datasets, and could be applied in practical scenarios such as markerless motion capture and animation production.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] PoseTrack: Joint Multi-Person Pose Estimation and Tracking
    Iqbal, Umar
    Milan, Anton
    Gall, Juergen
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 4654 - 4663
  • [42] Multi-person 3D Pose Estimation from Monocular Image Sequences
    Li, Ran
    Xu, Nayun
    Lu, Xutong
    Xing, Yucheng
    Zhao, Haohua
    Niu, Li
    Zhang, Liqing
    NEURAL INFORMATION PROCESSING (ICONIP 2019), PT II, 2019, 11954 : 15 - 24
  • [43] 3D Human Pose Estimation from Deep Multi-View 2D Pose
    Schwarcz, Steven
    Pollard, Thomas
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 2326 - 2331
  • [44] A practical framework of multi-person 3D human pose estimation with a single RGB camera
    Ma, Le
    Lian, Sen
    Wang, Shandong
    Meng, Weiliang
    Xiao, Jun
    Zhang, Xiaopeng
    2021 IEEE CONFERENCE ON VIRTUAL REALITY AND 3D USER INTERFACES ABSTRACTS AND WORKSHOPS (VRW 2021), 2021, : 420 - 421
  • [45] Towards Robust and Smooth 3D Multi-Person Pose Estimation from Monocular Videos in the Wild
    Park, Sungchan
    You, Eunyi
    Lee, Inhoe
    Lee, Joonseok
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 14726 - 14736
  • [46] Efficient Multi-Person Hierarchical 3D Pose Estimation for Autonomous Driving
    Gu, Renshu
    Wang, Gaoang
    Hwang, Jenq-Neng
    2019 2ND IEEE CONFERENCE ON MULTIMEDIA INFORMATION PROCESSING AND RETRIEVAL (MIPR 2019), 2019, : 163 - 168
  • [47] Mutual Adaptive Reasoning for Monocular 3D Multi-Person Pose Estimation
    Zhang, Juze
    Wang, Jingya
    Shi, Ye
    Gao, Fei
    Xu, Lan
    Yu, Jingyi
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 1788 - 1796
  • [48] Multi-Person Absolute 3D Pose and Shape Estimation from Video
    Zhang, Kaifu
    Li, Yihui
    Guan, Yisheng
    Xi, Ning
    INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2021, PT III, 2021, 13015 : 189 - 200
  • [49] Practical 3D human skeleton tracking based on multi-view and multi-Kinect fusion
    Manh-Hung Nguyen
    Ching-Chun Hsiao
    Wen-Huang Cheng
    Ching-Chun Huang
    Multimedia Systems, 2022, 28 : 529 - 552
  • [50] Practical 3D human skeleton tracking based on multi-view and multi-Kinect fusion
    Nguyen, Manh-Hung
    Hsiao, Ching-Chun
    Cheng, Wen-Huang
    Huang, Ching-Chun
    MULTIMEDIA SYSTEMS, 2022, 28 (02) : 529 - 552