Skeleton Cluster Tracking for robust multi-view multi-person 3D human pose estimation

被引:3
|
作者
Niu, Zehai [1 ]
Lu, Ke [1 ,2 ]
Xue, Jian [1 ]
Wang, Jinbao [3 ,4 ]
机构
[1] Univ Chinese Acad Sci, Sch Engn Sci, 19A Yuquan Rd, Beijing 100049, Peoples R China
[2] Peng Cheng Lab, Vanke Cloud City Phase I Bldg 8,Xili St, Shenzhen 518055, Guangdong, Peoples R China
[3] Shenzhen Univ, Natl Engn Lab Big Data Syst Comp Technol, Shenzhen 518060, Peoples R China
[4] Guangdong Prov Key Lab Intelligent Informat Proc, Shenzhen 518060, Peoples R China
关键词
3D human pose estimation; Motion capture; Deep learning;
D O I
10.1016/j.cviu.2024.104059
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The multi -view 3D human pose estimation task relies on 2D human pose estimation for each view; however, severe occlusion, truncation, and human interaction lead to incorrect 2D human pose estimation for some views. The traditional "Matching-Lifting-Tracking"paradigm amplifies the incorrect 2D human pose into an incorrect 3D human pose, which significantly challenges the robustness of multi -view 3D human pose estimation. In this paper, we propose a novel method that tackles the inherent difficulties of the traditional paradigm. This method is rooted in the newly devised "Skeleton Pooling -Clustering -Tracking (SPCT)"paradigm. It initiates a 2D human pose estimation for each perspective. Then a symmetrical dilated network is created for skeleton pool estimation. Upon clustering the skeleton pool, we introduce and implement an innovative tracking method that is explicitly designed for the SPCT paradigm. The tracking method refines and filters the skeleton clusters, thereby enhancing the robustness of the multi -person 3D human pose estimation results. By coupling the skeleton pool with the tracking refinement process, our method obtains high -quality multi -person 3D human pose estimation results despite severe occlusions that produce erroneous 2D and 3D estimates. By employing the proposed SPCT paradigm and a computationally efficient network architecture, our method outperformed existing approaches regarding robustness on the Shelf, 4D Association, and CMU Panoptic datasets, and could be applied in practical scenarios such as markerless motion capture and animation production.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Multi-Person Hierarchical 3D Pose Estimation in Natural Videos
    Gu, Renshu
    Wang, Gaoang
    Jiang, Zhongyu
    Hwang, Jenq-Neng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (11) : 4245 - 4257
  • [32] Dynamic Graph Reasoning for Multi-person 3D Pose Estimation
    Qiu, Zhongwei
    Yang, Qiansheng
    Wang, Jian
    Fu, Dongmei
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 3521 - 3529
  • [33] Multi-person 3D pose estimation from unlabelled data
    Daniel Rodriguez-Criado
    Pilar Bachiller-Burgos
    George Vogiatzis
    Luis J. Manso
    Machine Vision and Applications, 2024, 35
  • [34] Multi-person 3D pose estimation from unlabelled data
    Rodriguez-Criado, Daniel
    Bachiller-Burgos, Pilar
    Vogiatzis, George
    Manso, Luis J.
    MACHINE VISION AND APPLICATIONS, 2024, 35 (03)
  • [35] Multi-person Absolute 3D Human Pose Estimation with Weak Depth Supervision
    Veges, Marton
    Lorincz, Andras
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2020, PT I, 2020, 12396 : 258 - 270
  • [36] Compressed Volumetric Heatmaps for Multi-Person 3D Pose Estimation
    Fabbri, Matteo
    Lanzi, Fabio
    Calderara, Simone
    Alletto, Stefano
    Cucchiara, Rita
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 7202 - 7211
  • [37] DEEP, ROBUST AND SINGLE SHOT 3D MULTI-PERSON HUMAN POSE ESTIMATION FROM MONOCULAR IMAGES
    Benzine, Abdallah
    Luvison, Bertrand
    Quoc Cuong Pham
    Achard, Catherine
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 584 - 588
  • [38] Multi-Domain Pose Network for Multi-Person Pose Estimation and Tracking
    Guo, Hengkai
    Tang, Tang
    Luo, Guozhong
    Chen, Riwei
    Lu, Yongchen
    Wen, Linfu
    COMPUTER VISION - ECCV 2018 WORKSHOPS, PT II, 2019, 11130 : 209 - 216
  • [39] JOINT MULTI-VIEW PEOPLE TRACKING AND POSE ESTIMATION FOR 3D SCENE RECONSTRUCTION
    Tang, Zheng
    Gu, Renshu
    Hwang, Jenq-Neng
    2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2018,
  • [40] Probabilistic Triangulation for Uncalibrated Multi-View 3D Human Pose Estimation
    Jiang, Boyuan
    Hu, Lei
    Xia, Shihong
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 14804 - 14814