Single-Stage is Enough: Multi-Person Absolute 3D Pose Estimation

Cited by: 20
Authors
Jin, Lei [1 ]
Xu, Chenyang [1 ]
Wang, Xiaojuan [1 ]
Xiao, Yabo [1 ]
Guo, Yandong [2 ]
Nie, Xuecheng [3 ]
Zhao, Jian [4 ]
Affiliations
[1] Beijing Univ Posts & Telecommun, Beijing, Peoples R China
[2] OPPO Res Inst, Hyderabad, Telangana, India
[3] Natl Univ Singapore, Singapore, Singapore
[4] Inst North Elect Equipment, Bengaluru, Karnataka, India
DOI
10.1109/CVPR52688.2022.01274
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Existing multi-person absolute 3D pose estimation methods are mainly based on a two-stage paradigm, i.e., top-down or bottom-up, which leads to redundant pipelines with high computation cost. We argue that it is more desirable to simplify this two-stage paradigm into a single-stage one to improve both efficiency and performance. To this end, we present an efficient single-stage solution, the Decoupled Regression Model (DRM), with three distinct novelties. First, DRM introduces a new decoupled representation for 3D pose, which expresses the 2D pose in the image plane and the depth information of each 3D human instance via a 2D center point (the center of visible keypoints) and a root point (the pelvis), respectively. Second, to learn a better feature representation for human depth regression, DRM introduces a 2D Pose-guided Depth Query Module (PDQM) to extract features from the 2D pose regression branch, enabling the depth regression branch to perceive the scale information of instances. Third, DRM leverages a Decoupled Absolute Pose Loss (DAPL) to facilitate absolute root depth and root-relative depth estimation, thus improving the accuracy of the absolute 3D pose. Comprehensive experiments on challenging benchmarks, including MuPoTS-3D and Panoptic, verify the superiority of our framework, which outperforms the state-of-the-art bottom-up absolute 3D pose estimation methods.
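The abstract describes the absolute 3D pose as a combination of a 2D pose in the image plane, an absolute root (pelvis) depth, and root-relative joint depths. The sketch below illustrates how such decoupled quantities could be combined into camera-space coordinates. It assumes a standard pinhole camera model with known intrinsics, which the abstract does not specify; the function name `recover_absolute_pose` and all parameter names are hypothetical and are not taken from the paper.

```python
import numpy as np

def recover_absolute_pose(kpts_2d, root_depth, rel_depth, fx, fy, cx, cy):
    """Back-project 2D keypoints into camera-space 3D points.

    Assumption (not from the abstract): a pinhole camera with focal
    lengths (fx, fy) and principal point (cx, cy), all in pixels.

    kpts_2d    : (J, 2) pixel coordinates of the joints
    root_depth : absolute depth of the root (pelvis) joint
    rel_depth  : (J,) depth of each joint relative to the root
    Returns a (J, 3) array of (x, y, z) in the same unit as the depths.
    """
    kpts_2d = np.asarray(kpts_2d, dtype=float)
    # Absolute depth of every joint = root depth + root-relative offset.
    z = root_depth + np.asarray(rel_depth, dtype=float)
    # Invert the pinhole projection u = fx * x / z + cx (same for v).
    x = (kpts_2d[:, 0] - cx) * z / fx
    y = (kpts_2d[:, 1] - cy) * z / fy
    return np.stack([x, y, z], axis=1)

# Hypothetical two-joint example (depths in millimetres).
pose_3d = recover_absolute_pose(
    kpts_2d=[[500.0, 500.0], [600.0, 500.0]],
    root_depth=2000.0,
    rel_depth=[0.0, 100.0],
    fx=1000.0, fy=1000.0, cx=500.0, cy=500.0,
)
```

Because x and y scale linearly with z in this model, an error in the absolute root depth shifts every joint of the person, which is consistent with the abstract treating root depth and root-relative depth as separate estimation targets.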
Pages: 13076-13085
Page count: 10
Related papers
50 in total
  • [41] Depth Decoupling for Bottom-Up Multi-Person 3D Pose Estimation
    Lie, Zhaokun
    Liu, Qiong
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT XI, 2025, 15041 : 412 - 428
  • [42] Unsupervised universal hierarchical multi-person 3D pose estimation for natural scenes
    Renshu Gu
    Zhongyu Jiang
    Gaoang Wang
    Kevin McQuade
    Jenq-Neng Hwang
    Multimedia Tools and Applications, 2022, 81 : 32883 - 32906
  • [43] Exploring Severe Occlusion: Multi-Person 3D Pose Estimation with Gated Convolution
    Gu, Renshu
    Wang, Gaoang
    Hwang, Jenq-Neng
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 8243 - 8250
  • [44] MMDA: Multi-person marginal distribution awareness for monocular 3D pose estimation
    Liu, Sheng
    Shuai, Jianghai
    Li, Yang
    Du, Sidan
    IET IMAGE PROCESSING, 2023, 17 (07) : 2182 - 2191
  • [45] Multi-Person 3D Pose and Shape Estimation via Inverse Kinematics and Refinement
    Cha, Junuk
    Saqlain, Muhammad
    Kim, GeonU
    Shin, Mingyu
    Baek, Seungryul
    COMPUTER VISION - ECCV 2022, PT V, 2022, 13665 : 660 - 677
  • [46] Unsupervised universal hierarchical multi-person 3D pose estimation for natural scenes
    Gu, Renshu
    Jiang, Zhongyu
    Wang, Gaoang
    McQuade, Kevin
    Hwang, Jenq-Neng
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (23) : 32883 - 32906
  • [47] DEEP, ROBUST AND SINGLE SHOT 3D MULTI-PERSON HUMAN POSE ESTIMATION FROM MONOCULAR IMAGES
    Benzine, Abdallah
    Luvison, Bertrand
    Quoc Cuong Pham
    Achard, Catherine
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 584 - 588
  • [48] Enhanced Two-Stage Multi-person Pose Estimation
    Honda, Hiroto
    Kato, Tomohiro
    Uchida, Yusuke
    COMPUTER VISION - ECCV 2018 WORKSHOPS, PT II, 2019, 11130 : 217 - 220
  • [49] Unsupervised Multi-view Multi-person 3D Pose Estimation Using Reprojection Error
    de Franca Silva, Diogenes Wallis
    Do Monte Lima, Joao Paulo Silva
    Macedo, David
    Zanchettin, Cleber
    Thomas, Diego Gabriel Francis
    Uchiyama, Hideaki
    Teichrieb, Veronica
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT III, 2022, 13531 : 482 - 494
  • [50] Dual Networks Based 3D Multi-Person Pose Estimation From Monocular Video
    Cheng, Yu
    Wang, Bo
    Tan, Robby T. T.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (02) : 1636 - 1651