Single-Stage is Enough: Multi-Person Absolute 3D Pose Estimation

被引:20
|
作者
Jin, Lei [1 ]
Xu, Chenyang [1 ]
Wang, Xiaojuan [1 ]
Xiao, Yabo [1 ]
Guo, Yandong [2 ]
Nie, Xuecheng [3 ]
Zhao, Jian [4 ]
机构
[1] Beijing Univ Posts & Telecommun, Beijing, Peoples R China
[2] OPPO Res Inst, Hyderabad, Telangana, India
[3] Natl Univ Singapore, Singapore, Singapore
[4] Inst North Elect Equipment, Bengaluru, Karnataka, India
关键词
D O I
10.1109/CVPR52688.2022.01274
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The existing multi-person absolute 3D pose estimation methods are mainly based on two-stage paradigm, i.e., top-down or bottom-up, leading to redundant pipelines with high computation cost. We argue that it is more desirable to simplify such two-stage paradigm to a single-stage one to promote both efficiency and performance. To this end, we present an efficient single-stage solution, Decoupled Regression Model (DRM), with three distinct novelties. First, DRM introduces a new decoupled representation for 3D pose, which expresses the 2D pose in image plane and depth information of each 3D human instance via 2D center point (center of visible keypoints) and root point (denoted as pelvis), respectively. Second, to learn better feature representation for the human depth regression, DRM introduces a 2D Pose-guided Depth Query Module (PDQM) to extract the features in 2D pose regression branch, enabling the depth regression branch to perceive the scale information of instances. Third, DRM leverages a Decoupled Absolute Pose Loss (DAPL) to facilitate the absolute root depth and root-relative depth estimation, thus improving the accuracy of absolute 3D pose. Comprehensive experiments on challenging benchmarks including MuPoTS-3D and Panoptic clearly verify the superiority of our framework, which outperforms the state-of-the-art bottom-up absolute 3D pose estimation methods.
引用
收藏
页码:13076 / 13085
页数:10
相关论文
共 50 条
  • [21] Single-Shot Multi-Person 3D Pose Estimation From Monocular RGB
    Mehta, Dushyant
    Sotnychenko, Oleksandr
    Mueller, Franziska
    Xu, Weipeng
    Sridhar, Srinath
    Pons-Moll, Gerard
    Theobalt, Christian
    2018 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2018, : 120 - 130
  • [22] Multi-person 3D pose estimation from a single image captured by a fisheye camera
    Zhang, Yahui
    You, Shaodi
    Karaoglu, Sezer
    Gevers, Theo
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2022, 222
  • [23] Multi-Person Pose Regression With Distribution-Aware Single-Stage Models
    Zhu, Leyan
    Wang, Zitian
    Liu, Si
    Nie, Xuecheng
    Liu, Luoqi
    Li, Bo
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (08) : 5384 - 5397
  • [24] Multi-person 3D Pose Estimation from Monocular Image Sequences
    Li, Ran
    Xu, Nayun
    Lu, Xutong
    Xing, Yucheng
    Zhao, Haohua
    Niu, Li
    Zhang, Liqing
    NEURAL INFORMATION PROCESSING (ICONIP 2019), PT II, 2019, 11954 : 15 - 24
  • [25] Efficient Multi-Person Hierarchical 3D Pose Estimation for Autonomous Driving
    Gu, Renshu
    Wang, Gaoang
    Hwang, Jenq-Neng
    2019 2ND IEEE CONFERENCE ON MULTIMEDIA INFORMATION PROCESSING AND RETRIEVAL (MIPR 2019), 2019, : 163 - 168
  • [26] Mutual Adaptive Reasoning for Monocular 3D Multi-Person Pose Estimation
    Zhang, Juze
    Wang, Jingya
    Shi, Ye
    Gao, Fei
    Xu, Lan
    Yu, Jingyi
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 1788 - 1796
  • [27] Multi-Person 3D Human Pose Estimation from Monocular Images
    Dabral, Rishabh
    Gundavarapu, Nitesh B.
    Mitra, Rahul
    Sharma, Abhishek
    Ramakrishnan, Ganesh
    Jain, Arjun
    2019 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2019), 2019, : 405 - 414
  • [28] Center point to pose: Multiple views 3D human pose estimation for multi-person
    Liu, Huan
    Wu, Jian
    He, Rui
    PLOS ONE, 2022, 17 (09):
  • [29] VoxelTrack: Multi-Person 3D Human Pose Estimation and Tracking in the Wild
    Zhang, Yifu
    Wang, Chunyu
    Wang, Xinggang
    Liu, Wenyu
    Zeng, Wenjun
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (02) : 2613 - 2626
  • [30] Top-Down System for Multi-Person 3D Absolute Pose Estimation from Monocular Videos
    El Kaid, Amal
    Brazey, Denis
    Barra, Vincent
    Baina, Karim
    SENSORS, 2022, 22 (11)